
Enterprise AI has a knowledge downside. Regardless of billions in funding and more and more succesful language fashions, most organizations nonetheless can not reply primary analytical questions on doc storage. The perpetrator just isn’t mannequin high quality however structure: Conventional restoration augmentation technology (RAG) techniques have been designed to retrieve and summarize, not analyze and mixture throughout giant units of paperwork.
Snowflake is tackling this limitation head-on with a complete platform technique introduced at its BUILD 2025 convention. The corporate unveiled Snowflake Intelligence, an enterprise intelligence agent platform designed to unify structured and unstructured knowledge analytics, together with infrastructure enhancements that span knowledge integration and Openflowdatabase consolidation and On-line recreation Snowflakes Postgres with real-time evaluation and interactive tables. The purpose: Eradicate the info silos and architectural bottlenecks that forestall enterprises from working AI at scale.
A key innovation is Agentic Doc Analytics, a brand new functionality in Snowflake Intelligence that may analyze hundreds of paperwork concurrently. This strikes primary analysis enterprises similar to "What’s our password reset coverage?" in complicated analytical queries like "Present me the variety of weekly mentions by product space in my buyer assist tickets for the final six months."
Cap in RAG: Why samples fail for analytics
Conventional RAG techniques work by embedding paperwork into vector representations, storing them in a vector database and retrieving probably the most semantically comparable paperwork when a consumer asks a query.
"For RAG to work, it requires that every one the solutions you’re on the lookout for exist already ultimately revealed as we speak," Jeff Hollan, head of Cortex AI Brokers at Snowflake defined to VentureBeat throughout a press briefing. "The mannequin I consider with RAG is sort of a librarian, you get a query and it tells you, ‘This guide has the reply on this particular web page.’"
Nonetheless, this structure basically breaks down when organizations must carry out complete analytics. If, for instance, an enterprise has 100,000 stories and desires to determine all stories that discuss a selected enterprise entity and summarize all of the revenues mentioned in these stories, this can be a non-trivial job.
"That is one thing extra complicated than simply conventional RAG," Holland mentioned.
This limitation usually forces enterprises to take care of separate evaluation pipelines for structured knowledge in knowledge warehouses and unstructured knowledge in vector databases or doc shops. The result’s knowledge silos and governance challenges for enterprises.
How Agentic Doc Analytics works in a different way
Snowflake’s method unifies structured and unstructured knowledge evaluation in its platform by treating paperwork as queryable knowledge sources relatively than retrieval targets. The system makes use of AI to extract, construction and index doc content material in a manner that allows analytical operations similar to SQL throughout hundreds of paperwork.
The aptitude makes use of the prevailing Snowflake structure. Cortex AISQL handles doc parsing and extraction. Interactive tables and storage present sub-second search efficiency on large knowledge. By processing paperwork in the identical ruled knowledge platform with structured knowledge, enterprises can merge doc data with transactional knowledge, buyer information and different enterprise data.
"The worth of AI, the facility of AI, the productiveness and disruptive potential of AI, is created and enabled by connecting with company knowledge," says Christian Kleinerman, EVP of product at Snowflake.
The corporate’s structure retains all knowledge processing inside its safety boundaries, addressing governance considerations which have slowed enterprise AI adoption. The system works with paperwork from a number of sources. These embrace PDFs in SharePoint, Slack conversations, Microsoft Groups knowledge and Salesforce information via Snowflake’s zero-copy integration capabilities. This eliminates the necessity to extract and transfer knowledge to separate AI processing techniques.
Comparability with present market approaches
The announcement positions Snowflake in a different way from conventional knowledge storage distributors and AI-native startups.
Corporations like Databricks have targeted on bringing AI capabilities to lakehouses, however usually nonetheless depend on vector databases and conventional RAG fashions for unstructured knowledge. OpenAI’s Assistant API and Anthropic’s Claude each provide doc evaluation, however are restricted by the scale of their context window.
Vector database suppliers similar to Pinecone and Weaviate have constructed companies round RAG use circumstances however generally face challenges when clients want analytical queries relatively than retrieval-based ones. These techniques excel at discovering related paperwork however can not simply collect data throughout giant units of paperwork.
Among the many key high-value makes use of that have been beforehand troublesome with RAG-only architectures that Snowflow emphasizes for its method is buyer assist analytics. As a substitute of manually reviewing assist tickets, organizations can request patterns throughout hundreds of interactions. Questions like "What are the highest 10 product points talked about in assist tickets this season, damaged down by buyer phase?" be accountable in seconds.
What this implies for enterprise AI technique
For enterprises constructing AI methods, Agentic Doc Analytics represents a recreation changer "search and retrieve" The RAG paradigm in a "analysis and analyze" the extra acquainted paradigm of enterprise intelligence instruments.
Reasonably than deploying separate vector databases and RAG techniques for every use case, enterprises can consolidate doc analytics into their current knowledge platforms. This reduces infrastructure complexity whereas extending enterprise intelligence practices to unstructured knowledge.
The flexibility additionally democratizes entry. Querying doc analytics via pure language means the insights requested by knowledge science groups turn into accessible to enterprise customers.
For enterprises main in AI, the aggressive benefit comes not from having one of the best language mannequin, however from analyzing their very own unstructured knowledge at scale alongside structured enterprise knowledge. Organizations that may question total doc corpuses as simply as they question knowledge warehouses will discover information that rivals can not simply replicate.
"AI is a actuality as we speak," Kleinerman mentioned. "We’ve many organizations which are already discovering worth in AI, and whether or not somebody remains to be ready for it or sitting on the sidelines, our name to motion is to start out constructing now."

