The Models Are Smart Enough. Your Enterprise Isn't Talking to Them.

Enterprise Technology Analysis · shashi.co

Data + AI Summit 2026, San Francisco · Day One

By Shashi Bellamkonda · June 16, 2026

~50%

Accuracy: best coding agents on real enterprise data questions (Databricks internal benchmark; 2026)

30%+

Accuracy improvement with Genie Ontology layer (Databricks; 2026)

80%+

Neon databases provisioned by AI agents, not humans (Databricks/Neon; 2025)

$1B

Neon acquisition price, serverless Postgres for agentic workloads (Reuters; 2025)

Key Takeaway

Databricks CEO Ali Ghodsi argued at Data + AI Summit that the frontier model race is effectively over as a differentiator. The real enterprise problem is context: agents lack accurate, live knowledge of your organization, and without it, model intelligence doesn't convert into reliable decisions. Genie Ontology is the company's answer, and the accuracy data behind it is what CIOs need to examine.

Most hands in a room of 31,000 stayed down when Databricks CEO Ali Ghodsi asked whether artificial general intelligence had already arrived. Then he read a graduate-level topology problem, confirmed that every leading frontier model solves it without hesitation, and made his point: the models are already past the threshold. What hasn't crossed it yet is the connection between those models and your organization's data, processes, and institutional knowledge. That gap, not model capability, is what is holding enterprise AI deployment back.

The announcements drew attention: Genie One as a universal data co-worker, the Raider real-time engine, Unity Artificial Intelligence Gateway for cost and governance control, and the acquisition of Panther Labs for agentic security operations, Databricks' third cybersecurity deal in just over a year. The company's acquisition of Neon , its serverless Postgres play announced in May 2025, provided the technical foundation Ghodsi drew on repeatedly when discussing agentic database provisioning.

"Raider" is also Databricks' internal shorthand for "Raiders of the Last Data Silos," a nod to Lakehouse Federation and the founders' apparent fondness for naming things that annoy them.

The context gap is the actual bottleneck

The assumption in most enterprise AI deployment planning is that model selection determines outcome quality. Pick the right model, tune it, deploy it. Ghodsi's argument, backed by internal benchmark data his team presented, cuts against that assumption directly.

When Databricks tested the best available coding agents against a set of real enterprise data questions, questions that actual employees submitted through Genie, the most accurate agents answered correctly roughly half the time. A coin flip. The questions weren't exotic. Examples given included: what percentage of customers who use one product also use a third-party service; what is engagement broken down by region this week. The kind of questions a CFO or a product lead needs answered before a board meeting.

The problem isn't reasoning. Frontier models can reason at graduate-level mathematics. The problem is that these agents have no reliable map of how your organization captures data, what your internal terminology means, which tables hold which numbers, or who the authoritative source on a given metric is. Faced with a question, they go searching live, consuming time and tokens, and they frequently surface stale documents or hallucinate figures when the search fails.

"The problem we have is that AI has not completely permeated our organization. Inside of your companies, you don't have hundreds of agents working for you, doing autonomous work, collaborating with each other. Most of us are just using chatbots."

Ali Ghodsi, CEO, Databricks · Data + AI Summit, June 16, 2026

What This Means for You

If your AI pilot is returning inconsistent answers to basic business questions, the model is not the problem. The absent layer is a semantic map of your own data. Before you run another proof of concept, audit whether your agents have any reliable way to know what your data means, who owns it, and which source is authoritative. Most organizations do not have that map. Databricks is betting that gap is where the next platform war gets fought.

Genie Ontology is the bet on a different architecture

The Databricks answer is Genie Ontology, a background knowledge graph construction system that runs continuously against an organization's data assets rather than at query time. The distinction matters. Current agent architectures search for context when a question arrives, which is why they are slow, expensive, and unreliable on proprietary enterprise questions. Genie Ontology indexes data pipelines, dashboards, Jira tickets, calendar data, SharePoint, Google Drive, and any connected source into a knowledge graph that agents can query in milliseconds.

The ranking algorithm the research team developed, described as functioning on organizational data the way Google's PageRank algorithm worked on the web, surfaces high-authority snippets based on who created an asset, how frequently it is accessed, and what domains a given employee has expertise in. A definition of engagement stored in a mobile key performance indicators dashboard, authored by a data scientist with demonstrated authority in that domain, gets weighted accordingly.

A demo using Databricks' own internal instance showed more than 4.5 million ontology snippets already indexed for a single company deployment. No live search agent could traverse that corpus at query time and return a trustworthy answer in seconds. The ontology makes the traversal unnecessary.

The accuracy improvement claimed is a consistent 30% lift over coding agents on the same questions, with the gap widening as the ontology is refined. That number, presented by Senior Director of Product Management Ken Wong during the Genie One session, deserves stress-testing in any proof of concept before budget commitments follow. The underlying mechanism is sound: an organization that encodes its semantic layer, business definitions, and institutional context into an automatically-maintained knowledge graph gives agents a reference that static prompting cannot replicate.

What This Means for You

A 30% accuracy lift sounds compelling, but the number only matters if your ontology is well-maintained. Genie Ontology is not a one-time configuration; it runs continuously against your data assets, which means its quality is a function of your data governance discipline, not Databricks' product. Organizations with inconsistent metadata, competing definitions of key metrics, or poorly tagged assets will see diminishing returns. The product is as good as the inputs you bring to it.

The Customer Evidence Behind the Accuracy Claim

The customer cases on stage were not chosen at random. PepsiCo's Global Chief Data and Artificial Intelligence Officer Magesh Bhagavathi described six years of consolidation from more than 60 data silos to roughly 8, with 90 percent of the company's data now on Databricks. Within the first weeks of deploying Genie, the company recorded 30,000 query engagements from procurement and indirect spend leaders who had previously depended on dashboards and reports. The direction of the shift, from static reports toward live conversational queries against governed data, is exactly what Ghodsi is predicting becomes standard.

Mastercard presented a different dimension of the same problem. Operating across more than 200 countries, processing over 150 billion transactions per year, the company is building a shared agent foundation on Lakebase, the company's term for serverless Postgres on top of the lakehouse. The requirement is simultaneous isolation and learning: each issuing bank's data must remain separated for regulatory and trust reasons, while the agents must accumulate insights across the full portfolio.

That tension, isolation at the data layer and shared learning at the intelligence layer, is one of the harder unsolved architectural problems in enterprise AI right now. The Mastercard team built a concept-to-demo MVP in weeks rather than months, which they attributed to governance and guardrails being baked into the architecture before the first agent was deployed.

What This Means for You

PepsiCo's path from 60-plus silos to 8 took six years. Mastercard's multi-country isolation requirement adds regulatory complexity most enterprise teams underestimate. Both cases represent organizations with dedicated data engineering capacity that spent years on the foundation before the agent layer became viable. If your data estate is still fragmented or your governance layer is immature, the timeline to meaningful Genie adoption is longer than a vendor demo suggests.

The agent ops problem most teams haven't priced in

One announcement that got less attention than it deserves is Genie Zero Ops, a background agent that monitors data pipelines and machine learning models, detects anomalies, traces root causes through data lineage, and proposes fixes, including by writing and testing code in a shadow branch of production data before presenting it for human approval.

The case rests on a known cost: data engineering teams report spending more than half their time on incident response and maintenance rather than building new pipelines. As coding agents write more of those pipelines faster, the surface area for failures grows in proportion. Zero Ops runs inside the data plane, where it has access to production data lineage, compute logs, and the Unity Catalog governance layer. That governance boundary is what makes autonomous write access to production feasible at all, because the operations agent cannot exceed permissions that a human engineer would not have.

The cost argument Ghodsi made for the full platform is worth taking seriously. As agentic workloads scale, token costs that seem manageable in a pilot can become prohibitive at production volume. He referenced an unnamed chief executive who depleted a full year's artificial intelligence budget ahead of schedule. Unity Artificial Intelligence Gateway, which provides a single control plane for model spend, identity management, and compliance across all agents and all model providers, is Databricks' answer to that problem, and it is priced as a platform play rather than a premium add-on.

What This Means for You

Two separate budget risks live in this section. The first is operational: as agents write more pipeline code faster, incident surface area grows proportionally, and your data engineering team's maintenance burden does not shrink automatically without something like Zero Ops to absorb it. The second is financial: token costs at pilot scale feel negligible, but the unnamed CEO who burned through a full year's AI budget early is not an edge case. Unity AI Gateway is worth evaluating not as a governance nicety but as a cost containment requirement before you authorize any production agentic workload.

Key Takeaway

Genie Zero Ops positions the data platform as the right governance boundary for autonomous agent operations, not a separate orchestration layer. Organizations evaluating where to give agents write access to operational systems need to answer that question before pilots scale.

What the portability argument still leaves open

Earlier this year, in coverage of the Delta Lake and Apache Iceberg format convergence announced ahead of this summit, I traced a portability thesis: the format question is now resolved, but governance portability and performance portability remain open problems. That characterization held at Monday's keynote.

Ryan Blue, creator of Apache Iceberg and Chief Executive Officer of Tabular, joined Ghodsi on stage to confirm that Iceberg V3 support is generally available, and that a unified metadata layer built into Delta 5 and Iceberg V4 is targeted for later this year. The format argument is closing. But Genie Ontology opens a new portability question: the knowledge graph. If an organization builds several years of indexed institutional context inside Databricks' Genie system, what does migration of that ontology look like? This was not addressed on stage, and it is the question procurement teams should be asking now.

The Raider real-time engine, announced as the technology powering the new Lakehouse Real Time warehouse, adds a third layer to the portability question. Reynold Xin, who co-founded Databricks, demonstrated benchmark results showing Raider sustaining 12,000 queries per second at a 37-millisecond P90 latency under load that caused competing serving engines to crash or spike. The engine uses machine learning trained on Databricks' own query traces to predict and implement the right algorithmic approach before executing, and that learned optimization accumulates only inside the Databricks environment, with no equivalent available if you move to another platform.

What This Means for You

Format portability between Delta and Iceberg is close to resolved, and that is the right problem to have solved first. But two new lock-in vectors appeared at this summit that procurement teams should flag now. Raider's learned query optimization accumulates only inside Databricks, so performance advantages compound in ways that are not portable. Genie Ontology's indexed institutional knowledge raises the same question at a higher cost: if you spend two years building your organization's semantic layer inside this system, what does the exit look like? Ask for a documented answer before the contract closes, not after the ontology has 10 million snippets.

CIO/CTO Viability Question

If Genie Ontology becomes the indexed institutional memory of your organization, ask Databricks today what the export format of that knowledge graph looks like and what a migration would cost. Format portability is nearly solved. Ontology portability has not been addressed, and the longer you run the system, the more expensive that question becomes to leave unanswered.

Sources

Ghodsi, Ali. Opening keynote and product announcements. Data + AI Summit, Databricks, 16 June 2026, San Francisco. databricks.com
Wong, Ken. "Genie One: AI Co-Worker for the Enterprise." Data + AI Summit, Databricks, 16 June 2026, San Francisco. databricks.com
Bhagavathi, Magesh. Customer session with Ali Ghodsi. Data + AI Summit, Databricks, 16 June 2026, San Francisco. pepsico.com
Databricks. "Databricks Agrees to Acquire Neon to Deliver Serverless Postgres for Developers + AI Agents." Press release, 14 May 2025. databricks.com
Databricks. "Databricks Agrees to Acquire Panther, Further Establishing the Security Lakehouse Category." Press release, 16 June 2026. databricks.com
Blue, Ryan, and Ali Ghodsi. "Delta Lake and Apache Iceberg Format Convergence." Data + AI Summit, Databricks, 16 June 2026, San Francisco. databricks.com

Disclaimer: This blog reflects my personal views only. Content does not represent the views of my employer, Info-Tech Research Group. AI tools may have been used for brevity, structure, or research support. Please independently verify any information before relying on it.

Independent Analysis: AI, SaaS, and Revenue Growth Strategy

The Models Are Smart Enough. Your Enterprise Isn't Talking to Them.

The context gap is the actual bottleneck

Genie Ontology is the bet on a different architecture

The Customer Evidence Behind the Accuracy Claim

The agent ops problem most teams haven't priced in

What the portability argument still leaves open

Get new posts by email: