Nebius is assembling an inference stack through acquisition that competitors cannot replicate quickly. Eigen AI works at the model level. Clarifai works at the system level. The unresolved question is what happens to the Clarifai government business that was deliberately left out of the deal.
Two deals in two weeks tell a cleaner story than either announcement does on its own. Nebius spent approximately $643 million to acquire Eigen AI on May 1, then followed on May 12 with an acqui-hire of Clarifai's core engineering team and a license of its inference and compute orchestration intellectual property. The press releases use different language, but the architecture is the same: Nebius is buying the layers of the inference stack it cannot build fast enough on its own.
I have a personal stake in this story. Alfredo Ramos, who served as Chief Product and Technology Officer at Clarifai, is a former colleague from my time at Network Solutions. Sajai Krishnan, who led go-to-market at Clarifai, is a childhood friend. Knowing both of them makes me more curious about the operational reality of this deal, not less skeptical of the strategic framing around it.
The Stack Nebius Is Building Has a Specific Shape
Inference optimization splits into two distinct problems. The first is model-level: compressing, quantizing, and tuning a trained model so it generates more tokens per chip per second. That is what Eigen AI's team from the Massachusetts Institute of Technology's HAN Lab does, using techniques including post-training quantization and KV-cache optimization. The second is system-level: orchestrating the compute environment so that model serving, resource allocation, and workload routing operate efficiently at scale. That is what Clarifai spent more than a decade building.
Roman Chernin, co-founder and Chief Business Officer of Nebius, described it directly in the announcement: delivering efficient execution at scale requires model optimization, system design, and compute orchestration to work together. Token Factory now has both layers under one roof. Most competitors have one or neither.
"Delivering efficient execution at scale is a system optimization game: model optimization, system design, and compute orchestration all have to work together." — Roman Chernin, Co-Founder and CBO, Nebius
Matthew Zeiler, Clarifai's founder and chief executive, joins Nebius as Senior Vice President of Research, with a focus on multimodal agentic reasoning, world models, token efficiency, and long-term memory. That research agenda maps directly onto the workload types enterprise buyers are beginning to pressure-test in production: agents that need context persistence, multimodal inputs, and consistent latency under load.
The Defense Carve-Out Is the Part Worth Reading Twice
The deal announcement specifies that the license Nebius received covers Clarifai's modern AI inference and compute orchestration technology only. It explicitly excludes Clarifai's legacy computer vision models and any intellectual property, products, services, or commercial arrangements associated with Clarifai's United States government and defense programs.
That carve-out is not boilerplate. Clarifai has been an active vendor in the defense and intelligence community, with computer vision capabilities deployed in sensitive environments. The deliberate exclusion of that business from the Nebius transaction raises a question that neither party has answered publicly: who holds the defense business now, and what is its path forward? A carved-out government practice without its engineering leadership is a different asset than it was before this deal closed.
For enterprise buyers who are not in defense, the carve-out may seem irrelevant. It is not. It signals that at least part of Clarifai's operational infrastructure, contracts, and possibly personnel remains in a structure separate from Nebius. Understanding whether that creates continuity risk for any Clarifai commercial relationship requires a direct conversation with both parties.
The license scope is limited to modern inference and orchestration technology. Legacy computer vision models and U.S. government contracts were explicitly excluded. Enterprise buyers with existing Clarifai relationships should verify how their specific deployments are affected.
Nebius Is Making a Different Bet Than CoreWeave
The neocloud category has been framed primarily as a hardware story: companies with access to large inventories of graphics processing units, offering faster provisioning and AI-optimized infrastructure compared to the general-purpose hyperscalers. CoreWeave built its position on that premise. Nebius is now making a different argument.
By acquiring Eigen AI and licensing Clarifai's IP, Nebius is asserting that the competitive advantage in AI infrastructure will shift from raw compute availability to software-defined inference efficiency. Token efficiency reduces cost per token generated. Better compute orchestration reduces wasted capacity. Both matter more as enterprise AI moves from experimentation into production workloads running continuously at scale.
Nebius also announced in May 2026 that it had secured up to 1.2 gigawatts of power and land for a new owned AI factory in Pennsylvania, and broke ground on a flagship campus in Independence, Missouri. The infrastructure build is not slowing. But the Eigen and Clarifai moves suggest Nebius understands that having the most chips is not a durable moat if the software layer above them is undifferentiated.
The prior coverage here of Nebius's acquisition of Tavily, the agentic search company, fits the same pattern. Tavily grounds agent reasoning in real-time data. Eigen AI optimizes the model. Clarifai orchestrates the system. Token Factory is becoming a vertically integrated inference platform, not just a managed hosting product.
What This Means for the Infrastructure Buyer
Enterprise teams evaluating AI infrastructure in 2026 are increasingly asking a question that hyperscaler pricing sheets do not answer clearly: what does it actually cost to run this model at this latency threshold at this volume, reliably, in production? Nebius is positioning Token Factory as the place that question gets answered with a vertically integrated answer rather than an assemblage of vendor services.
The integration risk is real. Eigen AI's model-level work and Clarifai's system-level work were built by different teams with different design philosophies. Making them interoperate cleanly under one platform takes engineering time that the acquisition press releases do not account for. Zeiler leading research at Nebius helps, but organizational integration is a different problem than technical integration.
The talent question also matters. A select group of Clarifai engineers and researchers joined Nebius's infrastructure teams, per the announcement. The phrase "select group" leaves open how many did not make the transition, and what that means for continuity of Clarifai's existing customer base.
Nebius has assembled model-level and system-level inference optimization into a single platform faster than any neocloud competitor. The question for your infrastructure team is whether Token Factory's integrated stack can demonstrably reduce your cost per inference token at production scale, and whether the Clarifai defense carve-out creates any discontinuity in the commercial relationships or deployments you already have in place. Ask both questions before the next contract renewal.
- Nebius. "Nebius Welcomes Clarifai's Core Team and Licenses Inference IP to Strengthen Nebius Token Factory." Business Wire, 12 May 2026, nebius.com.
- Nebius. "Nebius Group Announces Agreement to Acquire Eigen AI." Business Wire, 1 May 2026, nebius.com.
- Clarifai. "About Clarifai." Clarifai, 2026, clarifai.com.
- Bellamkonda, Shashi. "Nebius Group NV Acquires Tavily to Consolidate Agentic Search Infrastructure." shashi.co, Feb. 2026, shashi.co.
- Clarifai. "Clarifai Is Heading to Santa Clara for the AI & Big Data Expo." X (formerly Twitter), 2025, x.com/clarifai.
