BAREBONE
Deep Dives

Cerebras: The Dinner-Plate Chip Going Public at 90x Sales

We read the amended S-1 ahead of the Cerebras IPO: $510M revenue, 86% from two Abu Dhabi entities, a $20B OpenAI deal, and a price near 90x sales.

Barebone

Barebone Research

||12 min read

The Slowest Thing on Your Phone

Everything on your phone is instant - Instagram, YouTube, Google Search. Then you ask an AI a question, and you wait, watching words appear one at a time like a telegraph.

There's a reason. Every time you use an AI model, a chip somewhere has to generate your answer token by token - a process called inference. Training is how models learn; inference is the unglamorous work of actually running them, billions of times a day, and it is fast becoming the dominant computing workload on earth.

This week, the company that bet its existence on inference goes public. Cerebras Systems - maker of a single chip the size of a dinner plate - is expected to price its Nasdaq IPO Wednesday and begin trading Thursday under the ticker CBRS. The order book, per Reuters reporting, is more than twenty times oversubscribed. OpenAI has signed a contract worth more than $20 billion. Amazon reportedly bought $270 million of stock. And NVIDIA - the company whose market Cerebras is explicitly attacking - just spent $20 billion of its own money arming itself against exactly this thesis.

We used Barebone AI to rebuild the Cerebras story from the amended prospectus, a decade of funding rounds, and eighteen months of benchmark wars. Four numbers frame everything: $510 million of 2025 revenue, up 76% in a year. About 86% of it from two Abu Dhabi-linked entities. A GAAP "profit" that disappears the moment you read the footnote. And an ask of up to $48.8 billion - roughly 90 to 96 times last year's sales.

The most interesting IPO of the year is not automatically the best-priced one. Let's take it apart.

One Chip the Size of a Dinner Plate

The idea Cerebras is going public on is older than most of its engineers - and it has a body count.

In 1980, Gene Amdahl, the legendary IBM mainframe architect, founded Trilogy Systems to build computers on one whole silicon wafer instead of cutting the wafer into hundreds of small chips. Trilogy raised roughly $230 million - the most lavishly funded startup of its era. At a test in December 1983, two microscopic wires crossed and the giant circuit glowed red before dying. By mid-1984 wafer-scale was abandoned; Trilogy became one of Silicon Valley's biggest financial failures of the pre-dotcom age.

The physics problem is brutal. Silicon wafers always carry defects, so for forty years the industry's answer was to dice each wafer into hundreds of small chips and throw away the broken ones. Make the whole wafer one chip, and a single flaw can kill the entire thing.

Cerebras - founded in 2016 by five alumni of SeaMicro, the server startup Andrew Feldman sold to AMD for $334 million - designed around the defects instead: tile the wafer with hundreds of thousands of tiny identical cores, expect some to arrive dead, and route around them. Not thousands of chips talking to each other. One chip. Three generations of it later, this is the WSE-3 next to NVIDIA's workhorse GPU:

Spec Cerebras WSE-3 NVIDIA H100
Silicon area 46,225 mm² 826 mm²
Transistors 4 trillion 80 billion
Compute cores 900,000 16,896
On-chip memory 44 GB SRAM 50 MB
Memory bandwidth 21 PB/s 3.35 TB/s
Peak FP16 compute 125 petaflops ~2 petaflops

Divide the last row and you get the marketing claim: one wafer is theoretically equivalent to about 62 H100s. The operative word is theoretically - that is peak arithmetic against NVIDIA's previous-generation part, not delivered performance.

The row that actually matters is memory bandwidth. Generating text is a memory-bound problem: for every single token, the chip has to touch the model's weights. A GPU keeps those weights in memory stacked next to the processor and pays a toll on every trip - and a cluster of GPUs pays a second toll every time data hops between chips. Cerebras keeps the weights in 44 GB of memory on the silicon itself, with more than 6,000 times the bandwidth of an H100's memory system. That is the entire trick. One chip, no hops.

Speed Became the Product

For years that trick looked like a science project, because the money was in training - a batch job where throughput beats latency. Two things changed.

First, the economics. Training is a capital expense for a handful of frontier labs. Inference is an operating expense for everyone, forever. The buyer pool goes from five labs to every enterprise on the planet.

Second, agents. An AI agent - software that books the flight, writes the code, runs the workflow - chains together dozens of model calls per task. A chatbot streaming at 60 tokens per second feels fine because you read along. An agent waiting on itself forty times in a row lives or dies by whether each step takes seconds or milliseconds. Latency compounds.

That's the demand curve Cerebras spent 2024 and 2025 benchmarking itself onto, with independent verification from Artificial Analysis:

Benchmark Cerebras speed Context
Llama 3.1 405B (Nov 2024) 969 tokens/sec A frontier-scale model at interactive speed
Llama 4 Scout (Apr 2025) 2,600+ tokens/sec 19x the fastest GPU provider at the time
Llama 4 Maverick (May 2025) 2,522 tokens/sec vs 1,038 for NVIDIA's tuned Blackwell submission

Benchmarks are marketing until somebody pays for them. Then, inside about six months, the three biggest names in AI infrastructure all paid.

Three Deals in Six Months

When Deal Terms
Late 2025 NVIDIA + Groq $20B licensing-and-acquihire: Groq's inference IP plus its engineering team
Early 2026 OpenAI + Cerebras $20B+ multi-year purchase of 750 MW of compute; warrants for a minority stake; ~$1B from OpenAI toward data centers
Mar 2026 AWS + Cerebras CS-3 systems deployed inside AWS data centers, sold through Bedrock; Amazon reportedly bought ~$270M of stock

Start with NVIDIA, because it's the tell. NVIDIA paid $20 billion for Groq - the other speed-first inference challenger - in a deal structured as a license-plus-acquihire of its technology and people rather than a purchase of the company. Jensen Huang explained the logic in writing:

"We plan to integrate Groq's low-latency processors into the NVIDIA AI factory architecture."

Incumbents do not spend $20 billion on categories they consider irrelevant. In one stroke, the deal validated the inference-speed thesis Cerebras is selling - and armed the only competitor that matters with dedicated low-latency silicon of its own. By GTC in March, Groq-derived accelerators were already on NVIDIA's roadmap slides.

OpenAI went the other way. Its agreement, reported by The Information in early 2026, commits more than $20 billion over multiple years for 750 megawatts of Cerebras capacity, hands OpenAI warrants for a minority stake, and includes about $1 billion from OpenAI toward data centers. One detail the louder headlines garbled: Sam Altman and Greg Brockman were early personal investors in Cerebras. A fair governance question for OpenAI - but disclosed in the IPO filings since 2024, and never a secret.

Amazon's deal may be the most telling. AWS will put Cerebras systems in its own data centers and resell them through Bedrock, with Amazon's Trainium 3 chips handling prefill - reading your prompt, a compute-bound job - and the Cerebras wafer handling decode, the memory-bound work of generating the answer. AWS says the pairing speeds up output by roughly 5x. Note what Amazon kept for itself: the half of inference its own silicon is good at.

None of this is revenue yet. Which brings us to the filing.

The Numbers Under the Wafer

Revenue grew 21x in three years — off a small base

Cerebras annual revenue, USD millions, per S-1 filings. Source: Barebone

The growth is real, and steep: $24.6 million in 2022, $78.7 million in 2023, $290 million in 2024, $510 million in 2025. Twenty-one-fold in three years; up 76% last year alone.

The profitability is not. The income statement shows 2025 GAAP net income of $237.8 million, a dramatic swing from a $481.6 million net loss in 2024. Then you read the footnote: $363.3 million of that "profit" is a one-time, non-cash gain from extinguishing a forward-contract liability owed to G42, its Abu Dhabi anchor customer and shareholder. On the company's own non-GAAP measure, 2025 was still a loss of roughly $75.7 million. The operations lose money. The accounting had a good year.

None of this is hidden - it's disclosed plainly. But "first profitable year" will appear in a lot of coverage this week, and it is not the economic reality underneath.

Two Customers in Abu Dhabi

The diversification is mostly one Abu Dhabi network

Share of annual revenue by customer, per the amended S-1. MBZUAI is identified in the filing as a related party of G42. Source: Barebone

G42MBZUAI (G42 related party)All other customers

In 2024, a single customer - G42, the Abu Dhabi AI conglomerate - was 85% of revenue. In 2025, G42 fell to 24%, which looks like diversification until you read who took its place: Mohamed bin Zayed University of Artificial Intelligence, at 62% - an entity the filing itself identifies as a related party of G42. Two names, one network, about 86% of revenue.

This is not a new problem; it's the reason this IPO took twenty months. Cerebras first filed in September 2024, then stalled while CFIUS - the US committee that reviews foreign investment for national-security risk - examined G42's $335 million stake. The company raised $1.1 billion privately instead (at an $8.1 billion valuation, led by Fidelity and Atreides), withdrew the old filing in October 2025, and refiled in April 2026 with the OpenAI and AWS logos on board.

Those two deals are precisely the diversification the skeptics demanded. But they live in the future. Not a dollar of OpenAI or AWS money appears in the 2025 income statement investors are being asked to pay roughly 90 times for. The revenue that exists was earned almost entirely in Abu Dhabi; the revenue that fixes the concentration exists, so far, as contracts.

Priced for Perfection

From $8.1B to a $48.8B ask in seven months

Post-money valuations of private rounds vs fully diluted value at the top of each IPO range. Source: Barebone

Realized private roundIPO range (top, fully diluted)

The repricing happened in real time. September 30: a $8.1 billion private valuation. April: the refiled S-1. May 4: 28 million shares at $115 - $125. May 11, after a roadshow Reuters sources described as more than twenty times oversubscribed: 30 million shares at $150 - $160 - up to $4.8 billion raised, and a fully diluted valuation of as much as $48.8 billion. Six times the September price in seven months, before a single public share has traded. On private secondary platforms, shares reportedly changed hands around $187 the day the range went up - above the top of the new range itself.

At the top of the range, Cerebras would be valued at roughly 96 times 2025 revenue (about 90x at the low end). One way to size that: for the multiple to compress to a still-premium 15x sales with the valuation merely flat, revenue has to reach roughly $3.3 billion - six and a half times 2025's total. That requires the OpenAI and AWS commitments to convert into recognized revenue on schedule, and the Abu Dhabi business to keep growing underneath them.

And the other side of the trade is wide awake. The honest bear case:

  • NVIDIA closed most of the gap when it tried. Cerebras beat Blackwell on Llama 4 Maverick 2,522 tokens per second to 1,038 - a 2.4x gap, not the 19x of friendlier benchmarks. And that was before NVIDIA owned Groq's low-latency technology and team.
  • The software moat points the wrong way. Most of the world's AI workloads are written for NVIDIA's CUDA ecosystem and already live in GPU-filled data centers. Cerebras has to stay enough faster, for long enough, to justify porting.
  • Its partners build chips too. The AWS deal pairs Cerebras with Amazon's own Trainium 3 - a reminder that every hyperscaler customer is also a competitor with a silicon roadmap.
  • Wafer-scale economics are still being proven. Trilogy's ghost has been answered - Cerebras ships, at production scale. But supporting a $48.8 billion valuation with wafers is a different test than setting benchmark records with them.

Everything has to go right. In markets, some things usually don't.

What This Means

This listing is the cleanest referendum the market has been offered on a single question: is AI's center of gravity moving from training to inference, and does that shift require different silicon? A twenty-times-oversubscribed book says investors believe the first half. The S-1 says the second half is still mostly a promise - $510 million of trailing revenue against $20-billion-plus of forward commitments.

The signals worth watching, in order:

  1. The first post-IPO quarterly filing. Does OpenAI or AWS revenue actually appear, and does the Abu Dhabi share fall toward half? Concentration is the thesis-killer if it doesn't.
  2. Where the stock settles, not where it opens. A book this oversubscribed all but guarantees a hot open; demand and froth look identical on day one. The distinction shows up over the following quarters.
  3. NVIDIA's first Groq-integrated benchmark. The day NVIDIA publishes a tokens-per-second number with Groq silicon in the loop, Cerebras' speed premium gets marked to market.
  4. The GAAP/non-GAAP spread. The one-time G42 gain flattered 2025. Watch whether genuine operating leverage shows up in 2026.

Cerebras spent a decade proving that one giant chip could survive physics. Gene Amdahl never got that far. The question this week is narrower and harder: whether a company with two customers and a $510 million top line can survive a $48.8 billion price.


Data: Barebone | Sources: Cerebras amended S-1 (SEC EDGAR), Cerebras and AWS press releases, The Information, Artificial Analysis benchmarks, Reuters | Data as of May 12, 2026

Activate Your AI Agentic Investment Research Terminal

$100M+connected
50,000+investors
Barebone home research screen
Share this article:

Disclaimer · Not Financial Advice

The content on this page is for informational and educational purposes only. It does not constitute financial, investment, legal, or tax advice, and is not a recommendation, offer, or solicitation to buy or sell any security or to adopt any investment strategy. Any securities or strategies mentioned are for illustration only. Market data may be delayed or inaccurate. Past performance is no guarantee of future results, and all investing involves risk, including the possible loss of principal. Barebone AI is not a registered investment adviser or broker-dealer. Always do your own research and consider consulting a licensed financial professional before making investment decisions.