We traced Apple's on-device AI playbook: a 3B-parameter model in every iPhone, a $1B-a-year Gemini rental, and the memory repricing hitting its supply chain.
Barebone Research
||12 min read
The Punchline
In June 2024, on a stage in Cupertino, Apple demoed a Siri that could see your screen, remember what was buried in your old messages, and take actions across your apps. It was the centerpiece of Apple Intelligence - the company's answer to two years of "Apple is behind on AI."
It is April 2026. That Siri still has not shipped.
In between came an official delay, a shareholder lawsuit, the loss of the executive who ran Apple's models team, the retirement of its AI chief, and a deal to rent a brain from Google. By the scoreboard everyone watches - model quality, shipped features, retained researchers - this is what losing looks like.
So we used Barebone to lay the entire saga end to end: two years of announcements, transcripts, executive moves, and supply-chain prints, set against what Apple's business and stock actually did over the same period. The two timelines barely look like they belong to the same company.
A record $143.8 billion December quarter, up +16%. A $4 trillion valuation touched on October 28 - the third company in history to get there. A stock up +50% from last April's low. The most mocked AI strategy in tech sits on top of the best run of results Apple has ever printed.
That gap is the story.
The Case That Apple Lost
First, the prosecution. Every date below is from the public record:
Date
What happened
Jun 2024
WWDC: Apple announces a personalized Siri - on-screen awareness, personal context, in-app actions - initially aimed at iOS 18.4
Mar 7, 2025
Apple officially delays the features, now expected "in the coming year"
Jun 2025
Shareholders sue over the delay; WWDC ships a developer framework, not the new Siri
Jul 2025
Ruoming Pang, who ran the foundation-models team, leaves for Meta on a package reported above $200 million
Nov 5, 2025
Bloomberg: Apple to pay Google roughly $1 billion a year for a custom 1.2-trillion-parameter Gemini model to power the new Siri
Dec 1, 2025
AI chief John Giannandrea announces his retirement; Amar Subramanya takes over as VP of AI
Feb 11, 2026
Bloomberg: the headline Siri features slip past iOS 26.4 and get spread across iOS 26.5 (May) and iOS 27 (September)
The March 2025 delay is worth re-reading, because Apple never posted it to its own newsroom. It was a statement circulated to a handful of journalists on a Friday:
"It's going to take us longer than we thought to deliver on these features and we anticipate rolling them out in the coming year."
Thirteen months later, the features that statement refers to still have not rolled out. The original target was iOS 18.4. The current plan, per Bloomberg's February reporting, spreads them across iOS 26.5 in May and iOS 27 in September - and the personal-data piece is "especially likely to slip" again.
The talent picture is no kinder. Pang, the man responsible for Apple's in-house models, was worth a nine-figure package to Meta - and then left Meta for OpenAI roughly seven months later. Giannandrea, the AI chief hired from Google in 2018, announced his retirement three weeks after the Gemini deal leaked, with Subramanya - hired in from Microsoft, and before that Google - taking over foundation models, machine-learning research, and AI safety under Craig Federighi.
A two-year-late feature, a raided lab, and a rented brain. Case closed, apparently.
The Scoreboard Disagrees
Now the defense. Apple reported its fiscal first quarter on January 29, covering the December holiday period - the first full quarter of iPhone 17 sales:
Metric (Q1 FY2026)
Result
vs. year ago
Revenue
$143.8B
+16%
Net income
$42.1B
-
Diluted EPS
$2.84
+19%
iPhone revenue
$85.3B
+23%
Services revenue
$30.0B (record)
+14%
Greater China revenue
-
+38%
It was the largest quarter in the company's history. iPhone set all-time records in every geographic segment. Tim Cook's words to CNBC: "The demand for iPhone was just simply staggering."
The stock told the same story in advance.
The market never priced Apple as the AI loser
AAPL month-end closes, Mar 2025 - Apr 6, 2026. The April 2025 panic low close was $172.42 (Apr 8). Source: Barebone
Twelve months ago, Apple was the poster child of the April 2025 tariff panic - the stock closed at $172.42 on April 8, 2025. From there it climbed all the way through the iPhone 17 launch, crossed a $4 trillion valuation on October 28 (only NVIDIA and Microsoft got there first), and peaked in late November. Today it closed at $258.86 - up +50% from that low, though down about -5% so far in 2026 and back below the $4 trillion line.
Hold that tension. The market spent 2025 deciding Apple's AI lag didn't matter, and has spent early 2026 having second thoughts. Both reactions are rational - because Apple is playing a different game than the one it keeps losing.
The Playbook: Rent the Brain, Own the Body
Strip away the keynote theater and Apple's actual AI strategy - the one visible in what it shipped rather than what it promised - has four moves.
A small, free brain in every pocket
At WWDC in June 2025, while the new Siri stayed offstage, Apple shipped the Foundation Models framework: a roughly 3-billion-parameter language model built into iOS 26 that any developer can call in a few lines of Swift. Parameters are the tuned weights inside a model - a rough proxy for capability - and 3 billion is tiny by frontier standards, compressed to 2 bits per weight so it fits in a phone's memory alongside everything else.
The number that matters isn't the parameter count. It's the price: inference - actually running the model - costs developers nothing. Every other AI platform meters usage by the token. Apple's meter doesn't exist, because the customer already bought the computer. Every app that builds on that free local model deepens the ecosystem's dependence on Apple silicon, at zero marginal cost to Apple.
Hardware rebuilt around it
The A19 Pro chip in September's iPhone 17 Pro added what Apple calls neural accelerators to every GPU core - dedicated matrix-math units that lift peak GPU compute for AI workloads by up to 4x over the prior generation. The iPhone 17 Pro carries 12GB of RAM, because models live in memory.
This is AI capex, same as the hyperscalers' - with one difference. A data center is a cost the company carries for years. Apple's AI infrastructure is sold to the customer, at retail, one handset at a time.
Rent what you cannot carry
A 3B model is, by Apple's own description, not designed for world knowledge or advanced reasoning. For that, requests route to Private Cloud Compute - Apple-run servers engineered so that, by design, Apple itself cannot read what you send them - currently running an Apple model of roughly 150 billion parameters. And for the new Siri, per Bloomberg, Apple will run a custom 1.2-trillion-parameter Gemini model on those same Apple servers, paying Google about $1 billion a year. Google supplies the brain; it never touches the customer.
The 400x gap Apple is renting, not racing
Reported parameter counts of the models behind Apple Intelligence, in billions. Source: Barebone
Apple modelsRented from Google
That chart is the strategy in one picture. The gap between the model in your pocket and the model Apple is renting is 400x - and Apple is not trying to close it. It is arbitraging it. Set against the December quarter, $1 billion a year is about 0.7% of a single quarter's revenue: Apple converted the most expensive arms race in business history into a procurement line item. Meanwhile, Bloomberg reports Apple has a roughly 1-trillion-parameter cloud model of its own in development, which it hopes to have ready as early as this year - with the explicit intent of eventually swapping Gemini back out.
Rent the brain. Own the body. Reserve the right to replace the brain.
It sells without Siri
Here is the quiet tell in the Q1 numbers: whatever drove the strongest iPhone cycle in years, it was not the new Siri, which did not exist. The hardware sells without the AI. When the AI finally lands, it lands on hundreds of millions of devices that are already in pockets, already paid for, already carrying the silicon to run it.
The Supply Chain Tell
If the strategy is "put the AI in the phone," the binding constraint is not models. It is memory.
On-device models occupy RAM permanently - and Apple is specifying more of it exactly as the memory industry walks away from phones. Suppliers have shifted capacity toward high-bandwidth memory for AI data centers, starving the conventional and mobile DRAM that consumer devices use. TrendForce's February 2 outlook for first-quarter contract prices - the prices device makers lock in with suppliers each quarter - reads like a misprint:
On-device AI runs on DRAM, and DRAM just repriced
Projected 1Q26 contract price increase vs 4Q25, by memory category. Source: Barebone
What goes in a phoneOther memory categories
PC DRAM projected up more than +100% quarter-over-quarter, a record. Mobile DRAM - the LPDDR chips that go in an iPhone - up roughly +90%, the steepest increase in its history. Industry trackers put memory at 10 - 15% of a typical smartphone's bill of materials, and the effects are already visible: TrendForce cut its 2026 global smartphone production forecast to a -2% decline and reported brands raising prices and downgrading specs to cope. In January, SK Hynix overtook Samsung in annual profit for the first time ever. The picks-and-shovels trade of pocket AI is the memory makers - and the market has noticed.
For Apple the same chart is a cost problem. A strategy that demands more DRAM per phone now collides with the sharpest mobile-memory repricing on record, just as the iPhone 18 generation gets specified. TrendForce notes most US smartphone vendors locked their contracts in late 2025, which buys time. It does not buy immunity.
Where the Thesis Breaks
Execution is not a detail. The same feature set has slipped from iOS 18.4 to "the coming year" to a phased rollout across two more releases. At some point a delay stops being a delay and becomes a statement about capability. If the May and September windows slip too, "different game" starts sounding like an excuse.
The rental cuts both ways. Apple is paying its fiercest platform rival because its own models are not good enough - and the in-house team that is supposed to fix that keeps losing its leaders. If the 1-trillion-parameter replacement slips the way Siri slipped, a temporary dependence hardens into a permanent one, and Google will price the renewal accordingly.
The body can be commoditized too. The deep risk to "own the body": if assistants become the interface, the device becomes the peripheral. Every frontier lab is circling hardware. Apple's moat assumes people choose the phone first and the AI second. That has been true for nineteen years of iPhone. It is not a law of physics.
The stock is not priced for failure. Down -5% this year but up +50% from last April and hovering just under the $4 trillion line, Apple trades as if the strategy works. A 16%-growth quarter bought a lot of forgiveness. Two more missed Siri windows would test how much is left.
What This Means
Apple's bet is that in consumer AI, distribution beats invention - that the company that owns the pocket can rent intelligence until it builds its own. The two-year Siri fiasco is the strongest evidence Apple cannot build the brain. The $143.8 billion quarter is the strongest evidence it may not need to, yet. Tonight, the market is pricing the second sentence.
The signals that will settle it:
The May and September windows. Bloomberg's February reporting points to iOS 26.5 and iOS 27. Either the personal-context Siri ships inside them, or the delay becomes the story again.
WWDC in June. Watch the on-device model, not the demos. If it grows past 3B parameters or the free-inference framework expands, the developer land grab is compounding.
The in-house trillion-parameter model. The Gemini deal is either a bridge or a crutch. The swap-out date - if it ever comes - is the verdict.
Mobile DRAM contract prints. TrendForce's quarterly numbers are the cost side of the AI phone, and the margin question hanging over the fall lineup.
The Services line. $30 billion a quarter and growing +14% - if owning the distribution layer of AI ever monetizes, this is where it shows up first.
Losing the model race while running the only store the winners must sell through has been done before. It used to be called search, and Apple collected roughly a billion dollars a month for it. The question for the next twelve months is whether AI rhymes - or whether, this time, the brain matters more than the body.
Data: Barebone | Sources: Apple Q1 FY2026 earnings release (Jan 29, 2026), Apple Newsroom (Dec 1, 2025), Apple WWDC25 Foundation Models documentation, Apple statement of Mar 7, 2025, Bloomberg reporting (Jul 2025, Nov 5, 2025, Feb 11, 2026), TrendForce 1Q26 memory price outlook (Feb 2, 2026) | Prices as of April 6, 2026 close
Activate Your AI Agentic Investment Research Terminal
$100M+connected
50,000+investors
Share this article:
Disclaimer · Not Financial Advice
The content on this page is for informational and educational purposes only. It does not constitute financial, investment, legal, or tax advice, and is not a recommendation, offer, or solicitation to buy or sell any security or to adopt any investment strategy. Any securities or strategies mentioned are for illustration only. Market data may be delayed or inaccurate. Past performance is no guarantee of future results, and all investing involves risk, including the possible loss of principal. Barebone AI is not a registered investment adviser or broker-dealer. Always do your own research and consider consulting a licensed financial professional before making investment decisions.