barcik.training — Publications

The Economics
of the Frontier

Seven ledgers on the business of the AI frontier —
and one habit for reading its numbers.

Robert Barcik Snapshot: May 2026 robert@barcik.training
Introduction

The Premise

In a single week in 2026, you could read that the same AI lab was a “$30 billion business,” a “$14 billion business,” and a business losing money on every customer it served. All three were published by serious outlets. None of them were wrong.

They were using three different rulers. One measured the best recent month and multiplied by twelve. One measured revenue an accountant would actually recognise. One measured what it costs to serve a customer against what that customer pays. The numbers disagree because the units disagree — and almost nobody tells you which unit they are holding.

This is the real problem with the economics of frontier AI. It is not that the numbers are secret. It is that they are enormous, fast-moving, and quoted in whichever form flatters the moment — and the human mind is not built to hold a $700 billion figure steady long enough to ask whether it means anything. Coverage of this industry drowns you in magnitudes. It rarely hands you a way to think with them.

This booklet is that way to think. It is organised as seven ledgers — seven separate accounts of how money actually moves at the frontier. A ledger is a record, not a forecast: each one shows you a different mechanism, and together they let a practitioner or an executive read any headline about Anthropic, OpenAI, Mistral, or the Chinese labs and know what they are looking at.

The seven ledgers are: How a Lab Works (the basic machine — what a model costs to build and how it is sold), Reading the Books (the toolkit — how headline numbers are constructed), The Vintage Problem (why a lab can profit on every model it sells and still lose billions), A Tale of Two Labs (why Anthropic and OpenAI are no longer the same kind of company), The Dark Factory (the race to become the utility company of intelligence), The Money Goes in a Circle (how investor dollars become revenue dollars), and The Rest of the World (Mistral, the Chinese labs, and the commodity tier).

The seller’s side

A companion booklet in this series, The Token Economics, looks at AI cost from the buyer’s side — what an organisation pays per token, and when self-hosting beats an API. This booklet is deliberately the mirror image: the seller’s side. Not “what does AI cost you,” but “how do the companies selling it make — or lose — money, and how sturdy is that business.” If you build on these labs, depend on them, or answer to someone who is writing them large cheques, that is the question underneath the question.

One recurring habit

Throughout, you will meet a small green box labelled Translate. Each one takes a number too big to feel and restates it as one you can — a yearly loss as a daily burn rate, an aggregate spend as a figure per person, a capital commitment as something physical. The boxes are not decoration. They are the single habit this booklet is trying to install: never repeat a frontier number you have not first converted into something a person can picture.

Translate

Anthropic reported a $30 billion revenue run rate in April 2026.1 “Run rate” means: take the best recent month, multiply by twelve. So the underlying claim is really “we earned about $2.5 billion in our best month.” That is still extraordinary — but it is a different, smaller, more honest sentence than the headline, and you can hold it in your head.

A note on what this booklet is — and isn’t

Every figure here is a snapshot dated May 2026. The dollar amounts will age within months; the industry reprices itself constantly. That is deliberate: the durable product of this booklet is the method, not the magnitudes. If a number is stale by the time you read it, the way of reading it still holds.

Two more honesties. First, the public record is uneven — OpenAI is more heavily reported on than Anthropic, and some of the most-quoted Anthropic figures come from the company itself or from sympathetic sources. Where that matters, the text says so, and every figure carries a small tag — reported projected estimated — telling you how solid it is. Second, the goal is comprehension, not a verdict. This is not a case that the frontier is a bubble, nor that it isn’t. It is a set of instruments for reading the gauges yourself.

Ledger 1

How a Lab Works
“Where the money comes from, and where it goes”

The build
A model is built once, like a factory or a film: a research team and one long training run produce a single finished artefact. That is a large, one-time cost — paid in full before the model has earned a cent.
The sale
Then it is sold three ways — metered API, monthly subscriptions, enterprise bundles — while every answer it gives costs a little compute to serve. Build cost, serving cost, and a clock: that is the whole machine.

Before the billions, a small machine. Every later ledger is a consequence of how a single AI lab turns money into a model and a model back into money — so it is worth walking that circuit once, slowly, with an imaginary lab we will call Frontier Labs, Inc.

Diagram: a research team and compute produce one model, a one-time build cost, which is then sold through a metered API, monthly subscriptions, and enterprise bundles.
The machine in one picture: a large one-time cost to build a model, then three ways to sell access to it — while every answer it gives also costs a little compute.

Cost one: building the model

Frontier Labs sets out to build a model. Two things go in. The first is people — a few hundred researchers and engineers, paid serious salaries, for many months. The second is compute — an enormous cluster of specialised chips, rented or owned, running one long, uninterrupted training run in which the model learns from a vast amount of text and code.

At the end of that run you have exactly one thing: a single trained model — a large file of numbers that can answer questions. Everything spent getting there was a one-time cost. It is much closer to building a factory, or shooting a film, than to a monthly bill. You pay it all up front, before the model has earned a single cent. Real frontier training runs have been described as costing anywhere from roughly $100 million to well over $1 billion, depending on the model’s size and generation.2 For Frontier Labs, say the all-in build cost — team plus the training run — comes to $1 billion.

Cost two: running the model

Here is the part newcomers miss. Building the model was not the end of the spending. Every time a customer asks the finished model a question, the model has to think — and that thinking runs on chips that draw power and cost money. This running cost is called inference, and unlike the build cost it never stops. It is a small charge, paid again and again, on every answer the model ever gives. A lab therefore carries two quite different costs: one huge bill to build, and a stream of little bills to run.

Translate

Labs talk about selling “tokens.” A token is about three-quarters of a word. The cleanest way to picture it is a water meter: the customer pays for the words that flow through the model, in and out, and the meter clicks over a million tokens — roughly 750,000 words, about a fat novel’s worth — at a time. Selling intelligence, in this business, looks a lot like selling a metered utility.

Three ways to sell it

Now Frontier Labs has a model that works and costs a little to run. It opens three doors to revenue.

The metered API. Other companies’ developers connect their software directly to the model and are billed per token, exactly like that water meter — so much per million words in, more per million words out. Pure pay-as-you-go.

Subscriptions. Individual people pay a flat monthly fee — a $20-a-month plan, a heavier $200-a-month plan — for generous but capped access through an app. The lab is making a bet: that the average subscriber will cost less to serve than they pay, even though the heaviest ones cost far more.

Enterprise bundles. Large organisations negotiate packages — blocks of seats for their staff, higher usage limits, security features, committed-use contracts at a discount. These deals are bigger, stickier, and slower to sign than the other two.

The shape of the whole business

Walk Frontier Labs’ one model from cradle to grave and the shape appears. It cost $1 billion to build. Over the roughly two years before a better model replaces it, suppose it earns $2 billion across those three doors — while the compute to serve all those answers costs, say, $0.6 billion. The model, on its own, cleared its build cost and then some.

Frontier Labs, Inc. — the life of one modelIllustrative figure
Build it — research team + one training run (one-time)−$1.0B
Run it — compute to answer every request, over ~2 years−$0.6B
Sell it — API + subscriptions + enterprise, over its life+$2.0B
What the model leaves behind+$0.4B

Illustrative numbers for an imaginary lab. The pattern that matters — lifetime revenue of roughly twice the build cost — is the one Ledger 3 examines in detail.

So the machine has three parts: a large fixed cost to build, a metered margin on every answer sold, and a clock — because a better model, yours or a rival’s, will make this one obsolete. The whole game is to earn back the build, at the metered margin, before the clock runs out.

That is the entire business in one paragraph. Everything that follows is a consequence of it. Ledger 2 shows how labs report these costs and revenues — and how the reporting can mislead. Ledger 3 takes this exact shape and asks what happens when you run it not once but every year, each time building a model ten times larger. The rest of the booklet follows the same machine outward, into the strange economics it produces at scale.

Sources: Dwarkesh Patel interview with Dario Amodei · DeepSeek technical report · The Register — see Sources & Notes

Ledger 2

Reading the Books
“Which ruler is this number on?”

The trap
Frontier-lab numbers are reported in whichever form impresses most. A “$30B business” and a “$13B business” can be the same company in the same year. Repeat the headline and you have repeated a measurement choice you never inspected.
The toolkit
Four questions defuse almost every misleading figure: Is this run rate or recognised revenue? Is it gross or net of the channel? Is this capital committed or spent? And whose money is it — a customer’s, or an investor’s?
×12
A “run rate” is one good month, annualised
≈½
Recognised (GAAP) revenue vs the run-rate headline
~$8B
Disputed gap between gross and net revenue at one lab3
5 yrs
Typical span a “$300B deal” is spread across

The headline number is a costume

Start with the most quoted figure in the industry: the run rate. When a lab says it has reached a “$30 billion run rate,” it has not earned $30 billion. It has taken its strongest recent month and multiplied by twelve. Run rate is a forecast wearing the costume of a result. It is not an accounting standard, it has no auditor, and in an industry growing this fast it always flatters — because the month it annualises is, by selection, the best one so far.

Set run rate next to the number an accountant would actually sign: recognised revenue, the GAAP figure, the money genuinely booked over a real trailing year. The two diverge sharply. OpenAI exited 2025 describing a $20 billion run rate; the revenue recognised on its books for that year was roughly $13 billion.4 As a rule of thumb for a company compounding this fast, recognised revenue runs near half the run-rate headline. Neither number is a lie. But they answer different questions — “how fast is the last month?” versus “what did a year actually produce?” — and the gap between them is the first thing a careful reader restores.

Gross, net, and the $8 billion argument

The second ruler is gross versus net. Frontier models are sold not only direct but through hyperscaler marketplaces — Amazon Bedrock, Google Vertex, Azure. When a customer buys Claude through Amazon, a cut stays with Amazon. Book the whole sale and your revenue is gross; book only your share and it is net. The choice can swing a top line by 20 to 40 per cent.

This is not abstract. In April 2026 a leaked internal memo from an OpenAI executive argued that Anthropic’s revenue figure was overstated by roughly $8 billion precisely because Anthropic books reseller sales gross while OpenAI’s comparison was net.3 Sort that out and the two companies’ 2026 revenues land far closer together than the duelling press releases suggest. The lesson is not that one side cheated. It is that two true numbers are not comparable until you know each one’s ruler — and that you, the reader, must do that work because the headline will not.

Committed is not spent

The third ruler separates a commitment from a cost. The frontier deals in numbers like “a $300 billion compute deal” or “$700 billion of capital expenditure.” Almost always these are multi-year commitments — a $300 billion cloud contract is spread across five years, so it is nearer $60 billion a year, and even that arrives only if the buyer keeps consuming. A pledge to build capacity is not cash that has left the building. When a headline collapses a five-year commitment into a single staggering figure, your job is to divide it back out and ask what is contractually firm versus contingent on milestones nobody has hit yet.

Translate

“A $300 billion deal” sounds like a number from national accounts. Spread it across its real five-year term and it is about $60 billion a year — and that only if every year is consumed in full. The same arithmetic shrinks “$700 billion of AI capex” from an incomprehensible total into an annual run of build-out you can at least compare, year on year, against the revenue it is supposed to earn.

Whose dollar is it?

The fourth ruler is the subtlest, and Ledger 6 is devoted to it, so here only the flag. A dollar of revenue is supposed to come from a customer — external demand, someone choosing to pay for a service. But at the frontier some revenue dollars began life as investment dollars: a hyperscaler buys equity in a lab, the lab spends that money on the hyperscaler’s cloud, and it returns as the hyperscaler’s revenue. The dollar is real. Whether it represents real external demand is a different question — and one worth holding open every time you see a growth chart.

Why this is not cynicism

None of this means the frontier labs are running a confidence trick. Run rate is a normal way for fast-growing software companies to talk; gross-versus-net is an ordinary accounting judgement; multi-year commitments are how infrastructure has always been financed. The problem is not deception — it is scale and speed. The numbers are large enough to switch off intuition and they move fast enough that the flattering version is always close at hand. So the discipline of this booklet is small and repeatable: before you repeat any frontier figure, ask which of the four rulers it is on, and convert it into something a human can picture. Do that, and the remaining five ledgers become legible. Skip it, and you are just moving very large numbers around.

Sources: VentureBeat · Sacra · CNBC · Om Malik (on Microsoft’s 10-Q) · The Information — see Sources & Notes

Ledger 3

The Vintage Problem
“Every model sells at a profit. Why is the company losing billions?”

The framing
Look at a single model as its own product — a vintage. Over its serving life each model earns roughly twice what it cost to train. On that view, the product is a good business. The company loses money only because it keeps choosing to finance a far bigger successor.
The interrogation
The framing holds only if three things stay true: each vintage gets its full earning life, each 10×-larger model still earns its keep, and demand is forecast correctly. Miss any one and the per-vintage logic stops being reassuring.
Two charts: the top shows each model's training cost beside its roughly doubled lifetime revenue; the bottom shows the company's annual result going steeply negative across the same years.
The same money, two views. Per model (top): lifetime revenue runs about twice the training cost. Company-wide and year by year (bottom): an accelerating loss, because each year a successor roughly 10× larger is being financed. A stylised illustration of the framing — not Anthropic’s accounts.

A vineyard that bleeds cash

Picture a vineyard. Every vintage it bottles sells out at a healthy profit. And every year it loses money — because every year it plants a new field ten times the size of the last. The books show widening losses. The wine is not the problem. The expansion is.

This is, near enough, the argument Anthropic’s chief executive Dario Amodei has made for why his company can be deeply unprofitable and a sound business at the same time.5 Stop looking at the company’s annual profit-and-loss statement, he says, and look instead at a single model as a product with its own lifetime accounts — its own vintage. His toy version runs like this. A model that cost about $100 million to train in 2023 goes on to earn roughly $200 million serving customers. The 2024 model costs around $1 billion and earns about $2 billion. Each vintage, taken alone, returns about twice its training cost — exactly the shape of the single model in Ledger 1.

Yet in any given year the company is also paying to train the next model — and that one costs roughly ten times more. So while the 2024 vintage is happily earning its $2 billion, the company is spending $10 billion training the 2025 model. The annual profit-and-loss statement nets these against each other and shows an accelerating loss: small, then large, then alarming. The conventional reading — “this company loses more money every year” — is arithmetically true and, Amodei argues, completely misleading. The losses are not a failing product. They are the visible cost of a choice to keep planting bigger fields.

The number under the framing: two costs, not one

That story only works if a vintage really does earn back its build, and to see whether it can you have to separate the two costs from Ledger 1 and keep them apart.

The training cost is the one-time bill to build the model — the $1 billion in our example. It is fixed: it does not change whether the model then serves ten customers or ten million.

The serving cost is what it takes to run the model for one customer’s request — a sliver of compute, paid every time. Whether a vintage ever earns back that fixed training bill depends entirely on the gap between what the lab charges for a request and what the request costs it to serve. That gap is the inference gross margin, and it is the number the whole vintage framing quietly rests on.

Working the margin

So work it. Take a million tokens of output from a frontier model — the meter from Ledger 1, clicked over once. The customer is billed on the order of $25. Producing those tokens — the GPU time, the electricity — costs the lab perhaps $6. The lab keeps the difference.

One million output tokens, a frontier modelIllustrative
Billed to the customer~$25
Compute cost to generate them−$6
Gross profit the lab keeps+$19
Inference gross margin~76%

Illustrative, and consistent with the research firm SemiAnalysis, which puts gross margins on frontier models north of 70%.6

A margin near 75 per cent is what makes the vintage arithmetic possible. The training cost is fixed and paid once; the serving margin is collected on every token, forever. Serve enough tokens at 75 cents of profit on the dollar and the cumulative serving profit climbs past the $1 billion training bill — and then past twice it. The famous “2×” is not magic. It is a high-margin meter, running long enough.

Why 2024 broke the framing — and 2026 mended it

The framing is only available in 2026 because that margin recently flipped. In 2024, Anthropic’s gross margin on paying customers was reported at roughly minus 94 per cent — it spent nearly two dollars serving every dollar it billed.7 With a negative margin, no amount of volume rescues you: every additional token served deepens the loss, and the fixed training bill can never be earned back. That was the “selling dollars for cents” the industry’s critics described, and in 2024 they were right.

Then, through 2025 and into 2026, cheaper hardware and far better serving software pushed that margin from deeply negative to comfortably positive — on SemiAnalysis’s reading, Anthropic’s inference margin moved from the high-30s to above 70 per cent inside a year. Only once a vintage clears a healthy profit on every token does “each model earns twice its cost” stop being a hope and start being arithmetic. The vintage framing did not prove itself. A margin flipped, and the framing became possible.

Translate

OpenAI’s internal financials, reviewed by the press, project an operating loss in the region of $74 billion for 2028.8 A yearly figure that size is just noise to the mind. Divide it down: it is roughly $200 million a day, every day, for a year. The vintage framing asks you to believe that daily burn is the cost of planting next year’s field — not the sound of a business falling apart. Whether you believe it is the rest of this ledger.

The interrogation: three ways the framing breaks

The vintage story is elegant, internally consistent, and the most attractive idea in this booklet. That is exactly why it deserves pressure rather than applause. It rests on three assumptions, and each one is contestable.

One: the vintage must get its full earning life. “Earns twice its cost” assumes the model serves customers long enough to collect that revenue. But frontier models are made obsolete fast — sometimes by a competitor, sometimes by the lab’s own successor. If a vintage is retired in months rather than years, it never earns out, and the training cost is stranded. The hidden variable is depreciation: how quickly the expensive asset loses its value. The vintage framing tends to assume a generous earning life. The market has not been generous.

Two: each 10×-bigger model must still earn its keep. The cascade only works if a model that costs ten times more can be sold for enough to justify it. That held while each generation delivered a clear capability jump. The critic Gary Marcus, among others, argues the jumps are shrinking — that scaling has entered diminishing returns, and Amodei’s own remark that the field is “near the end of the exponential” concedes the direction of travel. If a $100 billion model is only modestly better than a $10 billion one, customers will not pay ten times more, and the cascade quietly inverts from a growth engine into a trap.

Three: it is a bet that must be won every single round. Amodei is candid that the whole structure depends on forecasting demand correctly — spend the right share of compute on research, keep a serving margin above 50 per cent, predict demand, and you print money; misjudge demand and the result “could swing wildly.” The critic Ed Zitron puts the bear case more bluntly: the losses are not an investment phase, they are the business model, and a business that must raise ever-larger sums to cover ever-larger losses is one bad forecast from a cliff. Sequoia’s David Cahn framed the same worry at the level of the whole industry as the “$600 billion question”9 — the gap between what the sector has spent on infrastructure and the revenue that spending must eventually justify. The vintage framing is the most coherent answer anyone has given to that question. It is not the same thing as a settled one.

A quiet caveat: the framing depends on a choice

One last thing to carry into Ledger 4. “Each vintage earns twice its cost” is not a fact read off a meter; it depends on how you spread — amortise — a training cost across the years a model earns. Stretch the schedule and the vintage looks profitable sooner; compress it and the same model looks like a loss. OpenAI and Anthropic do not make that choice identically, which is one reason their numbers resist clean comparison. The vintage framing is a genuine insight and an accounting lens — and a lens can be adjusted.

Sources: Dwarkesh Patel interview with Dario Amodei · SemiAnalysis · The Information · The Wall Street Journal · Sequoia Capital (“AI’s $600B Question”) · Where’s Your Ed At — see Sources & Notes

Ledger 4

A Tale of Two Labs
“Why Anthropic and OpenAI can’t be valued the same way”

Anthropic
Converging on an enterprise-software company: high revenue per user, ~80% of revenue from business customers, and a first taste of operating profit. Value it like a B2B software firm — on margins, retention, and seat growth.
OpenAI
Converging on an infrastructure and consumer-platform company: hundreds of millions of users, video, a browser, a chip programme, a continent of data centres. Value it like an infrastructure bet, where product margin is almost beside the point.
Line chart of Anthropic and OpenAI annualised revenue run rate from 2024 to 2026, with Anthropic crossing above OpenAI in early 2026 to reach about $30B against OpenAI's $24B.
Annualised run rate (one strong month × 12 — see Ledger 2). Recognised revenue is roughly half of each figure shown. The lines cross in early 2026: Anthropic overtakes on run-rate revenue even as OpenAI’s ambitions point elsewhere.

Same costume, different company

Two companies are called frontier AI labs. Both are valued in the hundreds of billions. Both train models near the edge of what is possible. And it has become the most common mistake in the room to price them the same way — because they are, increasingly, two different kinds of company wearing one job title.

Anthropic is turning into an enterprise-software business. Around 80 per cent of its revenue comes from companies, not consumers. The number of customers paying it more than a million dollars a year roughly doubled, to over a thousand, in a single eight-week stretch of 2026. The revenue is concentrated, contracted, and sticky — the texture of a company that sells software to other companies.

OpenAI is turning into something else. It has the most famous consumer product in technology — ChatGPT, with hundreds of millions of weekly users — but it has also pushed outward into video generation with Sora, into a web browser, into a custom-chip programme, and into Stargate, a data-centre build-out measured in gigawatts and hundreds of billions of dollars. Those are not the moves of a company optimising a software margin. They are the moves of a company trying to own infrastructure and consumer distribution. For a business like that, the per-product gross margin is almost a rounding error next to the size and timing of the capital bet.

The eight-times gap

The clearest single number separating the two is revenue per user. Anthropic earns on the order of $211 from each of its (far fewer, mostly business) monthly users. OpenAI earns something closer to $25 per weekly user, the great majority of whom pay nothing at all.10 That is an eight-fold difference in how hard each user is monetised, and it is the whole bifurcation in miniature: one company has a smaller number of users it monetises intensely; the other has a vast number of users it monetises lightly and is betting it can convert attention and scale into something valuable later.

Translate

“900 million weekly users” sounds like the win condition. Put it on the per-user ruler instead: at roughly $25 a user against Anthropic’s $211, OpenAI’s audience is about 35 times larger but each user is worth about one-eighth as much. Reach and revenue are not the same gauge — and reading one as the other is how the two labs get mistakenly priced alike.

The profitable quarter that proves the point

In May 2026 Anthropic told investors it expected its first-ever operating profit — around $559 million on roughly $10.9 billion of quarterly revenue — in the second quarter of the year.11 It is a real milestone. It is also, by the company’s own admission, a blip: scheduled compute costs in the back half of 2026 are expected to push it back into loss, and Anthropic does not expect a full profitable year before 2028. Ledger 3 already explained why a single green quarter is perfectly consistent with structural loss — it is one vintage maturing in the gap before the next big training bill lands. The profitable quarter does not refute the vintage problem. It illustrates it.

OpenAI’s trajectory is not a blip in either direction. Its internal projections, shared with investors, describe losses that widen for years — and that is consistent with the strategy, not a contradiction of it. If you are building infrastructure and consumer distribution at continental scale, sustained loss is the entry fee. The mistake would be to read OpenAI’s losses as the same kind of fact as Anthropic’s.

Reading the two ledgers side by side

 AnthropicOpenAI
BecomingAn enterprise-software companyAn infrastructure & consumer-platform company
Run-rate revenue~$30B (Apr 2026) reported~$24B (Apr 2026) reported
Revenue mix~80% enterpriseConsumer-led; enterprise approaching ~40%
Revenue per user~$211 / monthly user estimated~$25 / weekly user estimated
Really sellingReliable models inside business workflowsReach, a platform, and owned compute
Value it onMargin, retention, seat growthThe scale & timing of the capital bet
Key riskCapacity limits; a narrower revenue baseSustained cash burn; capital-market appetite
Basis of reportingLeans on company disclosure & friendly sourcesMore independent reporting & filings exist

These columns are not strictly like-for-like: the two companies recognise revenue and count users differently, and — as the last row notes — the public record is thinner on Anthropic. Read the table as two profiles, not a scoreboard.

Why this booklet does not crown a winner

It is tempting to end a comparison with a verdict. This one does not, and the refusal is deliberate. “Which lab is winning?” assumes they are running the same race. They are not. One is trying to become the next great enterprise-software company; the other is trying to become something closer to a utility with a consumer brand attached. Each could succeed or fail on its own terms. The useful question is not which is ahead — it is which kind of company is each, and which kind do you want as a supplier, a partner, or an investment. That question you can actually answer. A winner you cannot, and anyone who offers you one is selling a ruler they have not shown you.

Sources: TechCrunch · CNBC · Bloomberg · Sacra · SemiAnalysis · The Wall Street Journal — see Sources & Notes

Ledger 5

The Dark Factory
“Why the labs can’t stop spending”

The prize
If intelligence becomes a metered utility — tokens as the electricity of the future — then being its utility company is one of the largest prizes in business history. Every lab and hyperscaler is racing for it, and a race like that cannot pause.
The toll
The race costs far more than the foundational business throws off. So labs seize lucrative verticals — augmented coding first — both to prove the technology works and to help pay their way to the prize.

Tokens as the electricity of the future

Step back from any single lab and ask what the whole industry is actually racing toward. The clearest answer: intelligence is on its way to becoming a metered utility. Ledger 1 already showed the shape — tokens sold through a meter, like water, like power. Carry that forward. Once electricity became a utility, you stopped caring which generator produced the current in your wall; you simply paid per kilowatt-hour and built your life on the assumption it would be there. Tokens are heading the same way: a metered unit of cognitive work, always on, drawn on without a thought.

If that is the destination, then the frontier labs are not, in the end, software companies. They are utilities in the making — and the contest between them is a contest to become the utility company of intelligence, the master vendor of the electricity of the future.

The datacenter is a dark factory

Where is this utility generated? In a building that deserves its industrial name. A “dark factory” is a real term from manufacturing: a plant so fully automated that the lights can be switched off, because no human needs to be on the floor. A modern AI datacenter is exactly that — a vast hall of humming machines, drawing the power of a small city, with almost nobody inside. The difference is only in the product. A dark factory in the old economy stamped out car parts. This one manufactures intelligence itself, by the trillion tokens, and ships it down a wire.

Translate

Industry capital-expenditure estimates for 2026 land around $700 billion12 estimated. Divide it by the world’s population and it is roughly $86 for every person alive — spent, in a single year, building dark factories whose only product is machine thought, whether or not that person ever uses one. Said that way, the scale of the bet on this one idea becomes hard to look away from.

Why nobody can stop spending

This reframes the enormous, relentless capital spending the other ledgers keep brushing against. If the prize is to be the utility — not a utility — then second place is worth a small fraction of first. A land grab for the foundational layer cannot be conducted at half speed: pause, and a rival lays claim to the ground. That is why capex does not respond to losses the way an ordinary company’s would; why the vintage treadmill of Ledger 3 keeps accelerating; why “we could be profitable if we stopped” is technically true and strategically irrelevant. The labs are not behaving like businesses optimising a quarter. They are behaving like contenders in a once-only race to own an entire layer of the future economy.

Chasing verticals to fund the race

And here is the consequence that explains so much of the labs’ restless behaviour. The race to be the utility costs more than the utility business itself yet earns. No lab can fund the journey on raw token sales alone. So whenever a vertical forms — a specific, high-value use of the models that throws off serious cash — the labs go after it, hard and fast.

The clearest example so far is augmented coding. Anthropic’s Claude Code went from public launch to a $1 billion revenue run rate in six months, and past $2.5 billion within a year; OpenAI’s rival, Codex, climbed to several million weekly users in the same window.13 Coding is doing double duty for the labs: it is proof that the technology produces real economic value, and it is an engine — a vertical lucrative enough to help finance the dark factories. Expect the pattern to repeat wherever a vertical’s economics are good enough — customer support, legal work, search, scientific research. When the labs see “ridiculous revenues” forming in a vertical, they will move, because the race needs the fuel.

Utility and product at once

This resolves a contradiction that otherwise makes the labs look incoherent. Are they horizontal infrastructure — utilities selling raw intelligence to everyone — or vertical product companies, building coding tools and browsers and apps? The honest answer is that the hypercompetition forces them to be both, at the same time, by necessity. The utility is the destination. The verticals are the fuel that pays for the trip. A lab that sold only raw tokens could not afford the race; a lab that sold only coding tools would be a feature, not a foundation. So they sprint in two directions at once — and that is why “what is an AI lab, exactly?” is a question that refuses to sit still.

Verticals are one source of fuel for the race. They are not the only one. The next ledger follows the other — the capital that arrives not from customers at all, but from the labs’ own investors, moving in a circle.

Sources: Anthropic · Neowin · Futurum · SemiAnalysis — see Sources & Notes

Ledger 6

The Money Goes in a Circle
“When an investor’s dollar comes back as revenue”

The loop
A hyperscaler invests equity in a lab. The lab spends that money on the hyperscaler’s cloud. The spend is booked as the hyperscaler’s revenue. The same dollar now appears in two companies’ growth stories — as investment and as revenue.
Not a scandal
This is mostly how strategic investors have always built customers — not fraud. But it means some growth is funded by the counterparty rather than the open market, and it can re-rate the valuations of everyone in the loop at once.
A four-step ring: a hyperscaler invests equity in a lab; the lab holds the cash; the lab spends it on the hyperscaler's compute; the hyperscaler books it as revenue; and the loop closes.
The loop in its simplest form. The same dollar travels the ring — counted as an investment on the way in and as revenue on the way round. The specific deals that fill this ring are in the table below.

Follow one dollar

Amazon wires a dollar of equity investment to Anthropic. Anthropic wires that dollar to Amazon Web Services to pay for computing power. AWS books it as cloud revenue. The dollar has not vanished and nothing improper has happened — but it is now doing double duty: it is counted as Amazon’s investment and as Amazon’s revenue, and it is propping up two growth stories at once. It is the same dollar.

That little circuit, repeated at enormous scale, is the most misunderstood feature of frontier-AI finance. The honest question is not “is this a scandal” — mostly it is not — but “how much of the growth I am looking at is the open market choosing to pay, and how much is an investor’s own money making a lap?”

The deals that fill the ring

Five partnerships fill the loop. In each, an equity investment flows toward a lab and a far larger compute commitment flows back — the round trip the diagram above describes.

Partner → LabEquity invested / heldCompute committed back
Amazon → Anthropic14~$13B, up to ~$33B with milestones$100B+ on AWS, over ~10 years
Google → Anthropic15up to $40B (cash + compute)~$200B to Google Cloud, over 5 years
Microsoft → OpenAI16~$135B stake (~27%)~$250B of Azure purchases
Nvidia → OpenAI17up to $100B, released in stagesreturns largely as GPU orders
Oracle → OpenAI18(compute partner, not equity)~$300B Stargate deal, over 5 years

Figures as reported, May 2026 — see notes 14–18. Equity flows one way; a much larger compute commitment flows back.

Two of these deserve a closer look, because they are where the loop tightens hardest.

Nvidia and OpenAI. Nvidia announced an investment of up to $100 billion in OpenAI, disbursed in stages as data-centre capacity comes online — capacity OpenAI builds substantially by buying Nvidia’s own chips.17 The chip vendor funds the customer to buy the vendor’s chips. Oracle and OpenAI. Oracle is not an equity investor, but it sits on the same circle: its reported five-year, roughly $300 billion Stargate compute deal with OpenAI18 is a commitment large enough to reshape Oracle’s own financial statements.

Translate

Nvidia’s “$100 billion investment” is not a cheque in a drawer. It is released gradually, and most of it is expected to return to Nvidia as chip purchases. Strip it to the mechanism and it reads: Nvidia lends OpenAI the means to become Nvidia’s largest customer. That can be a perfectly rational way to seed demand — but “investment” and “pre-paid future order” are different things, and only one of them tells you the open market wants the product.

How the loop inflates valuations

The circle does more than move cash. It moves valuations. When Nvidia commits capital to OpenAI and OpenAI orders Nvidia chips, Nvidia’s revenue and order backlog swell — and a company with a swelling backlog is worth more, so its market value rises. When Oracle signs the Stargate deal, its contracted backlog leaps; by its own disclosures, Oracle’s remaining performance obligations jumped past half a trillion dollars, and its share price moved with the news. Each deal, in other words, re-rates the worth of more than one company on the circle at the same time.

Stack enough of these and an effect appears that no single deal intends: the whole sector looks larger, busier, and more certain than independently verifiable end-customer demand alone would justify. Valuations lift each other. The capital, the revenue, and the market value all swell together — partly on real demand, and partly because the same money has been counted, in good faith, at several stops around the ring.

The fair rebuttal

Before the caveats, the case for all this, stated properly. A strategic investor seeding a company that will become a major customer is ordinary capitalism, not a trick — telecoms, airlines, and the early cloud were all built this way. The Azure compute OpenAI buys is genuine: real models run for real users on those servers. Microsoft is not paying itself; it is selling a service a willing buyer would want regardless. And there is a sturdier point: the labs are not only spending investor money. A great deal of their revenue is now ordinary customers — enterprises and consumers with no equity stake — paying for a product they chose. The loop is real, but it is not the whole circuit.

Three caveats to carry out of this ledger

So describe the loop soberly and land on three things, no more.

One — endogenous growth. When a portion of a lab’s revenue traces back to its own investors’ capital, the growth curve is partly endogenous: funded from inside the arrangement rather than won in the open market. It is not fake. It is just not the same evidence of demand as a dollar from an unrelated customer, and a revenue chart cannot tell you the mix on its own.

Two — concentration. The loop binds a handful of firms tightly together. If one major lab falters, the damage does not stay contained — it lands on a hyperscaler’s investment line, its revenue line, and its contracted backlog at once. The circle that amplifies growth on the way up amplifies stress on the way down.

Three — opacity. Because of the gross-versus-net problem from Ledger 2, an outside reader cannot reliably size the loop. You can see that it exists and that it is large. You cannot, from the public record, say precisely how much of any headline number it accounts for. Honesty about that limit is part of reading the books well.

On the spirit of this ledger

Circular financing is being described here, not condemned. There is no allegation of wrongdoing — these arrangements are disclosed, and most have sound commercial logic. The goal is only to let you see the loop and gauge its risk, so that a growth chart shaped partly by an investor’s own capital is not mistaken for one shaped purely by open-market demand. Seeing clearly is not the same as disapproving.

Sources: CNBC · Bloomberg · TechCrunch · Amazon · Nvidia · Data Center Dynamics · AI Magazine — see Sources & Notes

Ledger 7

The Rest of the World
“Everyone not named Anthropic or OpenAI”

Mistral — the structural also-ran
Europe’s frontier hope is technically credible and roughly 75× smaller than Anthropic — with the gap widening. The binding constraint is not engineering talent. It is access to capital and compute at the scale the frontier now demands.
China — a different game
The leading Chinese labs are not losing the API-margin race. They mostly aren’t running it. Their model economics are subordinate to traffic, ecosystems, and attention — which makes “they can’t profit on inference” a category error.
~75×
How much smaller Mistral’s revenue is than Anthropic’s
€1.7B
Mistral Series C, led by ASML, Sept 2025
$5.6M
DeepSeek V3’s reported marginal training cost
<20%
Gross margin on commodity-tier models

If you only watch the two leaders, you mistake the game

Ledgers 3 to 6 examined the American duopoly. But the most instructive economics at the frontier are happening at its edges — where a well-run European lab is losing the race anyway, and where another set of labs has decided that the race the Americans are running was never the point.

Mistral: credible, and structurally behind

Mistral is France’s entrant and Europe’s best one. Its revenue grew impressively in relative terms — from roughly $16 million at the end of 2024 to an estimated $400 million annualised a year later. Set that beside Anthropic’s $30 billion and the relative success disappears: Mistral is on the order of 75 times smaller, and because the leaders are growing faster from a vastly larger base, the gap is widening, not closing.

This is not a talent problem. Mistral ships genuinely good models. It is a capital problem. Its September 2025 Series C raised €1.7 billion, led by the Dutch chip-equipment maker ASML, which became its largest shareholder; in early 2026 it arranged an $830 million debt facility to buy some 13,800 Nvidia chips for a data centre near Paris.19 Those are serious European numbers. They are also roughly two orders of magnitude below the compute commitments Ledger 6 described on the American side. Frontier-tier monetisation is gated by frontier-tier compute, and frontier-tier compute is gated by capital Mistral cannot raise at American scale.

Translate

“75 times smaller” is easy to wave past. Make it concrete: if Anthropic were a large national airline with a global network, Mistral would be a competent regional carrier. The regional carrier can be well run and worth flying — but it is not going to out-fly the network by trying harder. Different fleet, different capital base, different game.

Mistral’s response is to stop competing on the duopoly’s terms: less pure model-API rivalry, more sovereign and on-premise offerings, vertical models for regulated European industries, and the pitch that a European customer may want a European supplier for reasons that are not only technical. That is a sane strategy. It is also, plainly, the strategy of a company that has accepted it will not win the raw-scale race.

China: the model is the bait, the ecosystem is the business

The Chinese labs invite a Western misreading. DeepSeek reported training its V3 model for a marginal cost near $5.6 million — a figure that, taken alone, made the American spend look absurd.20 The honest accounting is less dramatic: that number excludes prior research, failed runs, and the cost of the hardware itself, and the all-in figure is many times higher. But chasing the exact training cost misses the larger point — which is that for the leading Chinese players, the model was never the product.

So what is? The product is the ecosystem, and the ecosystem makes money the way it always has. Alibaba’s Qwen models feed Taobao and its cloud: the money is made on a cut of every commerce transaction and on cloud contracts. ByteDance’s models feed its apps: the money is made on advertising sold against attention. Across the sector the pattern repeats — super-app engagement, payment fees, subscriptions to services that have nothing to do with an API meter. In that design a frontier-grade model given away near-free is not a failed business; it is a customer-acquisition cost for a far larger one. The model is the bait. The ecosystem is the hook, and the hook is where the revenue has always been.

Two of these companies, Zhipu and MiniMax, listed in Hong Kong in January 2026 and saw their valuations multiply within weeks.21 The public market is funding the strategy — and profitability on inference is simply not the case being made to those investors. “Chinese labs can’t make money selling tokens” is true and almost irrelevant: selling tokens was never the plan.

The commodity tier eats the floor

There is a third force, and it presses on everyone. Below the frontier sits a fast-improving commodity tier — cheap open-weight models, many of them Chinese, and discount API offerings. For tasks that do not need frontier capability, these are nearly free. SemiAnalysis’s reading is stark: margins on frontier models sit north of 70 per cent, while margins on trailing models exposed to open-source competition fall below 20.

From the buyer’s side — the subject of the companion booklet, The Token Economics — this is simply good news: cheaper tokens. From the seller’s side, the one this booklet is about, it is a slow squeeze. Every routine workload that drains from a premium API to a near-free open-weight model erodes the pricing power the whole vintage cascade of Ledger 3 depends on. The frontier labs are protected only at the genuine frontier — and only for as long as the frontier stays far enough ahead of the commodity tier to be worth paying for.

Closing the books

Seven ledgers, and a single habit holding them together. A frontier number means nothing until you know which ruler it is on, whose dollar it really is, and what kind of company — or country, or business model — produced it. The figures in this booklet will be stale within months; the way of reading them will not. When the next staggering headline arrives, do what every ledger here has done: find the unit, translate it into something a person can picture, and ask what kind of bet it really represents. That habit, not any number, is what this booklet was for.

Sources: CNBC · Mistral AI · Sacra · The Register · DeepSeek technical report · SCMP · SemiAnalysis — see Sources & Notes

Reference

Sources & Notes

Numbered notes for the load-bearing and contestable figures in this booklet. All links were retrieved in May 2026; figures are a snapshot of that date. Inline tags — reported (confirmed by a company or filing), projected (a forward estimate, often investor-facing), estimated (a third-party calculation) — indicate how solid each figure is. A few sources are paywalled or were originally leaked; these are cited by outlet without a stable free link.

Premise

  1. Anthropic’s ~$30B revenue run rate (April 2026) and the “80×” growth remark — VentureBeat, “Anthropic says it hit a $30 billion revenue run rate”: venturebeat.com. Revenue history compiled by Sacra: sacra.com/c/anthropic.

Ledger 1 — How a Lab Works

  1. Order-of-magnitude training-run costs ($100M–$1B+) draw on Dario Amodei’s public framing in the Dwarkesh Patel interview: dwarkesh.com; a cheaper-tier counterpoint is DeepSeek’s reported V3 figure — DeepSeek-V3 technical report: arxiv.org. The Frontier Labs, Inc. numbers are an explicit illustration, not a real company.

Ledger 2 — Reading the Books

  1. The disputed ~$8B gross-vs-net gap traces to a leaked April 2026 internal memo from OpenAI’s CRO Denise Dresser, arguing Anthropic books reseller revenue on a gross basis; reported by multiple outlets and summarised by Sacra: sacra.com/c/anthropic.
  2. OpenAI’s ~$20B run rate vs ~$13B recognised 2025 revenue, and the run-rate–to–GAAP relationship — CNBC on the $852B round and revenue: cnbc.com; Microsoft’s 10-Q as a window into OpenAI’s losses, Om Malik: om.co.

Ledger 3 — The Vintage Problem

  1. Dario Amodei’s per-model (“vintage”) framing, the ~2×-return toy model, the 50%-compute-on-research heuristic, and “near the end of the exponential” — Dwarkesh Patel interview, February 2026: dwarkesh.com.
  2. Inference gross margins — frontier models north of 70%, trailing models below 20%, and Anthropic’s shift from ~38% to >70% inside a year — SemiAnalysis, “AI Value Capture”: newsletter.semianalysis.com. The per-token worked example is illustrative and consistent with this range. estimated.
  3. Anthropic’s 2024 gross margin of roughly −94% on paying customers (−109% all-in) — The Information, November 2025 reporting (subscription).
  4. OpenAI’s projected ~$74B operating loss for 2028 and breakeven pushed to 2029–2030 — The Wall Street Journal, reporting on confidential investor financials, May 2026 (subscription). projected.
  5. “AI’s $600B Question” — David Cahn, Sequoia Capital: sequoiacap.com. The bear case on losses-as-business-model is associated with Ed Zitron’s “Where’s Your Ed At” newsletter.

Ledger 4 — A Tale of Two Labs

  1. Revenue-per-user estimates (~$211 for Anthropic, ~$25 for OpenAI) and the enterprise revenue mix — Sacra: sacra.com/c/anthropic. estimated.
  2. Anthropic’s projected first operating profit (~$559M on ~$10.9B revenue, Q2 2026) and the caveat that profitability may not hold through the year — TechCrunch: techcrunch.com; figures originate in WSJ reporting on investor materials. projected.

Ledger 5 — The Dark Factory

  1. 2026 hyperscaler / AI capital-expenditure estimates — widely reported in the $660–700B+ range; Futurum, “AI Capex 2026”: futurumgroup.com. estimated.
  2. Claude Code — $1B run rate in six months, past $2.5B within a year — Anthropic: anthropic.com. OpenAI’s Codex reaching several million weekly users — Neowin: neowin.net.

Ledger 6 — The Money Goes in a Circle

  1. Amazon’s ~$13B cumulative investment in Anthropic (up to ~$33B with milestones) and Anthropic’s $100B+ AWS commitment — Amazon: aboutamazon.com; Project Rainier: aboutamazon.com.
  2. Google’s up-to-$40B Anthropic investment (cash + compute) and Anthropic’s ~$200B, 5GW Google Cloud / TPU commitment — TechCrunch: techcrunch.com.
  3. Microsoft’s post-restructuring stake (~27% / ~$135B), the 20% revenue share capped at $38B, and OpenAI’s ~$250B Azure commitment — CNBC: cnbc.com; cap detail, AI Magazine: aimagazine.com.
  4. Nvidia’s up-to-$100B investment in OpenAI, disbursed progressively as ~10GW of capacity is deployed — CNBC: cnbc.com; Nvidia newsroom: nvidianews.nvidia.com.
  5. The reported ~$300B / 4.5GW five-year Oracle–OpenAI Stargate deal and Oracle’s backlog (remaining performance obligations) swelling past $500B — Data Center Dynamics: datacenterdynamics.com; Data Center Frontier: datacenterfrontier.com.

Ledger 7 — The Rest of the World

  1. Mistral’s €1.7B Series C led by ASML (largest shareholder, ~11%) at an €11.7B valuation — CNBC: cnbc.com; Mistral: mistral.ai. Mistral revenue (~$400M ARR) per Sacra. estimated.
  2. DeepSeek V3’s reported ~$5.58M marginal training cost — DeepSeek-V3 technical report: arxiv.org; the case that true all-in cost is far higher — The Register: theregister.com.
  3. The Hong Kong IPOs of Zhipu and MiniMax (January 2026) and their subsequent valuation surges — CNBC: cnbc.com; SCMP: scmp.com.

Also drawn on

Sam Altman’s “profitable on inference” remark — Axios, 14 August 2025. SemiAnalysis (notes 6 and elsewhere) is a paid research firm; its margin breakdowns are the most-cited public estimates but are not independently audited. Run-rate figures throughout are 12× a single month and are not GAAP-recognised revenue — see Ledger 2.

The Economics of the Frontier — barcik.training Publications · Snapshot May 2026 · A companion to The Token Economics