Posts tagged with inference-economics

GPU Economics · Jul 7 ·5 min read

Twenty Percent for Three Million Dollars

Three million dollars. That's the total exercise cost if both OpenAI and Meta decide to cash in every last warrant AMD handed them.

amdnvidiafinancial-engineering

GPU Economics · Jul 4 ·5 min read

Fifty-Two Weeks to Break Even

The math on GPU ownership used to be simple.

gpu-shortagesupply-chaincowos

GPU Economics · Jun 2 ·5 min read

$200 Billion Starts With a Laptop

On Monday morning, three chipmakers woke up to a problem they didn't have on Friday. Intel dropped 6%.

nvidiartx-sparkpc-market

GPU Economics · May 30 ·5 min read

Two Chips Are Cheaper Than One

Google just did something no major chip designer has tried at this scale: they designed two separate processors for a job everyone else handles with one.

google-tpuinference-economicsspecialized-silicon

Synthetic Media · May 24 ·4 min read

Fifteen Million Dollars a Day

OpenAI spent roughly thirty months building the most impressive video generation demo the industry had ever seen, then watched it bleed $15 million per day in...

video-generationsoraopenai

GPU Economics · May 23 ·5 min read

NVIDIA Paid $25 Billion to Compete With Itself

Between December 2025 and March 2026, NVIDIA wrote two checks that contradict everything the company built over the last decade.

nvidiagroqinference-economics

GPU Economics · May 21 ·5 min read

Your GPU's Most Expensive Part Isn't the GPU

Pick up an NVIDIA B200 and trace where the roughly 6,400 manufacturing cost actually goes.

hbmmemory-economicsinference-economics

GPU Economics · May 19 ·4 min read

Blackwell Got 5x Cheaper Without Changing a Transistor

NVIDIA's Blackwell B200 debuted at 0.11 per million tokens on SemiAnalysis's InferenceMAX benchmarks.

inference-economicsnvidiablackwell

GPU Economics · May 16 ·4 min read

Same Silicon, Twelve-to-One

An H100 SXM5 costs 1.03 per hour or 12.

gpu-pricingcloud-computeh100

GPU Economics · May 14 ·5 min read

AMD Hit a Million Tokens Per Second. Now What?

AMD just crossed a threshold that matters more than any spec sheet: one million tokens per second from a single cluster, verified by MLPerf.

inference-economicsamdmi355x

GPU Economics · May 12 ·4 min read

Token Prices Crashed 99%. GPU Prices Didn't.

Three years ago, OpenAI charged 30 per million input tokens for GPT-4. Today, budget-tier models go for 0.

inference-economicstoken-pricingopenai

GPU Economics · May 9 ·4 min read

AMD Raised GPU Prices 67% Because They Finally Can

A chip vendor hiking prices 67% overnight would normally send customers scrambling.

amdmi355xinference-economics

GPU Economics · May 5 ·5 min read

Google Is a Chip Vendor Now

Google buried the announcement inside a Q1 earnings call that had plenty of other headline-worthy numbers — 109.

googletpunvidia

GPU Economics · May 2 ·5 min read

What Microsoft's 750-Watt Chip Actually Saves

Scott Guthrie called Maia 200 "30 percent cheaper than any other AI silicon on the market" when he unveiled it in January.

microsoftmaia-200custom-silicon

GPU Economics · Apr 30 ·5 min read

The CPU Line Item Just Got 20% More Expensive

Somewhere between your third GPU cluster purchase order and your fifth HBM allocation call, Intel quietly raised server CPU prices again.

intelxeoninference-economics

GPU Economics · Apr 28 ·5 min read

What $5 Billion Buys in the Chiplet Economy

NVIDIA doesn't do charity — so when Jensen Huang wrote a 5 billion check for 214 million Intel shares at 23.

nvidiaintelnvlink

GPU Economics · Apr 25 ·4 min read

Three Factories Control Half the Cost of Every AI Chip

Epoch AI published a manufacturing teardown of NVIDIA's B200 last month.

hbmmemory-shortagesupply-chain

GPU Economics · Apr 23 ·6 min read

Every Chip Startup's Exit Strategy Is NVIDIA

Scroll through the investor list on SiFive's freshly closed 400 million round and you hit a name that shouldn't be there: NVIDIA.

ai-chipsventure-capitalnvidia

GPU Economics · Apr 21 ·4 min read

Why Google Needs Four Chip Vendors to Beat One

When Bloomberg reported Sunday that Google is in active talks with Marvell Technology to co-develop two new custom AI chips, Marvell stock popped and Broadcom...

googlecustom-siliconmarvell

GPU Economics · Apr 18 ·4 min read

H100s Are Appreciating Assets Now

Hardware depreciates.

gpu-rentalh100semianalysis

1 / 2 Next →