The GPU Theater

Abstract

Between 2023 and 2025, Jensen Huang delivered statements projecting AI infrastructure investment exceeding $1 trillion annually, accompanied by reports of GPUs sold out quarters in advance. Industry analyses (including estimates from SemiAnalysis, Epoch AI, and cloud infrastructure analysts) of actual GPU utilization rates at major hyperscalers reported figures in the 30–40% range for H100s during the same period — meaning 60–70% of purchased compute sat idle or underutilized. This paper examines the GPU Theater as a case study in Demand Fabrication: the production of reported demand through narrative expectations that make the demand real enough to report, while underlying utilization tells a different story. The paper distinguishes Demand Fabrication from fraud — no false statements are required — and names the structural loop that makes it self-sustaining.

The Trillion-Dollar Statement

Between 2023 and 2025, Nvidia CEO Jensen Huang made a series of public statements projecting that the AI industry would require infrastructure investment exceeding one trillion dollars. The projection was delivered at keynotes, earnings calls, and industry conferences — broadcast simultaneously to financial media, institutional investors, and the retail market. It was not a private assessment shared with preferred counterparties. It was a public statement, designed to be heard by everyone, and it was heard by everyone.

The structural observation is not that the statement was false. It may prove accurate. The observation is that the statement originated from the CEO of the company that would be the primary beneficiary of the investment it predicted. Nvidia designs and sells the GPUs that constitute the majority of AI infrastructure spending. A projection of trillion-dollar demand for GPUs, issued by the dominant GPU supplier, is not a neutral forecast. It is a market signal whose fulfillment enriches the entity that produced it. The prediction and the financial interest in the prediction's adoption are held by the same party.

The market treated the statement as informational rather than interested. Nvidia's market capitalization moved from approximately $300 billion in early 2023 to peaks exceeding $3.5 trillion by mid-2025 (per NASDAQ market data) — a swing of more than $2 trillion driven substantially by AI narrative cycles. The trillion-dollar infrastructure projection became the foundational assumption underwriting these valuations. Analysts cited it. Hyperscalers referenced it to justify their own capital expenditure programs. Enterprise CTOs used it in board presentations to argue for GPU procurement budgets. The statement did not merely describe anticipated demand. It organized the capital allocation decisions that would generate the demand it described.

This is the entry point to the GPU Theater: a market in which the primary narrator of future demand is also the primary financial beneficiary of the demand the narration produces. No deception is required. The narrator believes the projection. The audience acts on the projection. The actions fulfill the projection. The fulfilled projection validates the narrator's credibility for the next cycle's projection. The structure is self-sustaining as long as the audience continues to act on the narrative — and the audience continues to act on the narrative because the previous cycle's results confirm it was correct to do so.

The Utilization Gap

The GPU Theater's structural claim rests on a measurable discrepancy: the gap between GPU capacity purchased and GPU capacity utilized. During the same period in which trillion-dollar infrastructure projections drove record GPU procurement, reported GPU utilization rates across major cloud providers and AI companies consistently fell in the range of 30 to 40 percent, according to industry analysts including SemiAnalysis and Epoch AI. The figures varied by provider, workload type, and measurement methodology, but the central finding was stable across these analyses: the majority of purchased GPU compute capacity sat idle or underutilized at any given time.

A 30 to 40 percent utilization rate means that 60 to 70 percent of purchased GPU capacity is not performing productive work at the moment of measurement. This is not unusual for compute infrastructure generally — enterprise server utilization has historically run below 50 percent. But the GPU market's procurement behavior is not driven by historical utilization norms. It is driven by narrative expectations about future AI workloads that have not yet materialized at the scale the procurement anticipates. Companies purchase H100 clusters not because their current AI workloads require them, but because the narrative environment has established that companies without GPU capacity will be competitively disadvantaged when the anticipated workloads arrive.

The utilization gap is the structural evidence that distinguishes narrative-driven demand from utilization-driven demand. In a utilization-driven market, procurement follows demonstrated need: capacity is purchased when existing capacity is substantially consumed. In a narrative-driven market, procurement follows anticipated need: capacity is purchased based on expectations about future requirements, where those expectations are set by the same entities that sell the capacity. The gap between the two — between what is purchased and what is used — is the measurable footprint of the Demand Fabrication loop.

Fear of scarcity accelerates the dynamic. When GPUs are reported as sold out quarters in advance, procurement decisions shift from "do we need this capacity now" to "can we afford to not have this capacity when we need it." The sold-out reports are accurate — the GPUs are genuinely sold. But they are sold into a market where the buying pressure is generated by the narrative of scarcity, which is generated by the buying pressure, which is generated by the narrative. The utilization gap persists because the purchases are driven by anticipated future need, and the anticipated future need is set by projections from the primary beneficiary of the purchasing behavior.

III

The Self-Referential Loop

The Demand Fabrication loop is self-referential at every stage. Stage one: a credible narrator produces a narrative about future demand — "AI will transform every industry, and the transformation requires massive GPU infrastructure." Stage two: the narrative creates purchase behavior, as companies buy GPU capacity to avoid being left behind in the projected transformation. Stage three: the purchase behavior generates reported metrics — sold-out quarters, months-long waitlists, record revenue figures for GPU suppliers. Stage four: the reported metrics validate the original narrative, because the metrics are real. The GPUs are genuinely sold out. The revenue is genuinely record-breaking. The demand is genuinely reported. Stage five: the validated narrative drives further purchase behavior, and the loop iterates.

The critical structural feature is that no false statement is required at any stage. The narrator's projection may be genuinely believed. The purchase behavior is a rational response to the projection, given the competitive risk of inaction. The reported metrics accurately describe real transactions. The validation of the narrative by the metrics is logically sound — if GPUs are sold out, demand is real. The further purchase behavior that follows validation is rational for the same reasons the initial purchase behavior was rational. Each individual step is defensible. The loop's self-referential character is visible only when the stages are examined as a system rather than as independent events.

The loop is structurally analogous to demand cycles documented in other asset classes. The housing bubble of 2003 to 2007 followed an identical pattern: narrative ("housing prices always go up") produced purchase behavior, purchase behavior produced price increases, price increases validated the narrative, validated narrative produced further purchase behavior. The dot-com bubble of 1997 to 2000 followed it: narrative ("the internet changes everything") produced capital allocation, capital allocation produced revenue growth at internet companies, revenue growth validated the narrative. In each case, the loop operated without fraud. The houses were real. The internet companies had real revenue. The GPUs are really sold out. The question is not whether the reported demand is real. It is whether the reported demand would exist at its current scale without the narrative loop that produces it.

The GPU Theater is distinguished from prior cycles by the concentration of narrative authority. The housing bubble's narrative was distributed across thousands of real estate agents, mortgage brokers, and financial commentators. The GPU cycle's narrative is concentrated in a small number of platform authority holders — primarily the CEOs of Nvidia, the major hyperscalers, and the leading AI companies — whose financial interests are directly served by the narrative they produce. This concentration makes the loop more efficient and the self-referential structure more visible to structural analysis, even as it remains invisible to participants inside the loop who experience each stage as an independent, rational response to real market conditions.

The Distributional Outcome

Every demand fabrication cycle produces a characteristic distributional outcome: early position holders exit into the liquidity created by late entrants. In the GPU Theater, early position holders include Nvidia insiders who held equity before the AI narrative cycle began, early investors in AI infrastructure companies, and the first wave of hyperscalers whose GPU procurement secured favorable pricing and allocation priority. Late entrants include enterprise companies purchasing GPUs at peak pricing to build AI capabilities they have not yet defined, venture-funded startups whose business plans assume continued GPU scarcity, and retail investors who purchased Nvidia equity after the narrative had already been priced into the market capitalization.

The wealth transfer between these groups is invisible during the expansion phase of the loop. While the narrative is validated by each cycle's metrics, every participant appears to be benefiting: early holders see their positions appreciate, late entrants acquire assets whose value appears to be rising, and the market as a whole reports record activity. The transfer becomes visible only when the utilization gap becomes undeniable — when the discrepancy between purchased capacity and productive use reaches a scale that narrative repetition cannot obscure.

This is the characteristic signature of demand fabrication cycles across every market this corpus documents. The expansion phase is collectively affirming. The contraction phase is distributionally revealing. The parties who established positions before the narrative reached saturation exit with realized gains. The parties who established positions in response to the saturated narrative hold capacity whose value depends on utilization rates that have not yet justified the purchase price. The GPU Theater's version of this dynamic involves physical hardware with rapid depreciation curves: an H100 purchased at peak pricing in 2024 faces both the utilization gap and the technological obsolescence introduced by next-generation architectures, compressing the window in which the purchase can generate returns.

The GPU Theater is the Narrative Market's most capital-intensive specimen. Prior demand fabrication cycles documented in this corpus — cryptocurrency narrative cycles, meme stock episodes, SPAC proliferation — involved billions of dollars. The GPU Theater involves trillions. The infrastructure projections that anchor the narrative are denominated in trillions. The market capitalizations that respond to the narrative moved by trillions. The capital expenditure programs justified by the narrative are measured in hundreds of billions across the major hyperscalers alone. The scale does not change the structure. It changes the magnitude of the distributional outcome when the cycle completes.

The Structural Observation

The GPU Theater does not mean that artificial intelligence is not real, that GPUs are not useful, or that infrastructure investment is not necessary. AI workloads genuinely require GPU compute. Large language models genuinely require massive training infrastructure. The companies purchasing GPUs are not irrational, and the engineers deploying them are solving real problems. The structural observation is narrower and more specific: the scale of investment is driven by narrative expectations that exceed current utilization by a documented margin, and the primary narrator of those expectations is also the primary financial beneficiary of the investment they produce.

The gap between narrative-driven investment and utilization-justified investment is the premium extracted by the Demand Fabrication loop. If GPU procurement were driven entirely by current utilization rates and documented near-term demand, the investment level would be some fraction of its current scale — the fraction corresponding to the 30 to 40 percent of capacity that is actually utilized. The remainder — the 60 to 70 percent that sits idle — represents capacity purchased on the basis of narrative expectations rather than operational requirements. This is the Demand Fabrication premium: the difference between what utilization would justify and what narrative produces.

This premium is not fraud. No law is broken in its production. The CEO who projects trillion-dollar demand is exercising protected speech. The companies that purchase GPUs in response are making voluntary procurement decisions. The investors who bid up GPU supplier equity are responding to reported metrics that accurately describe real transactions. The premium is the structural product of a market in which the primary narrator is also the primary beneficiary — a configuration that does not violate any existing regulatory framework because the framework was not designed for a market in which narrative authority and financial interest are held by the same party at this scale.

The connection to NM-001 is direct. The Platform Authority Premium, named in that paper, describes the financial value embedded in an individual's platform authority — the capacity to move markets through the act of speaking. Jensen Huang's platform authority is the enabling condition for the GPU Theater's Demand Fabrication loop. Without a narrator whose projections are treated as authoritative by the market, the self-referential loop cannot initiate. Without the loop, procurement tracks utilization rather than narrative. Without narrative-driven procurement, the utilization gap does not open, the sold-out reports do not materialize, and the validation stage that sustains the loop does not occur. The Platform Authority Premium is the mechanism. The Demand Fabrication is the loop it enables. The GPU Theater is where the loop operates at its largest documented scale.

The Trillion-Dollar Statement

The Utilization Gap

The Self-Referential Loop

The Distributional Outcome

The Structural Observation

References