Huang's Law vs Moore's Law

Comparison

Huang's Law and Moore's Law are the two exponential curves that define modern computing — but they measure fundamentally different things, operate on different timescales, and carry different implications for the economics of intelligence. Moore's Law tracked the relentless miniaturization of transistors for five decades, creating the digital age. Huang's Law tracks the far steeper performance curve of AI-optimized GPUs, driven not just by smaller transistors but by architectural innovation, specialized silicon, and software co-optimization. Understanding how these two laws relate — where they overlap, where they diverge, and which one matters more for the future — is essential to understanding why the cost of intelligence is collapsing and what that collapse will produce.

Feature Comparison

DimensionHuang's LawMoore's Law
Named AfterJensen Huang, CEO of NVIDIA (articulated at GTC 2018)Gordon Moore, co-founder of Intel (1965 paper)
What It MeasuresGPU performance for AI workloads (inference and training throughput)Transistor density on integrated circuits (transistors per unit area)
Doubling RateRoughly every 1–2 years (NVIDIA claims ~1,000× per decade; independent measurement by EpochAI suggests ~2.5 years for FLOPS/$)Every ~2 years historically; has slowed to ~2.5–3 years since the mid-2010s
Primary MechanismArchitecture innovation: tensor cores, sparsity, numerical formats (FP8, FP4), memory bandwidth, NVLink/NVSwitch interconnects, CUDA/TensorRT software stackTransistor miniaturization via photolithography advances (deep UV → EUV → high-NA EUV)
Current Status (2026)Actively accelerating: Blackwell (2024) → Blackwell Ultra/GB300 (2025) → Rubin (2026) → Rubin Ultra (2027), each delivering 2–4× generational leapsSlowing but not dead: TSMC N2 in mass production, Intel 18A ramping, but gains are harder won and more expensive per node
Performance ScopeNarrow: AI and parallel compute workloads (matrix multiplication, neural network inference/training)Broad: all general-purpose computing, from CPUs to memory to embedded systems
Cost TrajectoryCost per unit of AI inference dropping ~10× every 2–3 years; total system cost rising (GB200 NVL72 racks cost $2–3M+)Cost per transistor declining, but cost per leading-edge wafer rising exponentially ($20K+ for 2nm wafers)
Limiting FactorsPower delivery and cooling at rack scale (70 kW+ per GPU), memory bandwidth walls, interconnect latencyQuantum tunneling at atomic scales, Dennard Scaling breakdown (2006), extreme lithography costs
Economic ImpactDirectly drives the cost of intelligence downward, fueling Jevons' Paradox in AI demandCreated the economic foundation for PCs, internet, mobile, and cloud — the infrastructure AI now runs on
Relationship to SoftwareTightly co-optimized: hardware and software (CUDA, TensorRT, compiler optimizations) evolve together to extract maximum throughputLoosely coupled: software generally benefited passively from faster hardware without requiring co-design
Industry DependencyConcentrated: NVIDIA dominates, with AMD and custom silicon (Google TPUs, Amazon Trainium) competingDistributed: Intel, TSMC, Samsung, and hundreds of fabless design firms all participated

Detailed Analysis

Different Engines, Different Eras

Moore's Law and Huang's Law are often presented as sequential — Moore's Law carried computing for fifty years, and now Huang's Law picks up where it left off. The reality is more nuanced. Moore's Law was an observation about manufacturing: as lithography improved, you could etch more transistors per square millimeter, and each generation delivered more compute essentially for free. It was a rising tide that lifted all boats — CPUs, GPUs, memory, embedded processors, and smartphones all rode the same curve.

Huang's Law is an observation about specialization. GPU performance gains come from designing silicon specifically for the mathematical operations that AI requires — matrix multiplications, convolutions, attention mechanisms — and then co-optimizing the software stack to exploit every available transistor for that narrow purpose. This means Huang's Law doesn't replace Moore's Law; it rides on top of it while adding architectural and software multipliers that general-purpose CPUs cannot match.

The Numbers: Claimed vs. Measured

NVIDIA's marketing claims are dramatic: 1,000× improvement in AI inference over a decade, and Jensen Huang stated at CES 2025 that the company's systems are progressing at "hyper Moore's Law" rates. The jump from A100 (2020) to H100 (2022) to Blackwell B200 (2024) did deliver roughly 30× improvement in large language model inference throughput. Looking ahead, NVIDIA's roadmap shows Rubin NVL144 (2026) offering 3.6 exaFLOPS of dense FP4 compute versus Blackwell Ultra's 1.1 exaFLOPS — a 3.3× improvement in a single generation.

Independent measurement tells a more moderate story. Research by EpochAI found that GPU price-performance (FLOPS per dollar) has doubled approximately every 2.5 years between 2006 and 2025 — faster than Moore's Law's recent pace but well short of the "doubling every year" narrative. The gap reflects the difference between measuring raw peak throughput (which benefits from format changes like FP4) versus measuring useful work per dollar across real workloads. Both framings matter: peak throughput determines what's possible; cost-performance determines what's economical.

Why the Divergence Matters for AI Economics

The practical consequence of Huang's Law outpacing Moore's Law is that the cost of AI inference is falling far faster than the cost of general-purpose compute. This creates the conditions for Jevons' Paradox to operate at full force: as each GPU generation halves or thirds the cost of running a large language model query, total demand for AI inference doesn't merely double — it explodes. Enterprise adoption accelerates, new use cases become viable, and agents that were too expensive to run continuously become economically rational.

This flywheel connects directly to Wright's Law: as NVIDIA and TSMC produce more GPUs and wafers, manufacturing learning curves drive unit costs down further, which stimulates more demand, which finances the next round of R&D. The result is a self-reinforcing cycle where Huang's Law (performance), Wright's Law (cost), and Jevons' Paradox (demand) compound each other — a dynamic that has no precedent in the Moore's Law era because CPU performance gains were never steep enough to trigger this kind of demand explosion.

The Manufacturing Foundation Still Matters

Huang's Law depends on Moore's Law continuing in some form. NVIDIA's Rubin GPUs (2026) will be manufactured on TSMC's 3nm process; future generations will move to 2nm. Each process node improvement delivers 10–15% more transistors per unit area and meaningful power efficiency gains — improvements that Huang's Law then multiplies through architectural innovation. If transistor scaling stopped entirely, Huang's Law would slow dramatically, though it wouldn't stop: architectural innovation, numerical format changes, and software optimization can still deliver meaningful gains without smaller transistors.

The economics of advanced manufacturing increasingly favor this co-evolution. TSMC's 2nm node entered mass production in late 2025 with capacity already fully booked for 2026, and Intel's 18A process is ramping at competitive yields of 55–70%. But leading-edge wafer costs now exceed $20,000, which means only products with extremely high value per transistor — like AI accelerators selling for $30,000–$40,000 per chip — can justify the investment. Moore's Law's economics now serve Huang's Law's market.

Concentration Risk vs. Distributed Innovation

One critical difference between the two laws is market structure. Moore's Law was a broadly shared phenomenon: hundreds of chip companies rode the same lithography improvements, and competition kept prices low and innovation distributed. Huang's Law, by contrast, is overwhelmingly concentrated in a single company. NVIDIA controls roughly 80–90% of the AI accelerator market, and the CUDA software ecosystem creates powerful lock-in. AMD's MI300X and Google's TPU v5 offer competitive hardware, but neither has matched NVIDIA's full-stack integration of hardware, software, networking, and developer ecosystem.

This concentration creates both opportunity and fragility. The opportunity is that a single company with massive R&D budgets ($12B+ annually) and a clear roadmap can push the performance curve faster than a fragmented market. The fragility is that any disruption to NVIDIA's execution — supply chain issues, architectural missteps, or a breakthrough from competitors — could slow the pace of improvement that the entire AI industry now depends on.

The Road Ahead: 2026–2030

NVIDIA's published roadmap extends through 2028: Blackwell Ultra (2025), Rubin (2026), Rubin Ultra (2027), and Feynman (2028), with each generation targeting 2–4× improvement in AI workload performance. If this cadence holds, total inference performance per dollar will improve roughly 100× between 2024 and 2028 — a pace that would make today's most expensive AI applications trivially cheap.

Moore's Law, meanwhile, has a clear if slower path forward. Gate-all-around (GAA) nanosheet transistors at 2nm, backside power delivery at sub-2nm nodes, and 3D chip stacking (TSMC SoIC, Intel Foveros) will continue delivering transistor density improvements through the end of the decade, albeit at 18–24 month cadences rather than the original 24-month doubling. The two laws will continue to co-evolve: Moore's Law providing the transistor budget, Huang's Law spending that budget far more efficiently than general-purpose designs ever could.

Best For

Forecasting AI Infrastructure Costs

Huang's Law

Huang's Law is the directly relevant curve for projecting how much it will cost to run AI inference and training workloads over the next 3–5 years. Moore's Law understates the pace of AI-specific improvement.

Moore's Law

For CPUs, memory, storage, and embedded systems, Moore's Law (and its derivatives) remains the relevant framework. Not everything in computing is an AI workload.

Predicting the Cost of Intelligence

Huang's Law

The cost per token, per inference call, or per agent-hour is driven by GPU performance curves. Huang's Law, combined with Jevons' Paradox, is the right lens for modeling how cheap intelligence will become.

Semiconductor Investment Analysis

Both

Moore's Law governs the foundry economics (TSMC, Intel, Samsung process nodes); Huang's Law governs the design economics (what NVIDIA, AMD, and Google build on those nodes). Both curves matter for investment thesis construction.

Enterprise AI Deployment Planning

Huang's Law

Enterprises deciding when to deploy AI at scale should track GPU price-performance curves. Waiting one generation often delivers 2–3× better economics — a dynamic Huang's Law quantifies directly.

Historical Technology Analysis

Moore's Law

For understanding how the PC, internet, and mobile revolutions happened, Moore's Law is the foundational framework. It explains the five decades of exponential improvement that created the digital economy.

Building an AI Hardware Startup

Both

You need Moore's Law to understand what TSMC and Intel can manufacture, and Huang's Law to understand the performance bar you have to clear. Competing with NVIDIA means matching their full-stack optimization, not just their transistor count.

Modeling the Agentic Economy

Huang's Law

The agentic economy runs on AI inference. Huang's Law is the supply curve that determines how many agents can operate economically, how complex their reasoning can be, and how fast the market for AI-powered services expands.

The Bottom Line

Moore's Law built the world that Huang's Law is now transforming. Moore's Law is the foundation — without five decades of transistor scaling, there would be no GPUs, no CUDA, no deep learning revolution. But for anyone focused on the future of AI, the agentic economy, and the cost of intelligence, Huang's Law is the curve that matters most. It's steeper, it's driven by architectural specialization rather than brute-force miniaturization, and it's the engine behind the fastest cost collapse in the history of computing. The two laws are not competitors — they're layers in the same stack, with Moore's Law providing the silicon substrate and Huang's Law extracting exponentially more useful AI computation from each generation of that substrate. Track both, but bet on the one that's accelerating.