Figure AI vs Physical Intelligence

Comparison

The race to build general-purpose robotic intelligence has produced two breakout companies with fundamentally different strategies. Figure AI is building the whole stack — humanoid hardware and AI brain together — while Physical Intelligence is building the foundational AI models that could power any robot, from any manufacturer. Both have raised billions, both use vision-language-action architectures, and both believe 2026 is the year physical AI goes mainstream. But their bets on where value accrues in the robotics stack could not be more different.

Figure AI, valued at $39 billion after its 2026 Series C, has shipped its third-generation humanoid (Figure 03), opened a dedicated manufacturing facility targeting 100,000 units over four years, and demonstrated full-body autonomous tasks like unloading dishwashers and navigating kitchens with no human intervention. Physical Intelligence, now valued at roughly $5.6 billion after its $600 million Series B, has open-sourced its π0 model weights, published breakthroughs in long-horizon robot memory and online reinforcement learning, and signed deployment partnerships with hardware makers like Weave and AgiBot.

This comparison breaks down how these two approaches differ across funding, technology, go-to-market strategy, and long-term positioning — and where each company has the advantage heading into the second half of 2026.

Feature Comparison

DimensionFigure AIPhysical Intelligence
Founded2022, by Brett Adcock2024, by former Google robotics researchers
Valuation (2026)$39 billion~$5.6 billion
Total FundingOver $1 billion (Series C at $39B post-money)~$1.1 billion ($400M Series A + $600M Series B)
Business ModelVertically integrated: builds and sells humanoid robotsPlatform play: builds foundation models for other companies' robots
Core AI ArchitectureHelix dual-system: separate VLM (reasoning) + VLA (motor control) at 200Hzπ0 unified VLA with flow-matching action generation, trained across 7 robot platforms
HardwareFigure 03: 5'6", 61 kg, 20 kg payload, 5-hour battery, wireless charging, tactile sensorsNo proprietary hardware — deploys on partner robots (ALOHA, DROID, Weave, AgiBot, etc.)
Manufacturing ScaleBotQ facility: 12,000 units/year initially, targeting 100,000 over 4 yearsN/A — scales through software distribution, not manufacturing
Open SourceClosed/proprietaryπ0 and π0.5 weights and fine-tuning code open-sourced via openpi repository
Key DeploymentsBMW Spartanburg plant (90,000+ parts loaded, 30,000+ vehicles assisted)Weave laundromat (live commercial deployment), AgiBot manufacturing pilot with Longcheer Technology
Target MarketsManufacturing, logistics, and home (Figure 03 designed for domestic use)Cross-industry via partners: manufacturing, logistics, food service, laundry
Latest Breakthrough (2026)Helix 02: full-body autonomy with 4-minute uninterrupted kitchen tasksMulti-Scale Embodied Memory (MEM) enabling 10+ minute complex tasks; online RL for precision manipulation
Generalization ApproachSingle-embodiment: optimized for Figure robots specificallyCross-embodiment: one model transfers across different robot arms, grippers, and form factors

Detailed Analysis

Vertical Integration vs. Platform Play

The most consequential difference between Figure AI and Physical Intelligence is where each company draws its system boundary. Figure builds the entire robot — chassis, actuators, sensors, hands, and the AI that controls them — optimizing every layer of the stack together. Physical Intelligence builds only the intelligence layer, designing it to be robot-agnostic from the start.

Figure's approach mirrors Apple's playbook: tight hardware-software integration enables capabilities that general-purpose solutions struggle to match. Their Helix system's dual VLM/VLA architecture is co-designed with Figure 03's sensor suite, from the palm-embedded cameras to the tactile sensors that detect forces as small as three grams. Physical Intelligence's approach mirrors Android or, more precisely, the way OpenAI's GPT models became a general-purpose reasoning layer adopted across thousands of applications. If π0 becomes the default foundation model for robotic control, Physical Intelligence captures value across the entire ecosystem.

The risk for Figure is building hardware in a market where Chinese manufacturers like Unitree already ship roughly 36 times more units. The risk for Physical Intelligence is that robot intelligence may be too tightly coupled to specific hardware to truly generalize — that the "write once, run anywhere" promise breaks down when physics is involved.

AI Architecture: Dual-System vs. Unified Model

Figure's Helix uses a deliberately split architecture: a vision-language model handles scene understanding and task reasoning, while a separate vision-language-action model generates motor commands at 200Hz. This separation lets Figure update its reasoning capabilities without retraining its motor control, and vice versa — a practical engineering advantage when deploying to production robots that need both intelligence and real-time responsiveness.

Physical Intelligence's π0 takes the opposite bet: a single unified vision-language-action model that processes camera input, language instructions, and action generation in one forward pass. The flow-matching architecture lets π0 generate smooth, continuous action trajectories rather than discrete motor commands. Their March 2026 work on Multi-Scale Embodied Memory (MEM) adds both long-term and short-term memory to the model, enabling complex tasks exceeding ten minutes — addressing a key limitation of earlier VLA approaches.

Both architectures have demonstrated impressive results. Helix 02 completed a four-minute autonomous dishwasher task with no resets. π0 has demonstrated zero-shot generalization across robot platforms it was never explicitly trained on. The question is whether the future favors specialized, tightly-coupled systems or general-purpose models that improve with scale.

Data Strategy and the Scaling Hypothesis

Both companies are betting heavily on the foundation model scaling hypothesis — the idea that more data and compute will yield increasingly capable robot intelligence — but they are collecting data in very different ways.

Figure operates its own teleoperation fleet where human operators remote-control Figure 02 and Figure 03 robots to generate manipulation data at scale. This data is proprietary, specific to Figure's hardware, and tightly quality-controlled. Physical Intelligence collects data across multiple robot embodiments from multiple partners, combines it with internet-scale pre-training, and has open-sourced its model to encourage community contributions via the openpi repository on GitHub.

Physical Intelligence's cross-embodiment approach potentially gives it access to a much larger and more diverse training set. Their RECAP training pipeline — which combines demonstration, coaching through corrections, and autonomous experience — doubled throughput on precision tasks in their v0.6 release. Figure's single-embodiment data may be narrower but deeper, with every demonstration perfectly matched to the robot that will execute the task.

Commercial Traction and Go-to-Market

Figure AI has the more visible commercial deployments. Their 11-month pilot at BMW's Spartanburg plant — loading 90,000+ automotive parts and assisting production of 30,000+ BMW X3 vehicles — is one of the most substantial humanoid robot deployments in industry history. The Figure 03, designed for home use at a target price of $20,000, signals Figure's ambition to move beyond industrial settings, though consumer availability is not expected until late 2026 at earliest.

Physical Intelligence's commercial traction is earlier-stage but potentially broader. Partner Weave has robots performing live deployments in a San Francisco laundromat. AgiBot has deployed Physical Intelligence's reinforcement learning system in a manufacturing pilot. Because Physical Intelligence doesn't need to manufacture or ship hardware, it can scale deployment through software updates to existing partner robots — a fundamentally different growth curve.

The BotQ manufacturing facility gives Figure a concrete production roadmap: 12,000 humanoids per year initially, scaling to 100,000 over four years. Physical Intelligence's scaling is measured in model downloads and partner integrations rather than units shipped.

Open Source and Ecosystem Strategy

Physical Intelligence made a significant strategic move by open-sourcing its π0 and π0.5 model weights and fine-tuning code. This mirrors the playbook of companies like Meta with Llama — building ecosystem adoption and developer familiarity that can later be monetized through enterprise offerings and premium model access. The openpi repository includes checkpoints for multiple robot platforms, example inference code, and fine-tuning pipelines.

Figure AI keeps its Helix system entirely proprietary. This makes sense for a vertically integrated hardware company — the AI is the differentiation that justifies the hardware price — but it limits Figure's ability to build a broader ecosystem. If the robotics market evolves like the smartphone market, the question is whether Figure is building the iPhone (premium, integrated, profitable) or whether Physical Intelligence is building Android (ubiquitous, platform-dominant, ecosystem-defining).

Funding, Valuation, and Investor Thesis

Figure AI's $39 billion valuation is roughly 7x Physical Intelligence's ~$5.6 billion valuation, reflecting investor conviction in the vertically integrated approach and the sheer scale of the addressable market for humanoid robots. Figure's backers — Microsoft, NVIDIA, OpenAI, Intel, and Jeff Bezos — read like a who's-who of the agentic AI ecosystem.

Physical Intelligence's funding trajectory is remarkable in its own right: from founding in 2024 to $1.1 billion raised in under two years. The platform thesis — that Physical Intelligence could become the "intelligence layer" for all robots — justifies a venture-scale bet even at a lower valuation. If the robotics market fragments across many hardware makers (as seems likely given Chinese manufacturing advantages), the software platform that powers them all could be more valuable than any single hardware company.

Best For

Automotive Manufacturing

Figure AI

Figure has a proven 11-month deployment at BMW with measurable production impact. Their vertically integrated hardware-software stack is purpose-built for repetitive, high-precision factory tasks with consistent environments.

Multi-Robot Fleet with Mixed Hardware

Physical Intelligence

If your operation uses robots from multiple manufacturers, Physical Intelligence's cross-embodiment π0 model is the only option that provides a unified intelligence layer across different hardware platforms.

Home Assistance

Figure AI

Figure 03 was explicitly designed for domestic environments — soft-goods exterior, wireless charging, voice interaction, and household task demonstrations. Physical Intelligence has no consumer-facing product.

Robotics R&D and Prototyping

Physical Intelligence

The open-sourced π0 model with fine-tuning code gives researchers and startups a state-of-the-art VLA model they can adapt to their own robots and tasks without building foundation models from scratch.

Warehouse Logistics

Figure AI

Figure's combination of bipedal navigation, dexterous manipulation, and 20 kg payload capacity makes their humanoid well-suited for warehouse environments designed for human workers. Their BMW deployment validates real-world logistics capability.

Scaling Intelligence Across Existing Robot Fleets

Physical Intelligence

Companies that already own robot hardware but need smarter control software should look to Physical Intelligence. Their platform model means upgrading robot intelligence without replacing hardware.

Precision Assembly Tasks

Tie

Both companies have demonstrated fine manipulation capability. Figure's 3-gram tactile sensors excel at hardware-level precision; Physical Intelligence's online RL pipeline rapidly adapts to new precision tasks with minimal data.

Service Industry (Food, Laundry, Hospitality)

Physical Intelligence

Physical Intelligence's partner Weave is already operating in a live San Francisco laundromat. The platform model lets service-industry robot makers integrate frontier AI without building their own models.

The Bottom Line

Figure AI and Physical Intelligence are not really competitors — they are building different layers of the robotics stack, and the success of one does not preclude the success of the other. Figure AI is the stronger choice if you need a complete humanoid robot solution deployed today, particularly for manufacturing, logistics, or (eventually) home use. Their BMW deployment, BotQ manufacturing facility, and Figure 03 hardware represent the most commercially mature humanoid robot program outside of Tesla. The $39 billion valuation is steep, but it reflects a genuine lead in real-world deployment.

Physical Intelligence is the better bet if you believe the robotics market will be fragmented across many hardware makers and that the intelligence layer will be the most valuable piece of the stack. Their open-source strategy, cross-embodiment generalization, and rapid research cadence — three major publications in the first three months of 2026 alone — position π0 as the leading candidate for a "foundation model for robots" in the same way GPT-4 became the foundation model for language applications. For robotics startups, researchers, and companies with existing robot fleets, Physical Intelligence offers capability that would otherwise require tens of millions in AI research investment.

The most likely outcome is that both approaches coexist: Figure AI as a premium, vertically integrated humanoid maker (the Boston Dynamics successor with real AI), and Physical Intelligence as the ubiquitous intelligence platform that powers the long tail of robotics hardware. If forced to pick which company has more structural upside over the next five years, Physical Intelligence's platform economics and open ecosystem give it the wider moat — but Figure AI will likely generate meaningful revenue first.