The Foundation Model Arms Race: Shipping Reality for RT-2, Groot, and Physical Intelligence
Introduction: The Shift from Code to Policy
The robotics industry is undergoing a fundamental shift in its control architecture. For decades, the standard operating procedure involved hard-coding task-specific scripts into robots—a rigid approach that failed to scale beyond structured factory environments. Today, the leading contenders are pivoting toward Robotics Foundation Models. These large-scale AI systems are trained on vast datasets of human interaction to generalize behaviors, moving robots from scripted execution to adaptive decision-making.
However, in an ecosystem prone to hype, RobotWale maintains a strict grading hierarchy: shipping hardware takes precedence, followed by pilot deployments, with announcements ranked last. This article analyzes the current state of the race between Google's RT-2, Tesla's Groot, and Physical Intelligence (often colloquially referenced as 'Pi' in industry circles), focusing on verifiable technical data, deployment status, and the specific implications for the Indian market.
Google RT-2: Vision-Language-Action Models
Google DeepMind's RT-2 (Robotic Transformer 2) represents one of the most mature research frameworks in the field. Unlike traditional reinforcement learning approaches that require millions of simulated trials, RT-2 treats robot control as a language problem. It takes image inputs and natural language instructions as tokens, generating action tokens that dictate robot movements.
Technical Validation: According to Google DeepMind's official technical report, RT-2 was trained on a combination of robot interaction data and internet-scale datasets. This allows the model to transfer knowledge from the web to physical robots. For example, if a robot is instructed to 'pick up the red cup,' the model can generalize this from web images to a physical object in a novel environment.
Deployment Status: As of late 2023 and early 2024, RT-2 remains largely in the research and development phase. Google has not released a general-purpose API for external developers to purchase. Deployments are restricted to specific research partners and internal testing environments. While the architecture is powerful, there is no public 'shipping hardware' component that includes the RT-2 model as a standalone purchase.
Relevance to India: For Indian manufacturers and system integrators, RT-2 is currently inaccessible as a commercial product. It is not available via cloud API for general robotics deployment in India. The nearest approximation is Google's broader robotics research initiatives, which may offer limited access through partnerships, but no direct INR pricing or import channels exist for this specific model.
Tesla Groot: Video-to-Action at Scale
Tesla's Groot foundation model is designed specifically for the humanoid Optimus robot. The core innovation lies in its ability to learn from video demonstrations. Instead of requiring explicit programming, the model observes human video data and infers the corresponding control signals.
Technical Validation: Tesla's engineering updates highlight that Groot is trained on data collected from the Optimus fleet. This 'sim-to-real' transfer is critical for safety. The model processes video frames to predict action sequences, effectively allowing the robot to 'watch and learn.' This approach reduces the reliance on manual teleoperation for new tasks.
Deployment Status: Tesla's approach is anchored in shipping hardware. The Optimus robot is a physical unit that is currently being manufactured. While Elon Musk has announced timelines for production, the actual volume of 'shipping hardware' running Groot is still in the pilot deployment phase. The model is not a standalone software package; it is proprietary to the Optimus hardware ecosystem.
India Availability: Currently, the Optimus robot is not available for sale in India. There is no official distributor, and the landed cost (INR) cannot be calculated as the base hardware price remains in the 'engineering prototype' or 'early pilot' category. Estimates suggest a future unit price around $20,000 to $30,000 USD, which would translate to approximately ₹16-24 Lakhs INR (excluding import duties). Until Tesla establishes a localized supply chain in India, this remains a speculative figure.
Physical Intelligence (Pi): The Third Pillar
The term 'Pi' in this context generally refers to Physical Intelligence, the company behind the Figure 01 humanoid robot. In industry discussions, 'Pi' is often used as shorthand for their underlying foundation model architecture, which focuses on open-ended task handling.
Technical Validation: Physical Intelligence has demonstrated its model through the Figure 01 robot. The system is designed to handle complex manipulation tasks using a combination of vision and language. Unlike RT-2, which emphasizes language tokens, Figure's architecture prioritizes real-time video processing for dexterity.
Deployment Status: Physical Intelligence has secured significant partnerships, including a collaboration with BMW. This indicates that pilot deployments are moving beyond the lab. The Figure 01 has been shown performing assembly tasks in controlled environments. However, the model is not sold as a standalone software solution.
India Availability: There is no direct availability for the Figure 01 or its underlying foundation model in India. The company focuses on North American and European enterprise clients. For Indian enterprises interested in this technology, the cost is likely to be enterprise-grade, potentially exceeding ₹50 Lakhs INR for a complete deployment package, including hardware and licensing.
The General Policy Challenge
The race to a general policy is not merely about intelligence; it is about safety and reliability. Foundation models introduce risks such as hallucinations—where a robot might attempt an impossible physical action based on a misinterpreted instruction. This is why hardware validation is the primary metric for success.
Key Differentiators:
- Google RT-2: Strongest in semantic understanding (Language).
- Tesla Groot: Strongest in video-based physical transfer (Sim-to-Real).
- Physical Intelligence: Strongest in dexterity and manipulation (Physical Action).
None of these models have achieved 'General Purpose' status in the commercial sense. They are specialized foundation models optimized for specific hardware constraints.
India Market Context: Hardware, Imports, and Costs
For the Indian robotics sector, the implications of these foundation models are significant. The current regulatory landscape in India requires strict safety compliance for humanoid robots, particularly regarding industrial automation.
Import Duties and Pricing: If any of these systems enter India, they will face high import duties. For example, if a Tesla Optimus unit were to be imported at $25,000 USD, the landed cost including customs and GST could exceed ₹25 Lakhs INR. This pricing places the technology out of reach for Small and Medium Enterprises (SMEs) in India.
Local Integration: Until these foundation models are available as cloud APIs with localized Indian data centers, the 'shipping hardware' remains the only viable path for Indian companies. This means purchasing the physical unit and running the model locally.
Availability Timeline: As of early 2024, none of these three models are available for purchase by Indian entities. They remain in the 'announcement' or 'pilot' category. Shipping hardware is limited to pilot fleets in the US and Europe.
Conclusion: Grounding Expectations in Reality
The robotics foundation model race is progressing rapidly, but the gap between research and commercial deployment remains wide. Google RT-2, Tesla Groot, and Physical Intelligence's architecture represent the cutting edge of what is possible, but they are not yet general-purpose solutions available for purchase.
For Indian buyers and developers, the priority should be on hardware that ships with verified models. Until an Indian distributor offers a contract for these foundation models, claims of availability should be treated as announcements rather than facts. The industry must prioritize pilot deployments in controlled environments before scaling to general-purpose operations.
RobotWale continues to monitor these developments, prioritizing data from manufacturer spec sheets and independent testing reports over marketing materials.
✓ Key takeaways
- •Hands-on view of The Foundation Model Arms Race: Shipping Reality for RT-2, Groot, and Physical Intelligence inside our Robotics Foundation Models library.
- •Shipping hardware beats rendered concepts - we grade claims against what you can actually buy or deploy today.
- •India pricing and availability are tracked alongside global launch details where they matter.
References
Related articles
More in Robotics Foundation Models →

