Google Cloud introduced two new Tensor Processing Unit generations at its annual conference.
The TPU 8t handles training for frontier AI models. TPU 8i focuses on inference tasks across agent deployments. Both chips launch later this year through cloud services.
Executives described the split as response to workload evolution. Training demands peak compute density while inference requires memory bandwidth. TPU 8i triples high-bandwidth memory over prior versions.
Google Cloud CEO Thomas Kurian called power efficiency the key constraint. The chips cut energy per operation significantly from seventh-generation Ironwood. Configurations scale to thousands of units per pod.
Customers including Anthropic expanded TPU commitments. The AI developer secured access to million-unit clusters for Claude model iterations. Apple deployed prior TPUs for device intelligence training.
Nvidia retains cloud presence through Google partnerships. Vera Rubin GPUs arrive later this year alongside TPU options. The arrangement lets customers mix architectures by workload.
Google maintains internal TPU production for Gemini development. Cloud sales generated billions last year as enterprises sought alternatives. Inference growth drives hardware revenue acceleration.
TPU 8i addresses memory wall limitations. Rapid data access enables real-time agent responses without performance cliffs. Training counterpart boosts model scale for research labs.
Conference timing aligned with competitive escalation. Amazon and Microsoft announced GPU expansions last quarter. Google positions custom silicon for cost leadership in sustained AI runs.
Early benchmarks showed TPU 8t matching Nvidia H100 clusters on select metrics. Inference tests highlighted lower latency for customer-facing applications.
Alphabet stock rose modestly on the news. Investors parsed long-term Nvidia displacement potential against near-term partnership continuity.
The launches mark Google's most direct hardware challenge. Enterprise adoption will determine market share gains.