google cloud AI News & Updates

Google Cloud Unveils Specialized TPU 8t and TPU 8i Chips for AI Training and Inference

Google Cloud announced its eighth generation tensor processing units (TPUs), splitting into two specialized chips: TPU 8t for model training and TPU 8i for inference. The new chips promise 3x faster training, 80% better performance per dollar, and support for clusters exceeding 1 million TPUs. Despite this advancement, Google continues to offer Nvidia's latest chips alongside its own custom processors, with both companies collaborating on networking optimization.

Google Launches Gemini Enterprise Agent Platform for IT Teams at Cloud Next Conference

Google announced its Gemini Enterprise Agent Platform at the Cloud Next conference, a tool designed for building and managing AI agents at enterprise scale, positioning it as a competitor to Amazon Bedrock AgentCore and Microsoft Foundry. The platform is specifically targeted at IT and technical teams, while business users are directed to the separate Gemini Enterprise app for simpler agent-based tasks. The platform supports multiple models including Google's Gemini and Anthropic's Claude family (Opus, Sonnet, and Haiku).

Thinking Machines Lab Secures Multi-Billion Dollar Google Cloud Deal for Advanced AI Infrastructure

Mira Murati's startup Thinking Machines Lab has signed a multi-billion-dollar agreement with Google Cloud for access to advanced AI infrastructure, including systems powered by Nvidia's latest GB300 GPUs. The deal supports the company's reinforcement learning workloads for Tinker, a tool that automates the creation of custom frontier AI models, and marks Google's strategy to lock in emerging AI labs early. Thinking Machines previously raised $2 billion at a $12 billion valuation and this represents its first major cloud provider partnership.

Google and Intel Expand Multi-Year Partnership for AI Infrastructure and Custom Chip Development

Google and Intel announced an expanded multi-year partnership where Google Cloud will utilize Intel's Xeon 6 processors for AI, cloud, and inference workloads. The companies will also continue co-developing custom infrastructure processing units (IPUs) to accelerate data center tasks, addressing the growing industry demand for CPUs needed to run AI models.

Anthropic Secures Massive 3.5 Gigawatt Compute Expansion with Google and Broadcom

Anthropic has signed an expanded agreement with Google and Broadcom to secure 3.5 gigawatts of additional compute capacity using Google's TPUs, coming online in 2027. This deal supports the company's explosive growth, with run rate revenue jumping from $9 billion to $30 billion and over 1,000 enterprise customers spending $1M+ annually. The expansion reflects unprecedented demand for Claude AI models despite some U.S. government supply chain concerns.

Google Cloud VP Outlines Three Frontiers of AI Model Capability: Intelligence, Latency, and Scalable Cost

Michael Gerstenhaber, VP of Google Cloud's Vertex AI platform, describes three distinct frontiers driving AI model development: raw intelligence for complex tasks, low latency for real-time interactions, and cost-efficient scalability for mass deployment. He explains that agentic AI adoption is slower than expected due to missing production infrastructure like auditing patterns, authorization frameworks, and human-in-the-loop safeguards, though software engineering has seen faster adoption due to existing development lifecycle protections.

Google Launches Managed MCP Servers to Streamline AI Agent Integration with Cloud Services

Google has launched fully managed, remote MCP (Model Context Protocol) servers that enable AI agents to easily connect to Google and Cloud services like Maps, BigQuery, Compute Engine, and Kubernetes Engine. This infrastructure reduces the complexity of integrating agents with enterprise tools by providing standardized, pre-built connectors with built-in security and governance through Google Cloud IAM and Model Armor. The launch follows Google's Gemini 3 model release and aims to make Google "agent-ready by design" while supporting the open-source MCP standard developed by Anthropic.

Reliance Industries Launches Massive AI Infrastructure Initiative with Google and Meta Partnerships

India's richest man Mukesh Ambani has launched Reliance Intelligence, a new subsidiary aimed at building India's national AI infrastructure through strategic partnerships with Google Cloud and Meta. The initiative includes a dedicated AI cloud region starting with a data center in Gujarat, and a $100 million joint venture with Meta to deploy Llama-based enterprise AI solutions across India and international markets.

Google Cloud Partners with OpenAI Despite Search Competition Threat

Google CEO Sundar Pichai expressed excitement about Google Cloud's partnership with OpenAI, providing cloud computing resources to train and serve OpenAI's AI models. This creates a complex relationship where Google is supplying infrastructure to its biggest AI competitor, which poses a major threat to Google's core search business. Google Cloud revenue grew to $13.6 billion in Q2 2025, with significant growth attributed to serving AI companies including OpenAI, Anthropic, and other major AI labs.