Gemini AI News & Updates

Google Integrates Intrinsic Robotics Platform to Advance Physical AI Capabilities

Alphabet is moving its robotics software subsidiary Intrinsic under Google's umbrella to accelerate physical AI development. Intrinsic, which builds AI models and software for industrial robots, will work closely with Google DeepMind and leverage Gemini AI models while remaining a distinct entity. The move aims to make robotics more accessible to manufacturers and advance factory automation, particularly through Intrinsic's partnership with Foxconn.

Google Expands Gemini AI with Multi-Step Task Automation on Android Devices

Google announced updates to its Gemini AI features on Android, including beta multi-step task automation for ordering food and rideshares on select devices like Pixel 10 and Galaxy S26. The update also expands scam detection for calls and texts, and enhances Circle to Search to identify multiple items on screen simultaneously. The automation feature includes safety protections like explicit user commands, real-time monitoring, and limited app access within a secure virtual window.

Google Releases Gemini 3 Pro-Powered Deep Research Agent with API Access as OpenAI Launches GPT-5.2

Google launched a reimagined Gemini Deep Research agent based on its Gemini 3 Pro model, now offering developers API access through the new Interactions API to embed advanced research capabilities into their applications. The agent, designed to minimize hallucinations during complex multi-step tasks, will be integrated into Google Search, Finance, Gemini App, and NotebookLM. Alongside the release, Google published new benchmark results that it says show the agent ahead of rivals, though OpenAI simultaneously launched GPT-5.2 (codenamed Garlic), which it claims bests Google on various metrics.

Google Implements Multi-Layered Security Framework for Chrome's AI Agent Features

Google has detailed comprehensive security measures for Chrome's upcoming agentic AI features that will autonomously perform tasks like booking tickets and shopping. The security framework includes observer models such as a User Alignment Critic powered by Gemini, Agent Origin Sets to restrict access to trusted sites, URL verification systems, and user consent requirements for sensitive actions like payments or accessing banking information. These measures aim to prevent data leaks, unauthorized actions, and prompt injection attacks while AI agents operate within the browser.
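
As a rough illustration of how two of these layers could combine, the following is a minimal sketch, not Chrome's actual implementation. It assumes an allowlist of trusted origins and a consent flag for sensitive actions; the class name `AgentOriginSet`, the function `may_proceed`, and the action labels are hypothetical names chosen for this example.

```python
from dataclasses import dataclass, field
from urllib.parse import urlsplit

# Hypothetical labels for actions that would require explicit user consent.
SENSITIVE_ACTIONS = {"payment", "banking_access"}


@dataclass
class AgentOriginSet:
    """Hypothetical allowlist of origins the agent is permitted to touch."""
    allowed_origins: set[str] = field(default_factory=set)

    def permits(self, url: str) -> bool:
        parts = urlsplit(url)
        # Reject anything that is not HTTPS or has no host.
        if parts.scheme != "https" or not parts.hostname:
            return False
        origin = f"{parts.scheme}://{parts.hostname}"
        if parts.port is not None:
            origin += f":{parts.port}"
        return origin in self.allowed_origins


def may_proceed(origins: AgentOriginSet, url: str, action: str,
                user_consented: bool) -> bool:
    """Allow an action only on a trusted origin, and only with
    consent when the action is sensitive."""
    if not origins.permits(url):
        return False
    if action in SENSITIVE_ACTIONS and not user_consented:
        return False
    return True


if __name__ == "__main__":
    trusted = AgentOriginSet({"https://tickets.example"})
    print(may_proceed(trusted, "https://tickets.example/checkout", "payment", False))
    print(may_proceed(trusted, "https://tickets.example/checkout", "payment", True))
```

In this sketch the origin check runs before the consent check, so an untrusted site is refused even if the user has approved the action. Whether the real framework orders its checks this way is not stated in the announcement.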

DeepMind Unveils SIMA 2: Gemini-Powered Agent Demonstrates Self-Improvement and Advanced Reasoning in Virtual Environments

Google DeepMind released a research preview of SIMA 2, a generalist AI agent powered by Gemini 2.5 that can understand, reason about, and interact with virtual environments, doubling its predecessor's performance on complex tasks. Unlike SIMA 1, which simply followed instructions, SIMA 2 integrates advanced language models to reason internally, understand context, and self-improve through trial and error with minimal human training data. DeepMind positions this as a significant step toward artificial general intelligence and general-purpose robotics, though no commercial timeline has been announced.

Google Expands Gemini AI Integration in Chrome with Agentic Browsing and Advanced Search Capabilities

Google is rolling out Gemini AI integration in Chrome to all U.S. desktop users, enabling AI assistance across web pages and multiple tabs. The company announced upcoming agentic capabilities that will allow Gemini to autonomously complete tasks like booking appointments and online shopping, while also introducing AI Mode search directly in the address bar.

Apple Considers Google Gemini Partnership to Enhance Siri's AI Capabilities

Apple is reportedly in talks with Google to use Gemini technology for a major Siri revamp, as the company falls behind competitors in AI assistant capabilities. Apple has also approached OpenAI and Anthropic for similar partnerships, with Google already training a model that could run on Apple's servers.

Google Launches Gemini 2.5 Deep Think Multi-Agent AI System with Advanced Reasoning Capabilities

Google DeepMind has released Gemini 2.5 Deep Think, a multi-agent AI reasoning model that explores multiple ideas simultaneously to provide better answers, available to $250/month Ultra subscribers. The system achieved state-of-the-art performance on challenging benchmarks including Humanity's Last Exam and LiveCodeBench 6, outperforming competitors like OpenAI's o3 and xAI's Grok 4. The release is part of an industry-wide convergence toward multi-agent AI systems, though these computationally expensive models remain gated behind premium subscriptions.

Google Expands Gemini AI Assistant to Wear OS Smartwatches and Enhances Circle to Search with AI Mode

Google is rolling out its Gemini AI assistant to Wear OS smartwatches from multiple manufacturers, replacing Google Assistant as part of its broader platform integration strategy. The company is also enhancing Circle to Search with AI Mode capabilities, allowing users to ask follow-up questions and explore complex topics directly within visual search results.

Google Deploys Veo 3 Video Generation AI Model to Global Gemini Users

Google has rolled out its Veo 3 video generation model to Gemini users in over 159 countries, allowing paid subscribers to create 8-second videos from text prompts. The service is limited to 3 videos per day for AI Pro plan subscribers, with image-to-video capabilities planned for future release.