Agentic AI AI News & Updates

OpenAI Launches Enhanced Agents SDK with Sandboxing for Safer Enterprise AI Agent Deployment

OpenAI has updated its Agents SDK to help enterprises build AI agents with new safety features including sandboxing capabilities that allow agents to operate in controlled environments. The update includes an in-distribution harness for frontier models and aims to enable development of long-horizon, complex multi-step agents while mitigating risks from unpredictable agent behavior. Initial support is available in Python with TypeScript and additional features planned for future releases.

Anthropic Releases Mythos: Powerful Frontier AI Model for Cybersecurity Vulnerability Detection

Anthropic has released a limited preview of Mythos, described as one of its most powerful frontier AI models, to over 40 partner organizations including Amazon, Apple, Microsoft, and Cisco for defensive cybersecurity work. The model has reportedly identified thousands of zero-day vulnerabilities in software systems, some dating back one to two decades. While designed as a general-purpose model with strong coding and reasoning capabilities, concerns exist about potential weaponization by bad actors to exploit rather than fix vulnerabilities.

Anthropic Acquires Computer-Use AI Startup Vercept in Strategic Talent Play

Anthropic has acquired Vercept, an AI startup that developed tools for complex agentic tasks including a cloud-based computer-use agent capable of operating remote Macbooks. The acquisition brings several co-founders and researchers to Anthropic, though one co-founder had already been poached by Meta for $250 million, and Vercept's product will be shut down on March 25th. The deal follows Anthropic's December acquisition of coding agent engine Bun as part of its strategy to scale Claude Code capabilities.

Google Cloud VP Outlines Three Frontiers of AI Model Capability: Intelligence, Latency, and Scalable Cost

Michael Gerstenhaber, VP of Google Cloud's Vertex AI platform, describes three distinct frontiers driving AI model development: raw intelligence for complex tasks, low latency for real-time interactions, and cost-efficient scalability for mass deployment. He explains that agentic AI adoption is slower than expected due to missing production infrastructure like auditing patterns, authorization frameworks, and human-in-the-loop safeguards, though software engineering has seen faster adoption due to existing development lifecycle protections.

Analyst Report Warns AI Agents Could Double Unemployment and Crash Markets Within Two Years

Citrini Research published a scenario analysis exploring how agentic AI integration could cause severe economic disruption over the next two years, projecting doubled unemployment and a 33% stock market decline. The report focuses on economic destabilization through AI agents replacing human contractors and optimizing inter-company transactions, rather than traditional AI alignment concerns. While presented as a scenario rather than a firm prediction, the analysis has generated significant debate about the plausibility of rapid AI-driven economic transformation.

Apple Integrates Agentic AI Coding Assistants into Xcode Development Environment

Apple has released Xcode 26.3, integrating agentic coding tools from Anthropic (Claude Agent) and OpenAI (Codex) directly into its development environment. These AI agents can autonomously explore projects, write code, run tests, fix errors, and access Apple's developer documentation using the Model Context Protocol (MCP). The feature aims to automate complex development tasks while maintaining transparency through step-by-step breakdowns and visual code highlighting.

OpenAI Releases MacOS Codex App with Multi-Agent Coding Capabilities

OpenAI has launched a new MacOS application for its Codex coding tool, incorporating agentic workflows that allow multiple AI agents to work independently on programming tasks in parallel. The app features background automations, customizable agent personalities, and leverages the GPT-5.2-Codex model, though benchmarks show it performs similarly to competing models from Gemini 3 and Claude Opus. CEO Sam Altman claims the tool enables sophisticated software development in hours, limited only by how fast users can input ideas.

Anthropic Launches Cowork: Simplified AI Agent for Non-Technical Users

Anthropic has announced Cowork, a more accessible version of Claude Code built into the Claude Desktop app that allows users to designate folders for Claude to read and modify files through a chat interface. Currently in research preview for Max subscribers, the tool is designed for non-technical users to accomplish tasks like assembling expense reports or managing media files without requiring command-line knowledge. Anthropic warns of potential risks including prompt injection and file deletion, recommending clear instructions from users.

Nvidia Unveils Rubin Architecture: Next-Generation AI Computing Platform Enters Full Production

Nvidia has officially launched its Rubin computing architecture at CES, described as state-of-the-art AI hardware now in full production. The new architecture offers 3.5x faster model training and 5x faster inference compared to the previous Blackwell generation, with major cloud providers and AI labs already committed to deployment. The system includes six integrated chips addressing compute, storage, and interconnection bottlenecks, with particular focus on supporting agentic AI workflows.

AWS Launches Autonomous AI Coding Agents Capable of Multi-Day Independent Operation

Amazon Web Services announced three new AI agents, including Kiro autonomous agent that can independently write production code for days at a time with minimal human intervention. The agents handle coding, security reviews, and DevOps tasks by learning team workflows and maintaining persistent context across sessions. AWS claims Kiro can autonomously complete complex, multi-step coding tasks assigned from backlogs while following company specifications.