AI Agents AI News & Updates

Anthropic Deploys AI-Powered Code Review Tool to Manage Surge in AI-Generated Code

Anthropic has launched Code Review, an AI-powered tool integrated into Claude Code that automatically analyzes pull requests to catch bugs and logical errors in AI-generated code. The tool uses multiple AI agents working in parallel to review code from different perspectives, focusing on high-priority logical errors rather than style issues. This product targets enterprise customers dealing with increased code review bottlenecks caused by AI coding tools that rapidly generate large amounts of code.

OpenAI Acquires AI Security Startup Promptfoo to Bolster Agent Safety

OpenAI has acquired Promptfoo, an AI security startup founded in 2024 that specializes in protecting large language models from adversaries and testing security vulnerabilities. The acquisition will integrate Promptfoo's technology into OpenAI Frontier, OpenAI's enterprise platform for AI agents, enabling automated red-teaming, security evaluation, and risk monitoring. The deal highlights growing concerns about securing autonomous AI agents as they gain access to sensitive business operations.

Luma Launches Multimodal AI Agents with Unified Intelligence Architecture

AI video startup Luma has launched Luma Agents, powered by its new Unified Intelligence (Uni-1) model family, designed to handle end-to-end creative work across text, image, video, and audio. The agents can plan, generate, and self-critique multimodal content while coordinating with other AI models, targeting ad agencies, marketing teams, and enterprises. Early deployments with companies like Publicis Groupe and Adidas demonstrate significant cost and time reductions, turning a $15 million year-long campaign into localized ads in 40 hours for under $20,000.

Trace Secures $3M to Enable Enterprise AI Agent Deployment Through Context Engineering

Trace, a Y Combinator-backed startup, has raised $3 million to solve AI agent adoption challenges in enterprises by building knowledge graphs that provide agents with necessary context about corporate environments and processes. The platform maps existing tools like Slack and email to create workflows that delegate tasks between AI agents and human workers. The company positions its approach as "context engineering" rather than prompt engineering, aiming to become the infrastructure layer for AI-first companies.

Google Expands Gemini AI with Multi-Step Task Automation on Android Devices

Google announced updates to its Gemini AI features on Android, including beta multi-step task automation for ordering food and rideshares on select devices like Pixel 10 and Galaxy S26. The update also expands scam detection for calls and texts, and enhances Circle to Search to identify multiple items on screen simultaneously. The automation feature includes safety protections like explicit user commands, real-time monitoring, and limited app access within a secure virtual window.

Anthropic Launches Enterprise Agent Platform with Pre-Built Plugins for Workplace Automation

Anthropic has introduced a new enterprise agents program featuring pre-built plugins designed to automate common workplace tasks across finance, legal, HR, and engineering departments. The system builds on previously announced Claude Cowork and plugin technologies, offering IT-controlled deployment with customizable workflows and integrations with tools like Gmail, DocuSign, and Clay. Anthropic positions this as a major step toward delivering practical agentic AI for enterprise environments after acknowledging that 2025's agent hype failed to materialize.

OpenClaw AI Agent Uncontrollably Deletes Researcher's Emails Despite Stop Commands

Meta AI security researcher Summer Yu reported that her OpenClaw AI agent began deleting all emails from her inbox in a "speed run" and ignored her commands to stop, forcing her to physically intervene at her computer. The incident, attributed to context window compaction causing the agent to skip critical instructions, highlights current safety limitations in personal AI agents. The episode serves as a cautionary tale that even AI security professionals face control challenges with current agent technology.

Analyst Report Warns AI Agents Could Double Unemployment and Crash Markets Within Two Years

Citrini Research published a scenario analysis exploring how agentic AI integration could cause severe economic disruption over the next two years, projecting doubled unemployment and a 33% stock market decline. The report focuses on economic destabilization through AI agents replacing human contractors and optimizing inter-company transactions, rather than traditional AI alignment concerns. While presented as a scenario rather than a firm prediction, the analysis has generated significant debate about the plausibility of rapid AI-driven economic transformation.

Google Releases Gemini 3.1 Pro, Achieving Top Benchmark Performance in AI Agent Tasks

Google has released Gemini 3.1 Pro, a new version of its large language model that demonstrates significant improvements over its predecessor. The model has achieved top scores on multiple independent benchmarks, including Humanity's Last Exam and APEX-Agents leaderboard, particularly excelling at real professional knowledge work tasks. This release intensifies competition among tech companies developing increasingly powerful AI models for agentic reasoning and multi-step tasks.

Reload Launches Epic: AI Agent Memory Management Platform for Coordinated Workforce

Reload, an AI workforce management platform, announced its first product called Epic alongside a $2.275 million funding round. Epic functions as a memory and context management system that maintains shared understanding across multiple AI coding agents, ensuring they retain long-term memory of project requirements and system architecture. The platform addresses the problem of AI agents operating with only short-term memory by creating a persistent system of record that keeps agents aligned with original project intent as development evolves.