Commercial Release AI News & Updates

xAI Releases Grok 3 API with Reasoning Capabilities at Premium Pricing

Elon Musk's AI company xAI has launched an API for its flagship Grok 3 model, offering both standard and mini versions with reasoning capabilities. The pricing is relatively high compared to competitors, with Grok 3 costing $3 per million input tokens and $15 per million output tokens, while also falling short of previously claimed capabilities like its context window.

Google Introduces Agentic Capabilities to Gemini Code Assist for Complex Coding Tasks

Google has enhanced its Gemini Code Assist with new agentic capabilities that can complete multi-step programming tasks such as creating applications from product specifications or transforming code between programming languages. The update includes a Kanban board for managing AI agents that can generate work plans and report progress on job requests, though reliability concerns remain as studies show AI code generators frequently introduce security vulnerabilities and bugs.

Google Launches Gemini 2.5 Flash: Efficiency-Focused AI Model with Reasoning Capabilities

Google has announced Gemini 2.5 Flash, a new AI model designed for efficiency while maintaining strong performance. The model offers dynamic computing controls allowing developers to adjust processing time based on query complexity, making it suitable for high-volume, cost-sensitive applications like customer service and document parsing while featuring self-checking reasoning capabilities.

Google Sets Premium Pricing for Gemini 2.5 Pro Amid Rising Costs for Top AI Models

Google has announced pricing for its Gemini 2.5 Pro model at $1.25 per million input tokens and $10 per million output tokens, making it Google's most expensive AI offering to date. This pricing, while higher than some competitors like OpenAI's o3-mini, reflects an industry-wide trend of increasing costs for flagship AI models, potentially driven by high demand and significant computing expenses.

Microsoft Enhances Copilot with Web Browsing, Action Capabilities, and Improved Memory

Microsoft has significantly upgraded its Copilot AI assistant with new capabilities including performing actions on websites, remembering user preferences, analyzing real-time video, and creating podcast-like content summaries. These features, similar to those offered by competitors like OpenAI's Operator and Google's Gemini, allow Copilot to complete tasks such as booking tickets and reservations across partner websites.

Cognition Introduces Affordable Pay-as-you-go Plan for Devin AI Coding Assistant

Cognition has launched a new entry-level pricing plan for its autonomous coding tool Devin, starting at $20 with a pay-as-you-go structure after initial credits are used. The company claims Devin 2.0 is significantly improved from its December release, now featuring project planning capabilities and better documentation features, though independent evaluations suggest it still struggles with complex coding tasks.

OpenAI Faces Capacity Issues as ChatGPT Usage Surges to 500 Million Weekly Users

OpenAI CEO Sam Altman announced that unexpected demand for ChatGPT's new image generation tool has created significant capacity challenges, resulting in delayed product releases and service issues. ChatGPT has now reached 500 million weekly users and 20 million paying subscribers, with a million new users joining in a single hour as the company struggles to scale infrastructure fast enough.

Amazon Launches Nova Act: An AI Agent Capable of Browser Control

Amazon has unveiled Nova Act, a general-purpose AI agent that can independently control web browsers to perform simple tasks like making reservations or ordering food. The technology, developed by Amazon's San Francisco-based AGI lab, will power features in the upcoming Alexa+ and is being released alongside a developer SDK for building agent prototypes.

Browser Use Raises $17M to Help AI Agents Navigate Websites More Effectively

Browser Use, a startup making websites more accessible to AI agents, has secured $17 million in seed funding led by Felicis. The company's technology breaks down website elements into a text-like format that AI agents can better understand, enabling more reliable automation of web-based tasks without relying on vision-based systems that frequently break.

1X Announces In-Home Tests of Neo Gamma Humanoid Robots Starting in 2025

Norwegian robotics startup 1X plans to begin testing its humanoid robot, Neo Gamma, in several hundred to thousand homes by the end of 2025. These initial tests will rely heavily on teleoperators—humans remotely controlling the robots—to gather data that will help train AI models for future autonomous capabilities.