Large Language Models AI News & Updates

Anthropic Releases Claude Sonnet 4.5 with Advanced Autonomous Coding Capabilities

Anthropic launched Claude Sonnet 4.5, a new AI model that the company says delivers state-of-the-art coding performance and can build production-ready applications autonomously. The model has demonstrated the ability to code independently for up to 30 hours, performing complex tasks like setting up databases, purchasing domains, and conducting security audits. Anthropic also claims improved AI alignment, with lower rates of sycophancy and deception, along with better resistance to prompt injection attacks.

South Korea Invests $390 Million in Domestic AI Companies to Challenge OpenAI and Google

South Korea has launched a ₩530 billion ($390 million) sovereign AI initiative, funding five local companies to develop large-scale foundation models that can compete with global AI giants. The government will review progress every six months and narrow the field to two frontrunners. Participants such as LG AI Research, SK Telecom, Naver Cloud, and Upstage are developing models optimized for the Korean language.

Hugging Face Co-founder Thomas Wolf to Discuss Open-Source AI Future at TechCrunch Disrupt 2025

Thomas Wolf, co-founder and chief science officer of Hugging Face, will speak at TechCrunch Disrupt 2025 about making AI research and models open and accessible. The session will focus on how open-source development, rather than closed labs and big tech budgets, can drive the next wave of AI breakthroughs. Wolf has been instrumental in launching key open-source AI tools like the Transformers library and the BigScience Workshop that produced the BLOOM language model.

OpenAI Research Identifies Evaluation Incentives as Key Driver of AI Hallucinations

OpenAI researchers have published a paper examining why large language models continue to hallucinate despite improvements, arguing that current evaluation methods incentivize confident guessing over admitting uncertainty. The study proposes reforming AI evaluation systems to penalize wrong answers and reward expressions of uncertainty, similar to standardized tests that discourage blind guessing. The researchers emphasize that widely used accuracy-based evaluations need fundamental updates to address this persistent challenge.
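
The incentive described above can be illustrated with a minimal sketch. It is not the paper's scoring rule: the function names, the sample answers, and the penalty of 1 point per wrong answer are assumptions chosen for illustration. Under plain accuracy, a model that guesses and a model that abstains when unsure score the same; once wrong answers cost points, abstaining scores higher.

```python
from typing import Optional, Sequence


def accuracy_score(answers: Sequence[Optional[str]], truths: Sequence[str]) -> float:
    """Plain accuracy: an abstention (None) earns the same 0 as a wrong answer."""
    correct = sum(a is not None and a == t for a, t in zip(answers, truths))
    return correct / len(truths)


def penalized_score(
    answers: Sequence[Optional[str]],
    truths: Sequence[str],
    wrong_penalty: float = 1.0,  # hypothetical penalty, not taken from the paper
) -> float:
    """+1 for a correct answer, 0 for abstaining, -wrong_penalty for a wrong answer."""
    total = 0.0
    for a, t in zip(answers, truths):
        if a is None:
            continue  # abstaining is neutral
        total += 1.0 if a == t else -wrong_penalty
    return total / len(truths)


truths = ["a", "b", "c", "d"]
guesser = ["a", "x", "y", "z"]      # always answers, wrong three times
abstainer = ["a", None, None, None]  # answers only when sure

print(accuracy_score(guesser, truths), accuracy_score(abstainer, truths))
print(penalized_score(guesser, truths), penalized_score(abstainer, truths))
```

With a penalty of 1, a guess only pays off in expectation when the chance of being right exceeds 50%, which is the kind of threshold behavior that negative marking on standardized tests aims for.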

Mistral AI Secures $14 Billion Valuation in Major European AI Investment Round

French AI startup Mistral AI is finalizing a €2 billion investment round at a $14 billion post-money valuation, making it one of Europe's most valuable tech startups. The OpenAI rival, founded by former DeepMind and Meta researchers, develops open-source language models and has raised over €1 billion from prominent investors since its founding two years ago.

OpenAI Launches GPT-5 with Aggressive Pricing Strategy to Challenge Competitors

OpenAI released GPT-5, which CEO Sam Altman calls "the best model in the world," though it only marginally outperforms models from competitors like Anthropic and Google on benchmarks. The model is priced significantly lower than competing models, particularly undercutting Anthropic's Claude Opus 4.1, potentially sparking an industry-wide price war among AI model providers.

xAI Releases Grok 4 with Frontier-Level Performance Despite Recent Antisemitic Output Controversy

Elon Musk's xAI launched Grok 4, claiming PhD-level performance across all academic subjects and state-of-the-art scores on challenging AI benchmarks like ARC-AGI-2. The release comes alongside a $300/month premium subscription and follows a recent controversy in which Grok's automated account posted antisemitic comments, forcing xAI to modify its system prompts.

Apple Explores Third-Party AI Integration for Next-Generation Siri Amid Internal Development Delays

Apple is reportedly considering using AI models from OpenAI and Anthropic to power an updated version of Siri, rather than relying solely on in-house technology. The company has been forced to delay its AI-enabled Siri from 2025 to 2026 or later due to technical challenges, highlighting Apple's struggle to keep pace with competitors in the AI race.

OpenAI Revenue Doubles to $10B Annually as ChatGPT Reaches 500M Weekly Users

OpenAI has reached $10 billion in annual recurring revenue, nearly doubling from $5.5 billion last year, driven by its consumer and business AI products. The company now serves over 500 million weekly active users and 3 million paying business customers, while targeting $125 billion in revenue by 2029.

DeepSeek Releases Updated R1 Reasoning Model with MIT License on Hugging Face

Chinese AI startup DeepSeek has released an updated version of its R1 reasoning AI model on Hugging Face under a permissive MIT license, allowing commercial use. The updated model contains 685 billion parameters, so running it requires significant computational resources.
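
For a rough sense of what "significant computational resources" means here, the back-of-the-envelope sketch below estimates the memory needed just to hold 685 billion weights at a few numeric precisions. The precisions are assumptions for illustration, not a statement of how the model is distributed, and the estimate ignores activations, the key-value cache, and runtime overhead, so real requirements are higher.

```python
def weight_memory_gb(num_params: int, bits_per_param: int) -> float:
    """Memory for the weights alone, in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


PARAMS = 685_000_000_000  # parameter count reported for the updated model

# Precisions below are illustrative assumptions.
for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"{label}: about {weight_memory_gb(PARAMS, bits):,.1f} GB for weights")
```

Even at 4 bits per parameter, the weights alone come to roughly 340 GB under these assumptions, well beyond the memory of a single consumer GPU.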