Reasoning AI AI News & Updates

Google Launches Gemini 2.5 Pro with Advanced Reasoning Capabilities

Google has unveiled Gemini 2.5, a new family of AI models with built-in reasoning capabilities that pauses to "think" before answering questions. The flagship model, Gemini 2.5 Pro Experimental, outperforms competing AI models on several benchmarks including code editing and supports a 1 million token context window (expanding to 2 million soon).

New ARC-AGI-2 Test Reveals Significant Gap Between AI and Human Intelligence

The Arc Prize Foundation has created a challenging new test called ARC-AGI-2 to measure AI intelligence, designed to prevent models from relying on brute computing power. Current leading AI models, including reasoning-focused systems like OpenAI's o1-pro, score only around 1% on the test compared to a 60% average for human panels, highlighting significant limitations in AI's general problem-solving capabilities.

OpenAI's Noam Brown Claims Reasoning AI Models Could Have Existed Decades Earlier

OpenAI's AI reasoning research lead Noam Brown suggested at Nvidia's GTC conference that certain reasoning AI models could have been developed 20 years earlier if researchers had used the right approach. Brown, who previously worked on game-playing AI including Pluribus poker AI and helped create OpenAI's reasoning model o1, also addressed the challenges academia faces in competing with AI labs and identified AI benchmarking as an area where academia could make significant contributions despite compute limitations.