Mathematical Reasoning AI News & Updates

OpenAI Criticized for Overstating GPT-5 Mathematical Problem-Solving Capabilities

OpenAI researchers initially claimed GPT-5 solved 10 previously unsolved Erdős mathematical problems, prompting criticism from AI leaders including Meta's Yann LeCun and Google DeepMind's Demis Hassabis. Mathematician Thomas Bloom clarified that GPT-5 merely found existing solutions in the literature that were not catalogued on his website, rather than solving truly unsolved problems. OpenAI later acknowledged the accomplishment was limited to literature search rather than novel mathematical problem-solving.

OpenAI and Google AI Models Achieve Gold Medal Performance in International Math Olympiad

AI models from OpenAI and Google DeepMind both achieved gold medal scores in the 2025 International Math Olympiad, demonstrating significant advances in AI reasoning capabilities. The achievement marks a breakthrough in AI systems' ability to solve complex mathematical problems in natural language without human translation assistance. However, the companies are engaged in disputes over proper evaluation protocols and announcement timing.

DeepSeek Updates Prover V2 for Advanced Mathematical Reasoning

Chinese AI lab DeepSeek has released an upgraded version of its mathematics-focused AI model Prover V2, built on their V3 model with 671 billion parameters using a mixture-of-experts architecture. The company, which previously made Prover available for formal theorem proving and mathematical reasoning, is reportedly considering raising outside funding for the first time while continuing to update its model lineup.

DeepMind's AlphaGeometry2 Surpasses IMO Gold Medalists in Mathematical Problem Solving

Google DeepMind has developed AlphaGeometry2, an AI system that can solve 84% of International Mathematical Olympiad geometry problems from the past 25 years, outperforming the average gold medalist. The system combines a Gemini language model with a symbolic reasoning engine, demonstrating that hybrid approaches combining neural networks with rule-based systems may be more effective for complex mathematical reasoning than either approach alone.