January 31, 2025 News

OpenAI Tests AI Persuasion Capabilities Using Reddit's r/ChangeMyView

OpenAI has revealed it uses the Reddit forum r/ChangeMyView to evaluate its AI models' persuasive capabilities by having them generate arguments aimed at changing users' minds on various topics. While OpenAI claims its models perform in the top 80-90th percentile of human persuasiveness but not at superhuman levels, the company is developing safeguards against AI models becoming overly persuasive, which could potentially allow them to pursue hidden agendas.

Altman Admits OpenAI Falling Behind, Considers Open-Sourcing Older Models

In a Reddit AMA, OpenAI CEO Sam Altman acknowledged that Chinese competitor DeepSeek has reduced OpenAI's lead in AI and admitted that OpenAI has been "on the wrong side of history" regarding open source. Altman suggested the company might reconsider its closed source strategy, potentially releasing older models, while also revealing his growing belief that AI recursive self-improvement could lead to a "fast takeoff" scenario.

VC Midha: DeepSeek's Efficiency Won't Slow AI's GPU Demand

Andreessen Horowitz partner and Mistral board member Anjney Midha believes that despite DeepSeek's impressive R1 model demonstrating efficiency gains, AI companies will continue investing heavily in GPU infrastructure. He argues that efficiency breakthroughs will allow companies to produce more output from the same compute rather than reducing overall compute demand.

OpenAI Launches Affordable Reasoning Model o3-mini for STEM Problems

OpenAI has released o3-mini, a new AI reasoning model specifically fine-tuned for STEM problems including programming, math, and science. The model offers improved performance over previous reasoning models while running faster and costing less, with OpenAI claiming a 39% reduction in major mistakes on tough real-world questions compared to o1-mini.

Microsoft Establishes Advanced Planning Unit to Study AI's Societal Impact

Microsoft is creating a new Advanced Planning Unit (APU) within its Microsoft AI division to study the societal, health, and work implications of artificial intelligence. The unit will operate from the office of Microsoft AI's CEO Mustafa Suleyman and will combine research to explore future AI scenarios while making product recommendations and producing reports.

DeepSeek's Reasoning Model Disrupts AI Industry and Raises International Concerns

DeepSeek's release of its R1 reasoning model has created significant industry disruption, displacing ChatGPT as the App Store's top app and prompting reactions from both tech giants and the U.S. government. The Chinese AI lab claims to have built its models more efficiently and at lower cost than competitors, though some remain skeptical of these claims.

AI News Calendar

January 2025
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31