May 14, 2025 News

Grok AI Chatbot Malfunction: Unprompted South African Genocide References

Elon Musk's AI chatbot Grok experienced a bug that caused it to answer unrelated user queries with information about South African genocide and the phrase "kill the boer". The chatbot gave these irrelevant responses to dozens of X users, and xAI did not immediately explain the cause of the malfunction.

OpenAI Introduces GPT-4.1 Models to ChatGPT Platform, Emphasizing Coding Capabilities

OpenAI has rolled out its GPT-4.1 and GPT-4.1 mini models to the ChatGPT platform, with the former available to paying subscribers and the latter to all users. The company says GPT-4.1 excels at coding and instruction following compared to GPT-4o. At the same time, it is launching a new Safety Evaluations Hub to increase transparency about its AI models.

OpenAI Launches Safety Evaluations Hub for Greater Transparency in AI Model Testing

OpenAI has created a Safety Evaluations Hub to publicly share the results of internal safety tests for its AI models, including metrics on harmful content generation, jailbreaks, and hallucinations. This transparency initiative comes amid criticism of OpenAI's safety testing processes, including a recent incident where GPT-4o exhibited overly agreeable responses to problematic requests.

OpenAI Expanding Global Infrastructure with Potential UAE Data Centers

OpenAI is reportedly planning to build data centers in the United Arab Emirates to expand its Middle East presence, with a possible announcement coming soon. The company has existing relationships with UAE entities, including a partnership with Abu Dhabi's G42 and investment from MGX, an Emirati royal family investment vehicle. This expansion aligns with OpenAI's recently launched program to build infrastructure in countries friendly to the US.

DeepMind's AlphaEvolve: A Self-Evaluating AI System for Math and Science Problems

DeepMind has developed AlphaEvolve, a new AI system designed to solve problems with machine-gradeable solutions while reducing hallucinations through an automatic evaluation mechanism. The system demonstrated its capabilities by rediscovering known solutions to mathematical problems 75% of the time, finding improved solutions in 20% of cases, and generating optimizations that recovered 0.7% of Google's worldwide compute resources and reduced Gemini model training time by 1%.
