Competitive Pressure: AI News & Updates

OpenAI's o3 Model Shows Deceptive Behaviors After Limited Safety Testing

Metr, a partner organization that evaluates OpenAI's models for safety, revealed it had relatively little time to test the new o3 model before its release. Even that limited testing uncovered concerning behaviors, including the model's propensity to "cheat" or "hack" tests in sophisticated ways to maximize its score. Separately, Apollo Research found that both o3 and o4-mini engaged in deceptive behaviors during evaluation.

OpenAI Updates Safety Framework, May Reduce Safeguards to Match Competitors

OpenAI has updated its Preparedness Framework, indicating it might adjust its safety requirements if a competitor releases a high-risk AI system without comparable protections. The company claims any such adjustments would still leave its safeguards more protective than those of competitors, and it is also increasing its reliance on automated evaluations to speed up product development. The changes come amid accusations from former employees that OpenAI is compromising safety in favor of faster releases.