
OpenAI's Reasoning Models Show Increased Hallucination Rates

OpenAI's new reasoning models, o3 and o4-mini, hallucinate more often than their predecessors: on OpenAI's PersonQA benchmark, o3 hallucinated on 33% of questions and o4-mini on 48%. Researchers do not yet understand the cause of the increase, and scaling up reasoning models appears to make hallucination worse, which could undermine the models' utility despite their gains in areas such as coding and math.