professional tasks AI News & Updates

OpenAI's GPT-5 Shows Near-Human Performance Across Professional Tasks in New Economic Benchmark

OpenAI released GDPval, a new benchmark testing AI models against human professionals across 44 occupations in nine major industries. GPT-5 performed at or above human expert level 40.6% of the time, while Anthropic's Claude Opus 4.1 achieved 49%, representing significant progress from GPT-4o's 13.7% score just 15 months prior.