coding benchmarks AI News & Updates

K Prize AI Coding Challenge Reveals Stark Reality: Winner Scores Only 7.5% on Contamination-Free Programming Test

The K Prize, a new AI coding challenge designed to test models on real-world programming problems without benchmark contamination, announced its first winner who scored only 7.5% correct answers. This stands in stark contrast to existing SWE-Bench scores of up to 75%, suggesting either widespread benchmark contamination or that current AI coding capabilities are far more limited than previously believed.