coding benchmarks AI News & Updates

Research Breakthrough

The K Prize, a new AI coding challenge designed to test models on real-world programming problems without benchmark contamination, announced its first winner who scored only 7.5% correct answers. This stands in stark con...

Model Evaluation coding benchmarks SWE-Bench AI programming benchmark contamination

-0.08% +1 days

-0.06% +1 days

Full analysis

coding benchmarks AI News & Updates

K Prize AI Coding Challenge Reveals Stark Reality: Winner Scores Only 7.5% on Contamination-Free Programming Test