Large Language Models AI News & Updates

Ai2 Claims New Open-Source Model Outperforms DeepSeek and GPT-4o

Nonprofit AI research institute Ai2 has released Tulu 3 405B, an open-source AI model containing 405 billion parameters that reportedly outperforms DeepSeek V3 and OpenAI's GPT-4o on certain benchmarks. The model, which required 256 GPUs to train, utilizes reinforcement learning with verifiable rewards (RLVR) and demonstrates superior performance on specialized knowledge questions and grade-school math problems.