February 19, 2025 News

AI Model Benchmarking Faces Criticism as xAI Releases Grok 3

The AI industry is grappling with the limitations of current benchmarking methods as xAI releases its Grok 3 model, which reportedly outperforms competitors in mathematics and programming tests. Experts are questioning the reliability and relevance of existing benchmarks, with calls for better testing methodologies that align with real-world utility rather than esoteric knowledge.

Mistral's Le Chat Reaches 1 Million Downloads in Two Weeks

Mistral's AI assistant, Le Chat, has reached one million downloads in just 14 days, becoming the top free app on the iOS App Store in France. This success places it alongside other rapidly adopted AI apps, including ChatGPT and DeepSeek, while facing competition from established tech giants like Google and Microsoft.