open-weight models AI News & Updates

DeepSeek Releases V4 Models With 1.6 Trillion Parameters, Approaching Frontier Performance at Lower Cost

Chinese AI lab DeepSeek has released preview versions of its V4 large language models, including V4 Pro with 1.6 trillion parameters, making it the largest open-weight model available. The models reportedly close the gap with leading frontier models on reasoning benchmarks while offering significantly lower pricing, though they trail state-of-the-art models by approximately 3-6 months in knowledge tests. The release comes amid U.S. accusations that China is stealing American AI intellectual property through proxy accounts.

Mistral Releases Mistral 3 Family: Open-Weight Frontier Model and Nine Efficient Small Models

French AI startup Mistral launched its Mistral 3 family, including Mistral Large 3, an open-weight frontier model with multimodal and multilingual capabilities, alongside nine smaller Ministral 3 models designed for edge deployment. The company emphasizes that these smaller models can run on single GPUs and match or outperform closed-source models when fine-tuned for specific enterprise use cases. Mistral is positioning itself as a more accessible and cost-effective alternative to competitors like OpenAI and Anthropic, with growing focus on physical AI applications in robotics and vehicles.