Multimodal AI News & Updates

Mistral Releases Cost-Efficient AI Model Rivaling Industry Leaders

French AI startup Mistral has launched Mistral Medium 3, a new AI model focused on efficiency without compromising performance. The model reportedly performs at 90% of Anthropic's Claude Sonnet 3.7 at lower cost, excels at coding and STEM tasks, and can be deployed on various cloud platforms or self-hosted with minimal hardware requirements.

Google Launches Gemini 2.5 Pro with Advanced Reasoning Capabilities

Google has unveiled Gemini 2.5, a new family of AI models with built-in reasoning capabilities that pauses to "think" before answering questions. The flagship model, Gemini 2.5 Pro Experimental, outperforms competing AI models on several benchmarks including code editing and supports a 1 million token context window (expanding to 2 million soon).

OpenAI Enhances Voice and Transcription AI Models with Advanced Control Features

OpenAI has released new AI models for transcription and voice generation that offer improved accuracy and control over previous versions. The new text-to-speech model allows developers to steer voice characteristics using natural language, while the transcription models reduce hallucinations but show significant error rates for certain languages.