Open-Source AI AI News & Updates

Mistral AI: France's AI Champion Scales Globally with Models and Le Chat App

Mistral AI, a French startup valued at $6 billion, has positioned itself as Europe's answer to OpenAI with its suite of AI models and Le Chat assistant, which recently reached 1 million mobile downloads. Founded in 2023 by former DeepMind and Meta researchers, the company has raised approximately $1.04 billion in funding and forged strategic partnerships with Microsoft, IBM, and various government agencies, while maintaining its commitment to open-source AI development.

Ai2 Releases High-Performance Small Language Model Under Open License

Nonprofit AI research institute Ai2 has released Olmo 2 1B, a 1-billion-parameter AI model that outperforms similarly-sized models from Google, Meta, and Alibaba on several benchmarks. The model is available under the permissive Apache 2.0 license with complete transparency regarding code and training data, making it accessible for developers working with limited computing resources.

JetBrains Releases Open Source AI Coding Model with Technical Limitations

JetBrains has released Mellum, an open AI model specialized for code completion, under the Apache 2.0 license. Trained on 4 trillion tokens and containing 4 billion parameters, the model requires fine-tuning before use and comes with explicit warnings about potential biases and security vulnerabilities in its generated code.

Meta's Llama AI Models Reach 1.2 Billion Downloads

Meta announced that its Llama family of AI models has reached 1.2 billion downloads, up from 1 billion in mid-March. The company also revealed that thousands of developers are contributing to the ecosystem, creating tens of thousands of derivative models, while Meta AI, the company's Llama-powered assistant, has reached approximately one billion users.

Meta's Llama Models Reach 1 Billion Downloads as Company Pursues AI Leadership

Meta CEO Mark Zuckerberg announced that the company's Llama AI model family has reached 1 billion downloads, representing a 53% increase over a three-month period. Despite facing copyright lawsuits and regulatory challenges in Europe, Meta plans to invest up to $80 billion in AI this year and is preparing to launch new reasoning models and agentic features.

Sesame Releases Open Source Voice AI Model with Few Safety Restrictions

AI company Sesame has open-sourced CSM-1B, the base model behind its realistic virtual assistant Maya, under a permissive Apache 2.0 license allowing commercial use. The 1 billion parameter model generates audio from text and audio inputs using residual vector quantization technology, but lacks meaningful safeguards against voice cloning or misuse, relying instead on an honor system that urges developers to avoid harmful applications.

DeepSeek Announces Open Sourcing of Production-Tested AI Code Repositories

Chinese AI lab DeepSeek has announced plans to open source portions of its online services' code as part of an upcoming "open source week" event. The company will release five code repositories that have been thoroughly documented and tested in production, continuing its practice of making AI resources openly available under permissive licenses.

Elon Musk Leads $97.4 Billion Bid to Purchase OpenAI, Promising Return to Open Source Roots

Elon Musk, along with investors including his AI company xAI, has submitted an unsolicited $97.4 billion bid to purchase OpenAI. Musk, who co-founded OpenAI in 2015 and is currently in legal disputes with the company, claims the acquisition would return OpenAI to its original mission as an open-source, safety-focused organization, contrasting this with his approach at xAI where he claims to have made the Grok model open source.

Stanford Researchers Create Open-Source Reasoning Model Comparable to OpenAI's o1 for Under $50

Researchers from Stanford and University of Washington have created an open-source AI reasoning model called s1 that rivals commercial models like OpenAI's o1 and DeepSeek's R1 in math and coding abilities. The model was developed for less than $50 in cloud computing costs by distilling capabilities from Google's Gemini 2.0 Flash Thinking Experimental model, raising questions about the sustainability of AI companies' business models.

VC Midha: DeepSeek's Efficiency Won't Slow AI's GPU Demand

Andreessen Horowitz partner and Mistral board member Anjney Midha believes that despite DeepSeek's impressive R1 model demonstrating efficiency gains, AI companies will continue investing heavily in GPU infrastructure. He argues that efficiency breakthroughs will allow companies to produce more output from the same compute rather than reducing overall compute demand.