Video Generation AI News & Updates

Microsoft Launches Three Multimodal Foundation Models to Compete in AI Market

Microsoft AI announced three new foundational models: MAI-Transcribe-1 for speech-to-text across 25 languages, MAI-Voice-1 for audio generation, and MAI-Image-2 for video generation. Developed by Microsoft's MAI Superintelligence team led by Mustafa Suleyman, these models are positioned as cost-competitive alternatives to offerings from Google and OpenAI, with pricing starting at $0.36 per hour for transcription. The release represents Microsoft's effort to build its own AI model stack while maintaining its partnership with OpenAI.

OpenAI Shuts Down Sora Video Generation Platform After Six Months

OpenAI announced it is shutting down its Sora video generation app and related models just six months after launch, signaling a strategic shift toward enterprise and productivity tools ahead of a potential IPO. The decision reflects OpenAI's recognition that consumer-facing video products lack the same market fit as ChatGPT, while ByteDance's reported delay of Seedance 2.0 due to IP concerns suggests broader challenges in the AI video generation space. Industry observers view this as a reality check for claims that AI video tools would rapidly replace traditional content creation.

Runway Secures $315M Series E at $5.3B Valuation to Develop Advanced World Models for AGI

AI video startup Runway raised $315 million at a $5.3 billion valuation to develop next-generation world models, AI systems that create internal representations of environments to predict future events. The company, which recently released its Gen 4.5 video generation model that outperformed Google and OpenAI offerings, plans to expand world model capabilities beyond media into medicine, climate, energy, and robotics. This strategic shift positions Runway among competitors like Fei-Fei Li's World Labs and Google DeepMind in the race to build world models viewed as essential for surpassing large language model limitations.

Runway Launches GWM-1 World Model with Physics Simulation and Native Audio Generation

Runway has released GWM-1, its first world model capable of frame-by-frame prediction with understanding of physics, geometry, and lighting for creating interactive simulations. The model includes specialized variants for robotics training (GWM-Robotics), avatar simulation (GWM-Avatars), and interactive world generation (GWM-Worlds). Additionally, Runway updated its Gen 4.5 video model to include native audio and one-minute multi-shot generation with character consistency.

OpenAI's Sora Video Generation App Achieves Massive Launch Success, Rivaling ChatGPT Adoption

OpenAI's video-generating app Sora recorded approximately 627,000 iOS downloads in its first week in the U.S. and Canada, nearly matching ChatGPT's first-week performance of 606,000 U.S. downloads. Despite being invite-only, Sora reached the No. 1 position on the U.S. App Store and has driven widespread creation of AI-generated videos, including controversial deepfakes of deceased individuals.

OpenAI Launches GPT-5 Pro, Sora 2 Video Model, and Cost-Efficient Voice API at Dev Day

OpenAI announced major API updates at its Dev Day, introducing GPT-5 Pro for high-accuracy reasoning tasks, Sora 2 for advanced video generation with synchronized audio, and a cheaper voice model called gpt-realtime mini. These releases target developers across finance, legal, healthcare, and creative industries, aiming to expand OpenAI's developer ecosystem with more powerful and cost-effective tools.

OpenAI Launches Sora Social App with Controversial Deepfake 'Cameo' Feature

OpenAI has released Sora, a TikTok-like social media app with advanced video generation capabilities that allow users to create realistic deepfakes through a "cameo" feature using biometric data. The app is already filled with deepfakes of CEO Sam Altman and copyrighted characters, raising significant concerns about disinformation, copyright violations, and the democratization of deepfake technology. Despite OpenAI's emphasis on safety features, users are already finding ways to circumvent guardrails, and the realistic quality of generated videos poses serious risks for manipulation and abuse.

OpenAI Launches Sora 2 Video Generator with TikTok-Style Social Platform

OpenAI released Sora 2, an advanced audio and video generation model with improved physics simulation, alongside a new social app called Sora. The platform features a "cameos" function allowing users to insert their own likeness into AI-generated videos and share them on a TikTok-style feed. The app raises significant safety concerns regarding non-consensual content and misuse of personal likenesses.

AI Video Companies Luma and Runway Target Robotics and Autonomous Vehicles for Revenue Expansion

AI video-generating startups Luma and Runway are exploring partnerships with robotics and self-driving car companies as potential new revenue streams beyond their current focus on movie studios. Luma is particularly positioned for this expansion given their announced goal of building 3D AI world models that can understand and interact with physical environments.

Google Deploys Veo 3 Video Generation AI Model to Global Gemini Users

Google has rolled out its Veo 3 video generation model to Gemini users in over 159 countries, allowing paid subscribers to create 8-second videos from text prompts. The service is limited to 3 videos per day for AI Pro plan subscribers, with image-to-video capabilities planned for future release.