Physical World Understanding AI News & Updates

Google Plans to Combine Gemini Language Models with Veo Video Generation Capabilities

Google DeepMind CEO Demis Hassabis announced plans to eventually merge their Gemini AI models with Veo video-generating models to create more capable multimodal systems with better understanding of the physical world. This aligns with the broader industry trend toward "omni" models that can understand and generate multiple forms of media, with Hassabis noting that Veo's physical world understanding comes largely from training on YouTube videos.