Google Unveils Gemini Omni Video Generation and Gemini 3.5 Flash for Agentic AI
Google announced Gemini Omni, a multimodal model that generates and edits video through natural language, and Gemini 3.5 Flash, optimized for complex agent workflows.
Google announced Gemini Omni, a multimodal model that generates and edits video through natural language, and Gemini 3.5 Flash, optimized for complex agent workflows.
Google unveiled Gemini Omni for video generation, Gemini 3.5 Flash for agents and coding, and autonomous Search agents that monitor the web 24/7.
Google unveiled Gemini 3.5 Flash at I/O 2026 as an agent-first model claiming frontier intelligence at sub-flagship latency, with Gemini Omni adding physics-aware video generation.
At I/O 2026, Google DeepMind unveiled Gemini Omni, a multimodal family that generates video from combined image, audio, and text inputs, signaling a shift from generative to simulational AI.
Google unveiled Gemini 3.5 Flash as the new default model, introduced Gemini Omni for text-to-video generation, and previewed always-on agents powered by Gemini Spark.
Gemini Omni Flash enables users to generate and edit videos through natural language prompts, combining multimodal inputs with real-world knowledge.
Google debuts AI agents, a redesigned search box, and Gemini 3.5 Flash integration in Search, targeting a billion-user base
Google unveiled Gemini Omni, a multimodal model capable of video creation, alongside Gemini 3.5 Flash and expanded agent capabilities across Search, Gmail, and shopping.
A new analysis shows that large language models excel at language tasks but struggle with seemingly simple visual reasoning—like reading analog clocks.