What is Gemini Omni and how does it differ from earlier Gemini models?

Gemini Omni is Google's new multimodal model that generates video from any combination of text, images, audio, and video inputs. It grounds video generation in Gemini's real-world knowledge and supports conversational editing. Earlier Gemini models (3.5, 4.0) were primarily text-focused; Omni adds native video synthesis as its flagship capability.

When will Gemini 3.5 Pro be available?

According to the Google AI Blog, Gemini 3.5 Pro is in internal testing with a planned rollout targeted for June 2026. Only Gemini 3.5 Flash is currently generally available.

What are Search agents and how do they work?

Search agents are autonomous AI agents running within Google Search that reason across the web 24/7 to monitor topics you care about. They send proactive updates with relevant information from blogs, news, social posts, and real-time data (finance, shopping, sports) at moments when new information is most relevant.

Which platforms is Gemini Omni Flash available on?

Gemini Omni Flash is rolling out to Google AI Plus, Pro, and Ultra subscribers via the Gemini app and Google Flow. It is also available at no cost to YouTube Shorts and YouTube Create App users.

Google I/O 2026: Gemini Omni, Multimodal Search, and AI Agents debut

BLUF

At Google I/O 2026, Google unveiled three major releases: Gemini Omni Flash, a multimodal model that generates video from text, images, audio, and video inputs; Gemini 3.5 Flash, now generally available for agentic reasoning and coding tasks; and Search agents, autonomous AI assistants that monitor the web 24/7 to deliver proactive, real-time updates on topics users care about. These announcements expand Gemini’s reach across consumer platforms (YouTube, Search, the Gemini app) and enterprise tools (Android Studio, Gemini Enterprise).

Gemini Omni: Video Generation From Multimodal Input

According to the Google AI Blog, Gemini Omni Flash is the first model in Google’s Omni family, capable of generating high-quality video from combinations of images, audio, video, and text as input. The model’s generation is grounded in Gemini’s real-world knowledge, and users can edit outputs through natural-language conversation—a conversational editing loop that reduces friction compared to traditional post-production interfaces.

Gemini Omni Flash is rolling out immediately to Google AI Plus, Pro, and Ultra subscribers via the Gemini app and Google Flow. Notably, Google is offering the model at no cost to YouTube Shorts and YouTube Create App users, signaling an intent to drive adoption among content creators and short-form video platforms. This dual-channel distribution—premium subscribers plus free access on YouTube—positions Omni as a competitor to text-to-video models like Runway and Pika, while leveraging YouTube’s 2+ billion logged-in users as a distribution base.

Gemini 3.5 Flash: General Availability for Agents and Long-Horizon Tasks

Gemini 3.5 Flash is now generally available across multiple distribution channels. According to the Google AI Blog, the model combines “frontier intelligence with action,” excelling at complex long-horizon tasks and coding. It is available via the Gemini API, Google AI Studio, Android Studio, Gemini Enterprise Agent Platform, and Gemini Enterprise. The model is also accessible to all users globally within Search’s AI Mode and the Gemini app.

Google is also developing Gemini 3.5 Pro, which is currently in internal testing with a targeted rollout in June 2026. The Pro variant is expected to deliver higher-capability agentic reasoning than the Flash tier, following Google’s established model-family stratification (Flash for speed and efficiency, Pro for capability).

Search Agents: Autonomous Web Monitoring at Scale

Google’s most significant announcement is the rollout of information agents within Google Search. According to the Google AI Blog, these agents operate autonomously in the background 24/7, reasoning across the web—including blogs, news sites, social posts, and real-time data sources (finance, shopping, sports)—to monitor topics relevant to individual users.

Rather than requiring explicit search queries, information agents send proactive, comprehensive updates at moments when new information is available, paired with links for further exploration. This represents a shift from reactive search (user-initiated query) to autonomous monitoring (agent-initiated delivery). The agents are designed to handle multiple, customizable instances per user, enabling task-specific monitoring (job alerts, competitor news, fantasy sports scores) without manual query repetition.

Why This Matters

These three releases address distinct adoption bottlenecks in AI’s consumer and enterprise migration.

For content creators, Gemini Omni Flash’s free access on YouTube Shorts and YouTube Create App removes friction to video synthesis—the primary barrier is now capability, not cost. If the model’s video quality matches or exceeds Runway’s fidelity, YouTube’s 2+ billion users represent a significantly larger addressable market than Runway’s direct customer base.

For enterprise development teams, Gemini 3.5 Flash’s general availability across Android Studio, Google AI Studio, and the Gemini API signals maturity in agentic coding use cases. Teams currently piloting agent-based code generation with Claude or GPT-4 can now evaluate Gemini 3.5 Flash’s long-horizon reasoning without separate procurement.

For product teams in finance, e-commerce, and real estate, Search agents represent a retention mechanism worth piloting. Autonomous monitoring creates habitual check-in behavior—users do not need to remember to search; agents push updates proactively. This shifts the engagement model from sporadic search traffic to scheduled, anticipated notifications, increasing daily active user metrics and average session length.

The timing is significant: information agents formalize what Perplexity, specialized vertical search engines, and monitoring platforms have only partially automated. Google’s integration directly into Search gives agents distribution at zero acquisition cost, a structural advantage unavailable to standalone startups.