How does Gemini 3.5 Flash compare to Gemini 3.1 Pro on coding tasks?

According to Google AI Blog, Gemini 3.5 Flash outperforms Gemini 3.1 Pro on coding and agentic benchmarks including Terminal-Bench 2.1 (76.2%), GDPval-AA (1656 Elo), and MCP Atlas (83.6%).

When will Gemini 3.5 Pro be available?

Google AI Blog reports Gemini 3.5 Pro is being used internally and will roll out next month (expected June 2026).

What can Gemini Omni generate?

Gemini Omni starts with video generation and will eventually support any output type from any input, with physics-aware rendering for gravity, kinetic energy, and fluid dynamics.

How is Gemini Omni content watermarked?

Videos include Google's imperceptible SynthID digital watermark, verifiable through the Gemini app, Gemini in Chrome, and Search.

Google launches Gemini 3.5 Flash, which outscores Gemini 3.1 Pro on agentic benchmarks; 3.5 Pro to follow in June

Google shifted its model lineup toward agentic efficiency at I/O 2026 on May 20, launching Gemini 3.5 Flash as a frontier-intelligence-at-speed tier and previewing a physics-aware video generator called Gemini Omni. The Flash release compresses what Google positions as flagship-class reasoning into latency profiles previously associated with smaller models, while deferring the Pro-tier counterpart to next month.

Gemini 3.5 Flash: agentic benchmarks and cost positioning

According to Google AI Blog, Gemini 3.5 Flash delivers performance gains over Gemini 3.1 Pro on challenging benchmarks: Terminal-Bench 2.1 (76.2%), GDPval-AA (1656 Elo), and MCP Atlas (83.6%). The model is immediately available via Google Antigravity, the Gemini API, Google AI Studio, and Android Studio.

Google frames Gemini 3.5 Flash as optimized for long-horizon agentic tasks: planning, building, and iterating across codebases, application development, and document preparation. The company claims that work previously requiring days for developers or weeks for auditors can now be completed in a fraction of the time, often at less than half the cost of other frontier models. The claim is aimed at engineering teams evaluating inference infrastructure for autonomous-coding workflows, where latency and per-task cost feed directly into ROI calculations for Q3 2026 vendor selection.
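A team weighing the "less than half the cost" claim could check it against its own workload with a calculation along these lines. This is a minimal sketch, not a published method: the model names, per-million-token prices, and token counts are all hypothetical placeholders and should be replaced with real rate-card values and measured usage.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelPricing:
    """Per-million-token prices in USD. Values used below are placeholders."""
    name: str
    input_per_mtok: float
    output_per_mtok: float


def task_cost(pricing: ModelPricing, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one agentic task, given its total token usage."""
    return (input_tokens * pricing.input_per_mtok
            + output_tokens * pricing.output_per_mtok) / 1_000_000


def cost_ratio(candidate: ModelPricing, baseline: ModelPricing,
               input_tokens: int, output_tokens: int) -> float:
    """Candidate cost divided by baseline cost for the same workload."""
    return (task_cost(candidate, input_tokens, output_tokens)
            / task_cost(baseline, input_tokens, output_tokens))


# Hypothetical prices, NOT published figures: substitute real rate-card values.
candidate = ModelPricing("candidate-model", input_per_mtok=1.0, output_per_mtok=4.0)
baseline = ModelPricing("baseline-model", input_per_mtok=3.0, output_per_mtok=12.0)

# Hypothetical workload: one long-horizon run using 2M input and 200k output tokens.
ratio = cost_ratio(candidate, baseline, input_tokens=2_000_000, output_tokens=200_000)
print(f"candidate costs {ratio:.0%} of baseline")
```

The ratio only compares token spend for an identical workload. It does not account for differences in task success rate, retries, or the number of tokens each model actually consumes to finish the same job, all of which would need to be measured separately.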

The model also extends Gemini 3’s multimodal foundation with richer, more interactive web UI and graphics generation, a signal that Google is investing further in frontend-automation agents.

Gemini Omni: video generation with physics understanding

Gemini Omni introduces physics-aware generative video, combining Gemini’s knowledge base (history, science, culture) with intuitive understanding of gravity, kinetic energy, and fluid dynamics. According to Google AI Blog, this bridges photorealism and storytelling, addressing a known weakness in prior video models—unrealistic object behavior and force interaction.

Videos include Google’s imperceptible SynthID digital watermark, verifiable through the Gemini app, Gemini in Chrome, and Search. Watermarking at generation time addresses authenticity concerns for creative and professional workflows, though independent validation of the watermark's robustness against removal attacks remains pending.

Over time, Omni will support any output type from any input type; the video-first release suggests Google is anchoring the product in a high-signal use case before expanding the generative scope.

Gemini 3.5 Pro and agent-first platform expansion

Google says Gemini 3.5 Pro is already in internal use, with a rollout planned for next month (June 2026). The 100 items the company announced across I/O signal a broader push beyond model releases, including agent frameworks, search integration, and tool-building surfaces across Google AI Studio and Android Studio.

Why This Matters

For engineering leads choosing between Claude and Gemini for autonomous-coding workflows in Q3 2026, the 3.5 Flash latency-cost profile and agentic benchmark lead shift the ROI baseline. Teams building long-horizon planning agents will test whether the Terminal-Bench and MCP Atlas gains hold up on production codebases and whether the less-than-half-cost positioning (which Google qualifies with "often") survives comparison to Claude’s batch-pricing models.

For creative professionals and content studios, Omni’s physics understanding and watermarking introduce a credibility layer absent in prior video generators—but only if watermark robustness is independently validated. The June Pro rollout will clarify whether Google is positioning Pro as a planning-and-reasoning tier (competing directly with GPT-5 and Claude Opus) or as a specialized frontier model for specific domains.