Google’s Gemini Omni Leak: The Next Leap in AI Video Generation
Google is poised to significantly advance its AI-powered video creation capabilities with the introduction of Gemini Omni. Recent leaks suggest this new model is an extension of the existing Veo tool, designed to provide higher-quality results and more intuitive editing workflows. With Google I/O on the horizon, the industry is anticipating a formal announcement that will detail how Omni integrates into the broader Gemini ecosystem.
What is Gemini Omni?
Gemini Omni is a new video generation model that builds upon the foundation of Google’s Veo. While Veo has been the primary driver for Google’s video AI, metadata shared by leaker Max Weinbach identifies the new model as “VEO_MODE_OMNI,” confirming that Omni is an extension of the Veo architecture rather than a completely separate entity.
Early access to the tool was briefly available to a small group of Gemini users, as reported via Reddit. Although Google has since revoked this early access—reverting those users to the standard Veo experience—the leaked interface revealed a prompt inviting users to “Create with Gemini Omni.”
The Evolution of Veo
The transition to Omni follows a period of rapid iteration for Veo. The current Veo model is sitting on version 3.1, with version 3 having previously introduced audio integration into the video creation process. Gemini Omni represents the next step in this evolution, focusing on believability and user control.

Key Features and Capabilities
Based on the leaked invitation and user results, Gemini Omni introduces several high-utility features that move beyond simple prompt-to-video generation:
- Direct Chat Editing: Users can edit their videos directly within the chat interface, allowing for iterative refinements without starting over.
- Video Remixing: The model allows users to remix existing videos to change styles, elements, or compositions.
- Template Integration: The inclusion of templates suggests a more streamlined workflow for users who need specific professional formats.
- Enhanced Realism: Early results demonstrate a mastery of text rendering within videos and realistic human movement, though some observers note the output can occasionally look “too perfect,” maintaining a slightly artificial sheen.
Access, Pricing, and Usage Limits
Google is expected to tie Gemini Omni to its subscription-based AI tiers. Access will likely be a primary feature of the paid AI Pro plan. To manage the high computational cost of video generation, Google is implementing usage limits determined by the user’s specific subscription tier.
This tiered approach ensures that power users have the capacity to create a higher volume of content, while users on more affordable tiers maintain basic access to the tool.
The Road to Google I/O and Android 17
The timing of the Omni leak is strategic, occurring just before Google I/O. This event is expected to serve as the official launchpad for Gemini Omni and other advancements in the Gemini suite. The name “Omni”—meaning “everywhere”—signals Google’s intent to make sophisticated video generation a ubiquitous part of its AI offerings.
Beyond AI models, Google is also expected to introduce Android 17 through “The Android Show,” further signaling a massive update to the company’s software and AI integration strategy.
Key Takeaways
- Model Identity: Gemini Omni is an extension of Google Veo (labeled internally as VEO_MODE_OMNI).
- Core Functions: It introduces video remixing, in-chat editing, and templates.
- Visual Quality: Significant improvements in text rendering and human movement realism.
- Availability: Expected to be part of the paid AI Pro plan with tiered usage limits.
- Timeline: A full announcement is anticipated during Google I/O.
Frequently Asked Questions
Is Gemini Omni different from Veo?
Yes and no. While it is a new model with enhanced capabilities, metadata indicates it is an extension of the Veo framework, effectively acting as an upgraded mode for the existing technology.
How do I get access to Gemini Omni?
Currently, access is limited. While some users had early access via a leak, the general public is expected to gain access through the AI Pro plan following the official announcement at Google I/O.
What makes Omni better than previous AI video tools?
Omni focuses on “believability,” particularly in how it handles text and realistic human motion, while adding functional tools like direct chat editing and remixing that were previously unavailable.