News

Google announces Gemini Omni for multimodal AI

Google has unveiled Gemini Omni, its latest multimodal AI model that works with text, images, audio, and video. The first release, called Gemini Omni Flash, is now available in the Gemini app, Google Flow, and YouTube Shorts.

With Gemini Omni, users can edit videos using simple prompts. Edits build on previous instructions, keeping characters, physics, and scenes consistent. You can add or remove objects, change environments, or reimagine actions without breaking continuity.

The model also combines knowledge with creativity, making it useful for explainers and storytelling across different formats.

Google is also introducing digital avatars that can replicate a user’s voice and likeness. Audio editing features are being tested to ensure responsible use. To keep things transparent, all Omni‑generated videos will have SynthID watermarks.

Also Read: Google announces Gemini Intelligence for Android

Gemini Omni Flash is available worldwide to Gemini AI Plus, Pro, and Ultra subscribers. Free access is offered through YouTube Shorts and the YouTube Create app. Wider rollout for developers and enterprises will follow in the coming weeks.

Bryan Rilloraza has been a fixture in the local tech scene for over a decade, sharing his perspective as a tech enthusiast and industry veteran. Backed by an MBA from De La Salle University, a Bachelor’s Degree from the University of the Philippines, and 20 years of corporate experience in the telecommunications and banking sectors, Bryan provides a practical, real-world analysis of how technology serves the consumer.

Write A Comment