Compare the current Gemini Omni workflows before you generate: Veo 3.1 for text-to-video and image-to-video, Gemini Omni Flash for fast video drafts, and GPT Image 2 for image generation and editing.
Open each model page for supported inputs, settings, pricing logic, and workflow guidance.
Text-to-video and image-to-video with optional audio, 4/6/8 second clips, and 720p/1080p/4K output where supported.
Fast Gemini Omni video workflow for prompt-led and image-guided drafts using the current VEO 3.1 official integration.
Side-by-side specs for the current Gemini Omni model pages. Actual credit cost depends on duration, resolution, quality, and audio.
Match the work you want to finish to the model that supports that input and output.
Create video from a prompt
Use Veo 3.1 when you want a text-led video clip with optional audio, 4/6/8 second duration, and landscape or portrait output.
Create a fast Gemini Omni video draft
Use Gemini Omni Flash for fast prompt-led or image-guided video drafts before spending more credits on final output.
Animate an image reference
Upload up to 3 image references when the video should follow a product, character, style frame, or first/last-frame direction.
Generate or edit still images
Use GPT Image 2 for thumbnails, product concepts, style frames, and reference images before video generation.
Common questions about choosing between Gemini Omni models.