Which model should I use for a product video?

Start with Veo 3.1 or Gemini Omni Flash. Upload a product image as a reference, write a prompt describing the scene and camera movement, then preview credits before generating.

Can I upload reference images?

Yes. Supported video workflows can use image references, and Gemini Omni Flash supports up to 3 image URLs in the current flow.

Can I upload audio or video as input?

Not in the current Gemini Omni Flash workflow. It accepts a prompt and optional image URLs. Audio is an optional generated output.

Should I generate images before video?

Yes, when you need a clear visual direction. Create a style frame or product visual with GPT Image 2, then use it as a reference for video.

When should I use 4K?

Use higher resolution after the direction is confirmed. Draft with lower-cost settings first, then move to higher quality for final output.

How do I avoid wasting credits?

Choose the correct workflow, start with shorter or silent drafts, and review the credit estimate before every generation.

What inputs does each workflow need?

Video: text prompt and optional images. GPT Image 2: prompt and supported image references. Image editing: source image plus edit prompt where supported.

What is the maximum video duration?

The current VEO 3.1 official flow supports 4, 6, or 8 second clips.

Gemini Omni

Gemini Omni Video Generator: Create AI Video From Text or Images

Start with a prompt, add reference images when needed, choose duration, resolution, aspect ratio, and optional audio, then preview credits before generating.

Create AI Video

View Pricing

One shared credit pool covers video and image workflows. Preview cost before every generation.

Need video?

Use Veo 3.1 or Gemini Omni Flash for text-to-video and image-to-video.

Need visual control?

Upload reference images to guide product, character, style, or first/last-frame direction.

Need images?

Use GPT Image 2 to create style frames, thumbnails, product visuals, or references.

Preview credits before every generation

No surprises. See estimated cost before submitting and adjust settings before spending credits.

Cost shown before submit

Credit estimates update with duration, resolution, quality, and audio settings.

One credit pool

Video and image workflows draw from the same balance.

Failed generations are not charged

Credits are intended to be deducted only for completed generations.

Start lower, upgrade when ready

Use shorter or silent drafts for testing, then generate higher-quality finals.

Model Comparison Table

Pick the Gemini Omni workflow that matches your task

Choose based on output: video from text, video from images, image generation, image editing, or higher-resolution final output.

Video generation

For text-to-video and image-to-video with optional audio and 4/6/8 second settings.

Image workflows

For still image generation, image editing, and reference frames before video generation.

Complete Workflow

Define the output

Video clip, image-guided video, still image, or image edit.

Choose the model

Veo 3.1, Gemini Omni Flash, or GPT Image 2.

Add inputs

Write a prompt and upload supported reference images when useful.

Preview credits and generate

Check cost, choose settings, and submit the task.

Veo 3.1

Text-to-video and image-to-video with optional audio, 4/6/8 second clips, and 720p/1080p/4K output where supported.

Video

GPT Image 2

Generate and edit AI images at 1K, 2K, or 4K tiers for reference frames, product visuals, thumbnails, and concepts.

Image

Guide

Which Gemini Omni workflow should I choose?

Choose the workflow that matches the input you have and the output you need.

Veo 3.1

Text-to-video and image-to-video with optional audio, 4/6/8 second clips, and 720p/1080p/4K output where supported.

Text-to-video

Image-to-video

Optional audio

4/6/8s clips

Cinematic Scene Generation

An AI-generated cinematic scene demonstrating Veo 3.1 text-to-video output.

Dynamic Motion Showcase

A dynamic motion clip showcasing AI video generation with fluid movement.

Gemini Omni Flash

Fast Gemini Omni video workflow for prompt-led and image-guided drafts using the current VEO 3.1 official integration.

Fast video drafts

Prompt input

Up to 3 image references

Optional audio

GPT Image 2

Generate and edit AI images at 1K, 2K, or 4K tiers for reference frames, product visuals, thumbnails, and concepts.

Text-to-image

Image editing

1K / 2K / 4K tiers

Reference frames

Workflow

Preview credits and generate

Choose based on output: video from text, video from images, image generation, image editing, or higher-resolution final output.

Define the output

Video clip, image-guided video, still image, or image edit.

Task

Choose the model

Veo 3.1, Gemini Omni Flash, or GPT Image 2.

Model

Add inputs

Write a prompt and upload supported reference images when useful.

Setup

Preview credits and generate

Check cost, choose settings, and submit the task.

Generate

Which Gemini Omni workflow should I choose?

Choose the workflow that matches the input you have and the output you need.

Need video?

Use Veo 3.1 or Gemini Omni Flash for text-to-video and image-to-video.

Veo 3.1

Need visual control?

Upload reference images to guide product, character, style, or first/last-frame direction.

Gemini Omni Flash

Need images?

Use GPT Image 2 to create style frames, thumbnails, product visuals, or references.

GPT Image 2

Answers

Gemini Omni Generator FAQ

Answers for choosing the right Gemini Omni workflow.

Choose your workflow and start generating

Pick video or image, add supported inputs, preview credits, and generate.

Open Generator

Create AI Images