Supercharge your AI video production pipeline. Automate batch queues, map double-keyframe transitions, and synthesize synchronized background audio in a premium visual environment.
Why AI Video Generation Stalls in standard workflows
Designed around what the codebase actually does
Don't leave video motion to chance. Upload starting images and optional ending keyframe images (`inputImage` and `inputEndImage`). The studio automatically processes, compresses, and maps them to generate seamless, controlled transitions.
Inject atmospheric depth directly. Toggle audio generation in the dashboard config to call model parameters to generate matched soundscapes.
Type or paste lists of multiple prompts with our Bulk Import panel. Run, pause, or skip queues. The client tracks operation indices seamlessly.
Segment your historical records. Switch active environment IDs (`projectEnvId`) to isolate visual campaigns, prompts, and storage settings.
If the safety filter flags a prompt due to guidelines, an interactive countdown modal prompts the operator. Automatically skip or halt queue to avoid quota lockouts.
Watch how Flow Video Studio orchestrates cloud storage, AI model calls, and real-time operation polling to generate cinematic content.
Flexible tiers for creators and studios
Generations are billed dynamically by tokens based on the model selected.
(e.g. Veo 3.1 Lite costs 5 tokens/sec with audio, Imagen 3.0 costs 3 tokens/image).
Need more generation capacity? Purchase additional tokens programmatically calculated at your active subscription plan rate.
You do not currently have an active monthly subscription. You must purchase a Pro or Elite plan first to unlock add-on token purchasing.
Every generation consumes tokens dynamically based on the complexity and quality profile of the model. Video models are billed per generated second (including optional audio synchronization), while image models are billed per static asset.
| Model Name & Capabilities | Token Rate | Hobbyist (Free) | Pro Creator (15k Tokens) | Elite Creator (30k Tokens) |
|---|---|---|---|---|
| 🎬 Cinematic Video Generation (Veo Models) | ||||
| Veo 3.1 Standard1080p Full HD output, premium physical motion simulation, and high cinematic photorealism. | 720p: 20 tokens/s (muted) | 40 tokens/s (audio) 1080p: 30 tokens/s (muted) | 50 tokens/s (audio) | Simulated PreviewWatermarked mock video | 300 to 750 secondse.g., ~60 to ~150 clips of 5s | 600 to 1,500 secondse.g., ~120 to ~300 clips of 5s |
| Veo 3.1 Fast720p/1080p, accelerated generation speeds. Perfect balance of speed and frame stability. | 720p: 10 tokens/s (muted) | 15 tokens/s (audio) 1080p: 15 tokens/s (muted) | 20 tokens/s (audio) | Simulated PreviewWatermarked mock video | 750 to 1,500 secondse.g., ~150 to ~300 clips of 5s | 1,500 to 3,000 secondse.g., ~300 to ~600 clips of 5s |
| Veo 3.1 Lite720p/1080p, optimized latency. Ideal for swift iterations and draft layout storyboarding. | 720p: 3 tokens/s (muted) | 5 tokens/s (audio) 1080p: 8 tokens/s (muted) | 10 tokens/s (audio) | Simulated PreviewWatermarked mock video | 1,500 to 5,000 secondse.g., ~300 to ~1,000 clips of 5s | 3,000 to 10,000 secondse.g., ~600 to ~2,000 clips of 5s |
| 🖼️ High-Fidelity Image Synthesis (Imagen / Banana Models) | ||||
| Nano Banana Pro (Imagen 3 Pro)Premium model configuration. Perfect for high-res advertising key visuals. | Standard: 8 tokens / image High-Res: 12 tokens / image | Simulated PreviewWatermarked SVG card | 1,250 to 1,875 imagesHigh resolution assets | 2,500 to 3,750 imagesHigh resolution assets |
| Nano Banana 2 (Imagen 3.1 Flash)Fast-rendering model configuration. Highly responsive storyboard layouts. | Standard: 5 tokens / image High-Res: 8 tokens / image | Simulated PreviewWatermarked SVG card | 1,875 to 3,000 imagesBalanced asset outputs | 3,750 to 6,000 imagesBalanced asset outputs |
| Nano Banana (Imagen 3.0 Standard)Standard text-to-image and image-to-image generator. Excellent general capability. | Standard: 3 tokens / image High-Res: 5 tokens / image | Simulated PreviewWatermarked SVG card | 3,000 to 5,000 imagesStandard asset outputs | 6,000 to 10,000 imagesStandard asset outputs |
| ⚡ Interactive Chat Assistant (Gemini Models) | ||||
| Gemini 3.5 Flash (Default Chat)Standard ultra-fast chat assistant. Ideal for prompt configuration advice. | Dynamic (Google cost) $1.50 In | $9.00 Out (per 1M) | 2 Chat MessagesFlat 1 token / message | ~30k to ~150k messagesCalculated at $0.01 per token | ~60k to ~300k messagesCalculated at $0.01 per token |
| Gemini 3.1 Pro PreviewPremium complex reasoning chat assistant. Best for long-context scene scripting. | Dynamic (Google cost) $2.00 In | $12.00 Out (per 1M) | 2 Chat MessagesFlat 1 token / message | ~22k to ~112k messagesCalculated at $0.01 per token | ~45k to ~225k messagesCalculated at $0.01 per token |
Get early access to our hosted team features, centralized project workspaces, and advanced rendering queues. We're launching new regions soon.