AI Text to Video Generator

Describe your vision and let AI create stunning videos

Audio Create
3:4 448×592

Balanced speed and quality, but face ID consistency is weaker than 800.

Each scene lasts 5s. Current video duration: 5s.

Describe your desired video in detail for best results0/1000

Use AI to generate prompts (experimental)

Please sign in to use text to video feature

Frequently Asked Questions

Common questions about the Text to Video feature

1

How should I write a good prompt?

Be specific and descriptive. Include details about the scene, characters, actions, camera movements, lighting, and style. For example: 'A majestic dragon flying over snow-capped mountains at sunset, camera slowly zooms out revealing a medieval castle below, cinematic lighting, 4k quality'.

2

Which resolution should I choose?

384 is mainly for GIFs or simple animations; its real video quality is limited. 512 is a balanced choice, but in i2v face consistency is not always guaranteed. 768 provides the best face consistency.

3

What video sizes are supported?

We support three aspect ratios: 16:9 (512x288) for landscape videos, 4:3 (448x336) for classic format, and 1:1 (384x384) for square videos suitable for social media.

4

There are so many img2vid products on the market. What makes your img2vid different?

Our img2vid generates a 20-second video in about 2 minutes at 512 resolution, with an estimated cost of $0.04, while maintaining stable quality.