The new Animate with Text tool generates animation frames from a single reference image and a text description of the action. It produces smoother, more natural animations than the previous version.
How It Works
1. Upload a reference image (your character or object)
2. Enter an action description (e.g., "walking", "jumping", "attacking")
3. Choose the number of frames (4-16, must be even)
4. Generate
The tool animates your reference image based on the action you describe, keeping the first frame identical to your input.
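The frame-count rule in the steps above (4 to 16, even numbers only) can be checked before submitting. The following is a minimal sketch of that rule only; the function names `validate_frame_count` and `valid_frame_counts` are hypothetical helpers and are not part of the tool. It also ignores the per-size maximum described in the Cost section.

```python
# Hypothetical helpers illustrating the documented frame-count rule.
# They are not part of the Animate with Text tool itself.

def validate_frame_count(frame_count: int, minimum: int = 4, maximum: int = 16) -> None:
    """Raise ValueError unless frame_count is an even number in [minimum, maximum]."""
    if not minimum <= frame_count <= maximum:
        raise ValueError(
            f"frame count must be between {minimum} and {maximum}, got {frame_count}"
        )
    if frame_count % 2 != 0:
        raise ValueError(f"frame count must be even, got {frame_count}")


def valid_frame_counts(minimum: int = 4, maximum: int = 16) -> list[int]:
    """List every even frame count in the documented range."""
    return [n for n in range(minimum, maximum + 1) if n % 2 == 0]


if __name__ == "__main__":
    print(valid_frame_counts())
    validate_frame_count(8)  # passes silently
```

Under this rule there are seven valid choices: 4, 6, 8, 10, 12, 14, and 16.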
Key Differences from Previous Version
Free for all users — no subscription required
Flexible frame count — choose 4 to 16 frames (must be even)
Dynamic pricing — cost scales with image size and frame count
Faster generation — uses a new animation model
First frame preserved — your input image is always kept as frame 1
Cost
Cost depends on image size and frame count. Common examples:
Size       4 frames   8 frames   16 frames
64x64      1 gen      2 gens     3 gens
96x96      2 gens     3 gens     5 gens
128x128    3 gens     5 gens     9 gens
256x256    9 gens     -          -
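Larger images support fewer frames because each generation has a total pixel budget; a dash in the table marks a size and frame-count combination that is not available. The exact cost is shown in the UI before you generate.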
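For quick estimates, the documented examples can be kept in a small lookup table. This sketch only encodes the values listed above; it does not reproduce the tool's actual pricing formula or its pixel budget, neither of which is stated here. The names `DOCUMENTED_COSTS`, `documented_cost`, and `total_pixels` are hypothetical.

```python
from typing import Optional

# Costs in gens, copied from the table above.
# Keys are (side length in pixels of a square image, frame count).
# Combinations marked with a dash in the table are left out.
DOCUMENTED_COSTS = {
    (64, 4): 1, (64, 8): 2, (64, 16): 3,
    (96, 4): 2, (96, 8): 3, (96, 16): 5,
    (128, 4): 3, (128, 8): 5, (128, 16): 9,
    (256, 4): 9,
}


def documented_cost(size: int, frames: int) -> Optional[int]:
    """Return the documented cost in gens, or None if the combination is not listed."""
    return DOCUMENTED_COSTS.get((size, frames))


def total_pixels(size: int, frames: int) -> int:
    """Total pixels across all frames of a square size x size animation."""
    return size * size * frames


if __name__ == "__main__":
    print(documented_cost(128, 16))  # listed in the table
    print(documented_cost(256, 8))   # not listed, so None
    print(total_pixels(128, 16), total_pixels(256, 4))
```

In the documented examples, 128x128 at 16 frames and 256x256 at 4 frames both total 262,144 pixels and both cost 9 gens, which fits the idea of a budget on total pixels. For any combination not in the table, rely on the cost shown in the UI.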
Options
Reference Image (required): Your character or object sprite to animate
Animation Action (required): What animation to create (e.g., "walk", "run", "jump", "attack", "idle")
Frame Count: Number of animation frames, 4-16 (must be even). Maximum depends on image size.
Remove Background: Generate the frames with a transparent background