All videos start from a fixed reference frame. It can be different one but I've locked he character and base scene — your prompt controls the motion, action, and atmosphere.
Output is locked at 480 × 480 px. Higher resolutions (720p, 1080p) are supported by the model but currently disabled — they require significantly more VRAM and generation time.
This uses a local model running on local hardware. No data is sent to OpenAI, Sora, Runway, or any external service. Generation is private and offline.
Generation based on a reference picture.
See Details on the left before generating.
Be patient, it takes about 5 minutes.