Pastel animation, unicorn

Our method maintains consistent subject identities across shots and follows the text prompts. VideoCrafter2 shows diverse motion but inconsistent characters (note the rainbow hair colors and the different style of the unicorn at the lake). Tokenflow Encoder causes blurring, and ConsiS Im2vid shows degraded motion. VSTAR maintains good identity but struggles to adhere to the text prompts and shows extensive non-specific motion.

Shot prompts: "galloping, rainbow"; "rearing, hind legs"; "leaping, lake".

Methods compared: Ours, VideoCrafter2, Tokenflow Encoder, ConsiS Im2vid, VSTAR.