EverAnimate

EverAnimate: Minute-Scale Human Animation via Latent Flow Restoration

EverAnimate is a GPU-friendly post-training method for long-horizon human animation. It uses lightweight rank-32 LoRA adaptation on top of Wan2.2-Animate and improves long-horizon generation through persistent latent propagation and restorative flow matching.

Highlights

  • GPU-friendly training: rank-32 LoRA post-training on Wan2.2-Animate reaches strong results with only thousands of iterations on 4 GPUs.
  • Long-horizon animation: supports minute-scale human animation with controlled identity and motion consistency.
  • Fully open source: code, training/inference scripts, LoRA checkpoints, demo data, and ablation videos are released for reproducible research.

Repository Contents

ckpts/
`-- everanimate-v1-lora32/
    |-- stage1_480p.safetensors
    |-- stage2_480p.safetensors
    `-- stage3_720p_beta.safetensors  # Beta, tested only at small scale

data/
|-- train/       # Minimal training sample
|-- test/        # Inference demo
`-- ablation/    # Stage-1 and Stage-2 ablation videos

The ablation videos are the two-stage outputs: data/ablation/stage1.mp4 is the Stage-1 result, and data/ablation/stage2.mp4 is the Stage-2 result.

Download

Download checkpoints and demo data:

hf download epfl-vita/everanimate \
  --repo-type model \
  --include "ckpts/**" \
  --include "data/**" \
  --local-dir .

Download only the LoRA checkpoints:

hf download epfl-vita/everanimate \
  --repo-type model \
  --include "ckpts/everanimate-v1-lora32/*.safetensors" \
  --local-dir .

Download only the data:

hf download epfl-vita/everanimate \
  --repo-type model \
  --include "data/**" \
  --local-dir .

For full setup, clone the code repo and run:

git clone https://github.com/vita-epfl/EverAnimate.git
cd EverAnimate
bash scripts/download_models.sh

The script downloads the Wan2.2-Animate base files from Wan-AI/Wan2.2-Animate-14B and the EverAnimate files from this repository.

Usage

Inference with the bundled demo:

bash test.sh

Training with the bundled minimal sample:

bash train_stage1.sh
bash train_stage2.sh

See the GitHub repository for environment setup, scripts, and implementation details.

Model Details

  • Base model: Wan2.2-Animate
  • Checkpoint type: LoRA
  • LoRA rank: 32
  • Resolution: 480p stable checkpoints and a 720p beta checkpoint
  • Stages: Stage 1 learns latent propagation; Stage 2 adds restorative flow matching behavior; Stage 3 provides a 720p beta LoRA.

The 720p checkpoint is a beta release and has only been tested at small scale. A more thoroughly fine-tuned and evaluated 720p checkpoint is planned for a future update.

Intended Use

These checkpoints and sample assets are intended for research on controllable human image animation, long-horizon video generation, and reproducible comparison of EverAnimate's two-stage post-training pipeline.

Limitations

The released LoRA checkpoints inherit the capabilities and limitations of the Wan2.2-Animate backbone. Performance can vary with input image quality, pose accuracy, motion difficulty, and generation length. Users should follow the licenses and usage terms of the base model and any input assets.

Citation

@misc{li2026everanimate,
  title         = {EverAnimate: Minute-Scale Human Animation via Latent Flow Restoration},
  author        = {Wuyang Li and Yang Gao and Mariam Hassan and Lan Feng and Wentao Pan and Po-Chien Luan and Alexandre Alahi},
  year          = {2026},
  eprint        = {2605.15042},
  archivePrefix = {arXiv},
  primaryClass  = {cs.CV},
  url           = {https://arxiv.org/abs/2605.15042}
}

Acknowledgements

This work builds on the following projects:

This work has also been inspired by SVI 2.0 Pro and LongCat Video Avatar.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for epfl-vita/everanimate

Adapter
(2)
this model

Papers for epfl-vita/everanimate