Official Pytorch Implementation of Self-Refining Video Sampling
[Preprint 2026]-Self-Refining Video Sampling
Sangwon Jang*, Taekyung Ki*, Jaehyeong Jo*, Saining Xie, Jaehong Yoon†, Sung Ju hwang†
(* indicates equal contribution, † indicates equal advising)
Generated by Wan2.2-A14B T2V
This repo is organized into two main branches:
self-refine-video
├─ (current) : Wan Series (Based on Diffusers)
└─ cosmos2.5-predict: Cosmos 2.5 predict code
2026.02.01: 🚨 There is a Wan model loading error with
transformers==5.0.0. Please usetransformers==4.57.3until this issue is fixed.
Follow the model card for environment details:
Minimum dependencies (install in your environment):
- diffusers
- transformers
- torch
- pip install opencv-python imageio imageio-ffmpeg
Edit prompts and hyperparameters in inference_pnp.py (T2V) or inference_i2v_pnp.py (I2V), then run the script.
@article{jang2026selfrefining,
title={Self-Refining Video Sampling},
author={Sangwon Jang and Taekyung Ki and Jaehyeong Jo and Saining Xie and Jaehong Yoon and Sung Ju Hwang},
year={2026},
journal={arXiv preprint arXiv:2601.18577},
}
This work builds upon:
- Diffusers - Wan pipeline
- Cosmos2.5 - Github


