Instructions to use Saachu/svi-model with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use Saachu/svi-model with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline from diffusers.utils import load_image, export_to_video # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("Saachu/svi-model", dtype=torch.bfloat16, device_map="cuda") pipe.to("cuda") prompt = "A man with short gray hair plays a red electric guitar." image = load_image( "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/guitar-man.png" ) output = pipe(image=image, prompt=prompt).frames[0] export_to_video(output, "output.mp4") - Notebooks
- Google Colab
- Kaggle
pip install -U diffusers transformers accelerate
import torch
from diffusers import DiffusionPipeline
from diffusers.utils import load_image, export_to_video
# switch to "mps" for apple devices
pipe = DiffusionPipeline.from_pretrained("Saachu/svi-model", dtype=torch.bfloat16, device_map="cuda")
pipe.to("cuda")
prompt = "A man with short gray hair plays a red electric guitar."
image = load_image(
"https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/guitar-man.png"
)
output = pipe(image=image, prompt=prompt).frames[0]
export_to_video(output, "output.mp4")π― About This Repository
Stable-Video-Infinity(SVI) is able to generate ANY-length videos with high temporal consistency, plausible scene transitions, and controllable streaming storylines in ANY domains. This repository contains the model weights of SVI Family.
π Key Highlights
- OpenSVI: Everything is open-sourced: training & evaluation scripts, datasets, and more.
- Infinite Length: No inherent limit on video duration; generate arbitrarily long stories (see the 10βminute βTom and Jerryβ demo).
- Versatile: Supports diverse in-the-wild generation tasks: multi-scene short films, singleβscene animations, skeleton-/audio-conditioned generation, cartoons, and more.
- Efficient: Only LoRA adapters are tuned, requiring very little training data: anyone can make their own SVI easily.
π¦ Resources
| Model | Task | Input | Output | Hugging Face Link | Comments |
|---|---|---|---|---|---|
| ALL | Infinite possibility | Image + X | X video | π€ Folder | Family bucket! I want to play with all! |
| SVI-Shot | Single-scene generation | Image + Text prompt | Long video | π€ Model | Generate consistent long video with 1 text prompt. (This will never drift) |
| SVI-Film | Multi-scene generation | Image + Text prompt stream | Film-style video | π€ Model | Generate creative long video with 1 text prompt stream (5 second per text). |
| SVI-Film (Transition) | Multi-scene generation | Image + Text prompt stream | Film-style video | π€ Model | Generate creative long video with 1 text prompt stream. (More scene transitions due to the training data) |
| SVI-Tom&Jerry | Cartoon animation | Image | Cartoon video | π€ Model | Generate creative long cartoon videos with 1 text prompt stream (This will never drift in our 20 min test) |
| SVI-Talk | Talking head | Image + Audio | Talking video | π€ Model | Generate long videos with audio-conditioned human speaking |
| SVI-Dance | Dancing animation | Image + Skeleton | Dance video | π€ Model | Generate long videos with skeleton-conditioned human dancing |
Note: If you want to play with T2V, you can directly use SVI with an image generated by any T2I model!
π Citation
If you find our work helpful for your research, please consider citing our paper. Thank you so much!
@article{li2025stable,
title={Stable Video Infinity: Infinite-Length Video Generation with Error Recycling},
author={Wuyang Li and Wentao Pan and Po-Chien Luan and Yang Gao and Alexandre Alahi},
journal={arXiv preprint arXiv: arXiv:2510.09212},
year={2025},
url={https://huggingface.co/papers/2510.09212},
}
- Downloads last month
- -
Dataset used to train Saachu/svi-model
Viewer β’ Updated β’ 133 β’ 2.31k β’ 5
# Gated model: Login with a HF token with gated access permission hf auth login