NanoWM-B/2 on RT-1 (300k steps)

Run identity

Training setup

Key             Value
Architecture    NanoWM-B/2
Dataset         rt1
Prediction      v
Noise schedule  squaredcos_cap_v2 (ZTSNR=True)
Steps           300000
Batch           8/GPU
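
The "Prediction" and "Noise schedule" rows can be illustrated with a short sketch. It is not taken from this repository: it assumes that squaredcos_cap_v2 denotes the squared-cosine beta schedule as named in Hugging Face diffusers, that ZTSNR=True means the betas are rescaled to zero terminal SNR, and that v is the usual v-prediction target. The function names and the 1000-step count are hypothetical choices for illustration only.

```python
import math


def cosine_betas(num_steps, max_beta=0.999):
    """Squared-cosine betas, capped at max_beta (assumed meaning of squaredcos_cap_v2)."""
    def alpha_bar(t):
        return math.cos((t + 0.008) / 1.008 * math.pi / 2) ** 2

    return [
        min(1.0 - alpha_bar((i + 1) / num_steps) / alpha_bar(i / num_steps), max_beta)
        for i in range(num_steps)
    ]


def cumulative_alphas(betas):
    """Running product of (1 - beta), i.e. alpha_bar at each timestep."""
    out, prod = [], 1.0
    for b in betas:
        prod *= 1.0 - b
        out.append(prod)
    return out


def rescale_zero_terminal_snr(betas):
    """Shift and scale sqrt(alpha_bar) so the last timestep has exactly zero SNR."""
    sqrt_ab = [math.sqrt(a) for a in cumulative_alphas(betas)]
    first, last = sqrt_ab[0], sqrt_ab[-1]
    # Shift so the final value is 0, then scale so the first value is unchanged.
    sqrt_ab = [(s - last) * first / (first - last) for s in sqrt_ab]
    ab = [s * s for s in sqrt_ab]
    # Convert alpha_bar back to per-step alphas, then to betas.
    alphas = [ab[0]] + [ab[i] / ab[i - 1] for i in range(1, len(ab))]
    return [1.0 - a for a in alphas]


def v_target(x0, noise, alpha_bar_t):
    """v-prediction target: sqrt(alpha_bar) * noise - sqrt(1 - alpha_bar) * x0."""
    return math.sqrt(alpha_bar_t) * noise - math.sqrt(1.0 - alpha_bar_t) * x0


NUM_STEPS = 1000  # assumption: the model card does not state the number of diffusion timesteps
betas = rescale_zero_terminal_snr(cosine_betas(NUM_STEPS))
alpha_bars = cumulative_alphas(betas)
```

Under these assumptions the final timestep carries no signal, so the v target there reduces to the negated clean sample. That is one common reason for pairing a zero-terminal-SNR schedule with v-prediction rather than noise prediction.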

Loading

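import sys

from omegaconf import OmegaConf
from safetensors.torch import load_file

# Make the repository's src/ directory importable so `models` resolves.
sys.path.insert(0, "src")
from models import get_models

# Load the training config shipped with the checkpoint.
cfg = OmegaConf.load("ckpt/config.yaml")
# Turn off the compile flag before building the model.
cfg.experiment.infra.compile = False
model = get_models(cfg).eval()  # build the model and switch to eval mode

# Load the weights; strict=True raises on missing or unexpected keys.
state_dict = load_file("ckpt/model.safetensors")
model.load_state_dict(state_dict, strict=True)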
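
As a quick sanity check after loading, the tensor sizes in state_dict can be summed and compared against the model size reported for the checkpoint (0.2B params). The helpers below are a hypothetical sketch, not part of the repository; they rely only on each value exposing a numel() method, as torch tensors do. The total may also include non-trainable buffers if any were saved.

```python
def count_parameters(state_dict):
    """Sum the element counts of every tensor in a state dict."""
    return sum(t.numel() for t in state_dict.values())


def format_billions(n):
    """Render a raw element count in billions with one decimal place."""
    return f"{n / 1e9:.1f}B"
```

With the state_dict from the loading snippet, `print(format_billions(count_parameters(state_dict)))` gives a figure to compare against the reported size.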

Safetensors: model size 0.2B params, tensor type F32

Collection including knightnemo/nanowm-b2-rt1-300k