MOVA: Towards Scalable and Synchronized Video-Audio Generation Paper • 2602.08794 • Published 1 day ago • 129
Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition Paper • 2602.08439 • Published 1 day ago • 28
Unified Personalized Reward Model for Vision Generation Paper • 2602.02380 • Published 8 days ago • 19
Unified Personalized Reward Model for Vision Generation Paper • 2602.02380 • Published 8 days ago • 19
FourierSampler: Unlocking Non-Autoregressive Potential in Diffusion Language Models via Frequency-Guided Generation Paper • 2601.23182 • Published 11 days ago • 20
LightOnOCR-2 🦉 Collection LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family • 12 items • Updated 20 days ago • 22
CapRL Collection Stimulating Dense Image Caption Capabilities via Reinforcement Learning • 10 items • Updated Dec 30, 2025