Self-Forcing++: Towards Minute-Scale High-Quality Video Generation • Paper • 2510.02283 • Published Oct 2, 2025 • 96
4DNeX: Feed-Forward 4D Generative Modeling Made Easy • Paper • 2508.13154 • Published Aug 18, 2025 • 62
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning • Paper • 2507.05920 • Published Jul 8, 2025 • 11
Inject Semantic Concepts into Image Tagging for Open-Set Recognition • Paper • 2310.15200 • Published Oct 23, 2023 • 6
Tag2Text: Guiding Vision-Language Model via Image Tagging • Paper • 2303.05657 • Published Mar 10, 2023 • 1