view article Article Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem 3 days ago • 11
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 22 days ago • 107
VL-JEPA: Joint Embedding Predictive Architecture for Vision-language Paper • 2512.10942 • Published 28 days ago • 37
view article Article Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model 7 days ago • 15
SpecBundle Collection A collection of production-grade draft models for speculative decoding • 14 items • Updated 16 days ago • 13
LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper • 2512.13604 • Published 24 days ago • 73
Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation Paper • 2512.16913 • Published 21 days ago • 33
Olmo 3.1 Collection The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets... • 9 items • Updated 16 days ago • 43
Confucius Code Agent: An Open-sourced AI Software Engineer at Industrial Scale Paper • 2512.10398 • Published 28 days ago • 6
Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens Paper • 2511.19418 • Published Nov 24, 2025 • 28
view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand Dec 4, 2025 • 63
VibeVoice Collection Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated Dec 4, 2025 • 184