Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published 4 days ago • 228
arithmetic-circuit-overloading/Llama-3.3-70B-Instruct-v2-3d-4M-400K-0.1-reverse-padzero-99-512D-1L-2H-2048I Text Generation • 3.69M • Updated 7 days ago • 951 • 1
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published 13 days ago • 339
Diffutron: A Masked Diffusion Language Model for Turkish Language Paper • 2603.20466 • Published 23 days ago • 8
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 23 days ago • 330
Unleashing Spatial Reasoning in Multimodal Large Language Models via Textual Representation Guided Reasoning Paper • 2603.23404 • Published 19 days ago • 7
HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions Paper • 2603.15612 • Published 27 days ago • 152
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published Feb 9 • 263