ClawBench: Can AI Agents Complete Everyday Online Tasks? • Paper • 2604.08523 • Published 4 days ago • 241
Structured Distillation of Web Agent Capabilities Enables Generalization • Paper • 2604.07776 • Published 4 days ago • 17
electricsheepafrica/africa-re4r-project-assessment-phase-survey-gihembe-households • Viewer • Updated 4 days ago • 211 • 38 • 1
VideoZeroBench: Probing the Limits of Video MLLMs with Spatio-Temporal Evidence Verification • Paper • 2604.01569 • Published 11 days ago • 12
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence • Paper • 2603.28032 • Published 14 days ago • 339
ishikaa/acquisition_metamath_qwen3b_confidence_verydetailed_5000 • Text Generation • 3B • Updated 12 days ago • 366 • 1
AVControl: Efficient Framework for Training Audio-Visual Controls • Paper • 2603.24793 • Published 18 days ago • 26
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization • Paper • 2603.19835 • Published 24 days ago • 330