AutoRubric-T2I: Robust Rule-Based Reward Model for Text-to-Image Alignment Paper • 2605.17602 • Published 26 days ago • 19
Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published May 7 • 233
Maestro: Reinforcement Learning to Orchestrate Hierarchical Model-Skill Ensembles Paper • 2605.22177 • Published 25 days ago • 21
Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining Paper • 2605.14747 • Published May 14 • 145
DiffRetriever: Parallel Representative Tokens for Retrieval with Diffusion Language Models Paper • 2605.07210 • Published May 8 • 4
CGM-JEPA: Learning Consistent Continuous Glucose Monitor Representations via Predictive Self-Supervised Pretraining Paper • 2605.00933 • Published May 1 • 2
Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling Paper • 2604.27039 • Published Apr 29 • 25
Leveraging Verifier-Based Reinforcement Learning in Image Editing Paper • 2604.27505 • Published Apr 30 • 57
Modeling Sparse and Bursty Vulnerability Sightings: Forecasting Under Data Constraints Paper • 2604.16038 • Published Apr 17 • 4