daqi

Sunshine8393

AI & ML interests

None yet

Organizations

None yet

Recent Activity

authored a paper 16 days ago

From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation

Paper • 2603.15600 • Published 17 days ago • 7

upvoted a paper 16 days ago

From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation

Paper • 2603.15600 • Published 17 days ago • 7

upvoted a collection 16 days ago

PRIMO R1

Official release of PRIMO R1, a 7B video MLLM for robotic process reasoning featuring RL-optimized models, SFT/RL datasets, and cross-domain benchmark • 3 items • Updated 16 days ago • 4

updated a dataset 5 months ago

Sunshine8393/RoboTwinQA_new

Updated Nov 7, 2025 • 3

published a dataset 5 months ago

Sunshine8393/RoboTwinQA_new

Updated Nov 7, 2025 • 3
