13 9

Анна Петрова

hmmm999

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

upvoted a paper 3 days ago

General Multimodal Protein Design Enables DNA-Encoding of Chemistry

liked a dataset 7 days ago

trz6i/lerobot0_5_3cams_WhitebackgroungSlowtest

View all activity

Organizations

None yet

upvoted a paper 1 day ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published 9 days ago • 347

upvoted a paper 3 days ago

General Multimodal Protein Design Enables DNA-Encoding of Chemistry

Paper • 2604.05181 • Published 6 days ago • 25

liked a dataset 7 days ago

trz6i/lerobot0_5_3cams_WhitebackgroungSlowtest

Updated 7 days ago • 30

liked a dataset 8 days ago

daaxila/twitter-xiaofang52ad-2026.02.28-2027575173281681703-6H1U_jMGT1F9Laud-part1

Viewer • Updated 8 days ago • 1 • 55

liked a dataset 10 days ago

Eimhin03/NM3-irish-pseudo-iter5

Viewer • Updated 10 days ago • 8.49k • 105

upvoted a paper 10 days ago

PLDR-LLMs Reason At Self-Organized Criticality

Paper • 2603.23539 • Published 30 days ago • 5

upvoted a paper 11 days ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 22 days ago • 330

upvoted a paper 13 days ago

SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

Paper • 2602.12783 • Published Feb 13 • 216

upvoted a paper 25 days ago

HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions

Paper • 2603.15612 • Published 26 days ago • 152

liked a model 25 days ago

ZJU-AI4H/Hulu-Med-4B

Image-Text-to-Text • 5B • Updated Nov 27, 2025 • 26.8k • 50

upvoted a paper 27 days ago

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210

upvoted a paper about 1 month ago

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 193

liked 4 models about 1 month ago

upvoted 4 papers about 2 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 519

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 220

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published Feb 11 • 244

TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents

Paper • 2602.07274 • Published Feb 6 • 208

Анна Петрова

AI & ML interests

Recent Activity

Organizations

hmmm999's activity