문예은

GraysonSmith55

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

MyPCBench: A Benchmark for Personally Intelligent Computer-Use Agents

upvoted a paper 29 days ago

Exploring Autonomous Agentic Data Engineering for Model Specialization

liked a dataset 29 days ago

WINGNUS/ACL-OCL

View all activity

Organizations

None yet

upvoted a paper 12 days ago

MyPCBench: A Benchmark for Personally Intelligent Computer-Use Agents

Paper • 2606.16748 • Published 18 days ago • 7

upvoted a paper 29 days ago

Exploring Autonomous Agentic Data Engineering for Model Specialization

Paper • 2605.30407 • Published May 28 • 23

upvoted 4 papers about 1 month ago

Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning

Paper • 2605.21487 • Published May 20 • 23

Stress-Testing the Reasoning Competence of LLMs With Proofs Under Minimal Formalism

Paper • 2605.12524 • Published Apr 7 • 4

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Paper • 2605.14747 • Published May 14 • 147

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published May 13 • 274

upvoted 2 papers about 2 months ago

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

Paper • 2605.10899 • Published May 11 • 79

Linear-Time Global Visual Modeling without Explicit Attention

Paper • 2605.01711 • Published May 3 • 7

upvoted 2 papers 2 months ago

Heterogeneous Scientific Foundation Model Collaboration

Paper • 2604.27351 • Published Apr 30 • 222

Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation

Paper • 2604.18168 • Published Apr 20 • 96

upvoted 5 papers 3 months ago

FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling

Paper • 2604.06916 • Published Apr 8 • 34

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 509

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 638

ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers

Paper • 2603.24414 • Published Mar 25 • 183

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 353

upvoted 3 papers 4 months ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 373

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 526

MediX-R1: Open Ended Medical Reinforcement Learning

Paper • 2602.23363 • Published Feb 26 • 23