Yan Yang PRO

HelloKKMe

AI & ML interests

None yet

Recent Activity

Organizations

upvoted 2 papers about 22 hours ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 3 days ago • 112

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 2 days ago • 64

updated a dataset about 2 months ago

HelloKKMe/h

Preview • Updated Nov 22, 2025 • 6 • 1

published a dataset about 2 months ago

HelloKKMe/h

Preview • Updated Nov 22, 2025 • 6 • 1

updated 3 models 4 months ago

published a dataset 4 months ago

Salesforce/grounding_dataset

Viewer • Updated Oct 3, 2025 • 70.7k • 571 • 4

published a model 4 months ago

Salesforce/GTA1-7B

Image-Text-to-Text • 8B • Updated Oct 3, 2025 • 240 • 3

updated a dataset 4 months ago

Salesforce/grounding_dataset

Viewer • Updated Oct 3, 2025 • 70.7k • 571 • 4

updated a collection 4 months ago

GTA1

Collection

A collection of GUI grounding models trained with GRPO. • 5 items • Updated Oct 31, 2025 • 4

published a model 4 months ago

Salesforce/GTA1-7B-2507

Image-Text-to-Text • 8B • Updated Oct 3, 2025 • 399 • 3

updated a collection 4 months ago

GTA1

Collection

A collection of GUI grounding models trained with GRPO. • 5 items • Updated Oct 31, 2025 • 4

published a model 4 months ago

Salesforce/GTA1-32B

Image-Text-to-Text • 33B • Updated Oct 3, 2025 • 9 • 6
