SuperWriter: Reflection-Driven Long-Form Generation with Large Language
Models
Paper
•
2506.04180
•
Published
•
33
AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven
Clip Generation
Paper
•
2506.10540
•
Published
•
37
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
Paper
•
2506.10974
•
Published
•
19
SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced
Academic Search
Paper
•
2507.15245
•
Published
•
11
GUI-G^2: Gaussian Reward Modeling for GUI Grounding
Paper
•
2507.15846
•
Published
•
133
ScreenCoder: Advancing Visual-to-Code Generation for Front-End
Automation via Modular Multimodal Agents
Paper
•
2507.22827
•
Published
•
99
Phi-Ground Tech Report: Advancing Perception in GUI Grounding
Paper
•
2507.23779
•
Published
•
44
SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution
Paper
•
2507.23348
•
Published
•
11
agentica-org/DeepSWE-Preview
Text Generation
•
33B
•
Updated
•
1.75k
•
•
191
AWorld: Dynamic Multi-Agent System with Stable Maneuvering for Robust
GAIA Problem Solving
Paper
•
2508.09889
•
Published
•
32
Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with
Long-Term Memory
Paper
•
2508.09736
•
Published
•
57
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm
Bridging Foundation Models and Lifelong Agentic Systems
Paper
•
2508.07407
•
Published
•
98
Efficient Agents: Building Effective Agents While Reducing Cost
Paper
•
2508.02694
•
Published
•
86
SSRL: Self-Search Reinforcement Learning
Paper
•
2508.10874
•
Published
•
97
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent
Distillation and Agentic RL
Paper
•
2508.13167
•
Published
•
129
Mobile-Agent-v3: Foundamental Agents for GUI Automation
Paper
•
2508.15144
•
Published
•
64
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
Paper
•
2508.16153
•
Published
•
160
AgentScope 1.0: A Developer-Centric Framework for Building Agentic
Applications
Paper
•
2508.16279
•
Published
•
53
CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer
Use Agent with Decoupled Reinforcement Learning
Paper
•
2508.20096
•
Published
•
36
rStar2-Agent: Agentic Reasoning Technical Report
Paper
•
2508.20722
•
Published
•
116
AWorld: Orchestrating the Training Recipe for Agentic AI
Paper
•
2508.20404
•
Published
•
38
UItron: Foundational GUI Agent with Advanced Perception and Planning
Paper
•
2508.21767
•
Published
•
12
GTA1: GUI Test-time Scaling Agent
Paper
•
2507.05791
•
Published
•
26
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn
Reinforcement Learning
Paper
•
2509.02544
•
Published
•
124
Morae: Proactively Pausing UI Agents for User Choices
Paper
•
2508.21456
•
Published
•
5
DeepResearch Arena: The First Exam of LLMs' Research Abilities via
Seminar-Grounded Tasks
Paper
•
2509.01396
•
Published
•
57
Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI
Agents
Paper
•
2509.06917
•
Published
•
41
Scaling up Multi-Turn Off-Policy RL and Multi-Agent Tree Search for LLM
Step-Provers
Paper
•
2509.06493
•
Published
•
11
F1: A Vision-Language-Action Model Bridging Understanding and Generation
to Actions
Paper
•
2509.06951
•
Published
•
32
EnvX: Agentize Everything with Agentic AI
Paper
•
2509.08088
•
Published
•
8
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making
through Multi-Turn Reinforcement Learning
Paper
•
2509.08755
•
Published
•
56
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B
Text Generation
•
31B
•
Updated
•
9.5k
•
787
WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon
Agents
Paper
•
2509.13309
•
Published
•
67
Paper
•
2509.10147
•
Published
•
26
QuantAgent: Price-Driven Multi-Agent LLMs for High-Frequency Trading
Paper
•
2509.09995
•
Published
•
15
Image-Text-to-Text
•
8B
•
Updated
•
598
•
12
VoiceAssistant-Eval: Benchmarking AI Assistants across Listening,
Speaking, and Viewing
Paper
•
2509.22651
•
Published
•
22
ACON: Optimizing Context Compression for Long-horizon LLM Agents
Paper
•
2510.00615
•
Published
•
32
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world
Markets?
Paper
•
2510.02209
•
Published
•
53
CoDA: Agentic Systems for Collaborative Data Visualization
Paper
•
2510.03194
•
Published
•
28
Agent Learning via Early Experience
Paper
•
2510.08558
•
Published
•
270
Training-Free Group Relative Policy Optimization
Paper
•
2510.08191
•
Published
•
44
CoDA: Coding LM via Diffusion Adaptation
Paper
•
2510.03270
•
Published
•
42
In-the-Flow Agentic System Optimization for Effective Planning and Tool
Use
Paper
•
2510.05592
•
Published
•
106
Don't Just Fine-tune the Agent, Tune the Environment
Paper
•
2510.10197
•
Published
•
28
Demystifying Reinforcement Learning in Agentic Reasoning
Paper
•
2510.11701
•
Published
•
31
Agentic Entropy-Balanced Policy Optimization
Paper
•
2510.14545
•
Published
•
104
PokeeAI/pokee_research_7b
Text Generation
•
8B
•
Updated
•
305
•
100
Text Generation
•
229B
•
Updated
•
119k
•
•
1.44k
moonshotai/Kimi-Linear-48B-A3B-Instruct
Text Generation
•
49B
•
Updated
•
72.5k
•
517
HyperClick: Advancing Reliable GUI Grounding via Uncertainty Calibration
Paper
•
2510.27266
•
Published
•
20
IterResearch: Rethinking Long-Horizon Agents via Markovian State
Reconstruction
Paper
•
2511.07327
•
Published
•
76
AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery
Paper
•
2511.11257
•
Published
•
24
CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards
Paper
•
2510.08529
•
Published
•
18
MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism
Paper
•
2511.11373
•
Published
•
12
UI2Code^N: A Visual Language Model for Test-Time Scalable Interactive UI-to-Code Generation
Paper
•
2511.08195
•
Published
•
31
cerebras/MiniMax-M2-REAP-162B-A10B
Text Generation
•
162B
•
Updated
•
1.14k
•
75
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning
Paper
•
2511.16043
•
Published
•
108
What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity
Paper
•
2511.15593
•
Published
•
57
Tongyi DeepResearch Technical Report
Paper
•
2510.24701
•
Published
•
99
AgentFold: Long-Horizon Web Agents with Proactive Context Management
Paper
•
2510.24699
•
Published
•
69
Search Self-play: Pushing the Frontier of Agent Capability without
Supervision
Paper
•
2510.18821
•
Published
•
17
Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO
Paper
•
2511.13288
•
Published
•
17
DRAFT-RL: Multi-Agent Chain-of-Draft Reasoning for Reinforcement Learning-Enhanced LLMs
Paper
•
2511.20468
•
Published
Unlocking the Power of Multi-Agent LLM for Reasoning: From Lazy Agents to Deliberation
Paper
•
2511.02303
•
Published
•
1
AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn,
Multi-Task Framework
Paper
•
2510.04206
•
Published
•
3
MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games
Paper
•
2510.15414
•
Published
•
1
Multi-Agent Tool-Integrated Policy Optimization
Paper
•
2510.04678
•
Published
•
30
Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
Paper
•
2511.21678
•
Published
•
12
Latent Collaboration in Multi-Agent Systems
Paper
•
2511.20639
•
Published
•
117
Guided Self-Evolving LLMs with Minimal Human Supervision
Paper
•
2512.02472
•
Published
•
51
open-thoughts/OpenThinker-Agent-v1
Text Generation
•
8B
•
Updated
•
1.63k
•
88
PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing
Paper
•
2512.02589
•
Published
•
67
DeepCode: Open Agentic Coding
Paper
•
2512.07921
•
Published
•
31
nvidia/Nemotron-Orchestrator-8B
Text Generation
•
8B
•
Updated
•
65.9k
•
471
WebOperator: Action-Aware Tree Search for Autonomous Agents in Web Environment
Paper
•
2512.12692
•
Published
•
13
A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning
Paper
•
2512.14442
•
Published
•
10
Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize
Memories via Reinforcement Learning
Paper
•
2508.19828
•
Published
•
7
Step-DeepResearch Technical Report
Paper
•
2512.20491
•
Published
•
77
Paper
•
2512.16301
•
Published
•
98
Nested Browser-Use Learning for Agentic Information Seeking
Paper
•
2512.23647
•
Published
•
17
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem
Paper
•
2512.24873
•
Published
•
51
Close the Loop: Synthesizing Infinite Tool-Use Data via Multi-Agent Role-Playing
Paper
•
2512.23611
•
Published
•
1
Klear-AgentForge: Forging Agentic Intelligence through Posttraining Scaling
Paper
•
2511.05951
•
Published
AgentMath: Empowering Mathematical Reasoning for Large Language Models via Tool-Augmented Agent
Paper
•
2512.20745
•
Published