-
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 167 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 129 -
SAM 3: Segment Anything with Concepts
Paper • 2511.16719 • Published • 131 -
Back to Basics: Let Denoising Generative Models Denoise
Paper • 2511.13720 • Published • 69
Collections
Discover the best community collections!
Collections including paper arxiv:2511.03276
-
A Survey of Context Engineering for Large Language Models
Paper • 2507.13334 • Published • 261 -
GUI-G^2: Gaussian Reward Modeling for GUI Grounding
Paper • 2507.15846 • Published • 133 -
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents
Paper • 2507.22827 • Published • 100 -
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Paper • 2508.18265 • Published • 214
-
TiDAR: Think in Diffusion, Talk in Autoregression
Paper • 2511.08923 • Published • 128 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 129 -
What Makes Diffusion Language Models Super Data Learners?
Paper • 2510.04071 • Published -
LLaDA2.0: Scaling Up Diffusion Language Models to 100B
Paper • 2512.15745 • Published • 87
-
TradingAgents: Multi-Agents LLM Financial Trading Framework
Paper • 2412.20138 • Published • 19 -
MinerU: An Open-Source Solution for Precise Document Content Extraction
Paper • 2409.18839 • Published • 40 -
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
Paper • 2509.22186 • Published • 146 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 137
-
Fast-dLLM v2: Efficient Block-Diffusion LLM
Paper • 2509.26328 • Published • 58 -
Attention Is All You Need for KV Cache in Diffusion LLMs
Paper • 2510.14973 • Published • 42 -
Attention Sinks in Diffusion Language Models
Paper • 2510.15731 • Published • 49 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 129
-
Describe What You See with Multimodal Large Language Models to Enhance Video Recommendations
Paper • 2508.09789 • Published • 5 -
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents
Paper • 2508.13186 • Published • 19 -
ZARA: Zero-shot Motion Time-Series Analysis via Knowledge and Retrieval Driven LLM Agents
Paper • 2508.04038 • Published • 1 -
Prompt Orchestration Markup Language
Paper • 2508.13948 • Published • 48
-
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 167 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 129 -
SAM 3: Segment Anything with Concepts
Paper • 2511.16719 • Published • 131 -
Back to Basics: Let Denoising Generative Models Denoise
Paper • 2511.13720 • Published • 69
-
TiDAR: Think in Diffusion, Talk in Autoregression
Paper • 2511.08923 • Published • 128 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 129 -
What Makes Diffusion Language Models Super Data Learners?
Paper • 2510.04071 • Published -
LLaDA2.0: Scaling Up Diffusion Language Models to 100B
Paper • 2512.15745 • Published • 87
-
TradingAgents: Multi-Agents LLM Financial Trading Framework
Paper • 2412.20138 • Published • 19 -
MinerU: An Open-Source Solution for Precise Document Content Extraction
Paper • 2409.18839 • Published • 40 -
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
Paper • 2509.22186 • Published • 146 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 137
-
Fast-dLLM v2: Efficient Block-Diffusion LLM
Paper • 2509.26328 • Published • 58 -
Attention Is All You Need for KV Cache in Diffusion LLMs
Paper • 2510.14973 • Published • 42 -
Attention Sinks in Diffusion Language Models
Paper • 2510.15731 • Published • 49 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 129
-
A Survey of Context Engineering for Large Language Models
Paper • 2507.13334 • Published • 261 -
GUI-G^2: Gaussian Reward Modeling for GUI Grounding
Paper • 2507.15846 • Published • 133 -
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents
Paper • 2507.22827 • Published • 100 -
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Paper • 2508.18265 • Published • 214
-
Describe What You See with Multimodal Large Language Models to Enhance Video Recommendations
Paper • 2508.09789 • Published • 5 -
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents
Paper • 2508.13186 • Published • 19 -
ZARA: Zero-shot Motion Time-Series Analysis via Knowledge and Retrieval Driven LLM Agents
Paper • 2508.04038 • Published • 1 -
Prompt Orchestration Markup Language
Paper • 2508.13948 • Published • 48