alejdiaz
dram023
AI & ML interests
None yet
Recent Activity
updated a collection about 2 hours ago
10. Quantum IA updated a collection about 2 hours ago
04. Finance updated a collection about 2 hours ago
02. Prompt engineeringOrganizations
None yet
06. Math
-
The Hamilton-Jacobi Theory of Deep Learning
Paper • 2605.28983 • Published • 1 -
Convex Optimization: Algorithms and Complexity
Paper • 1405.4980 • Published -
Introduction to Online Convex Optimization
Paper • 1909.05207 • Published -
Optimal Stochastic Non-smooth Non-convex Optimization through Online-to-Non-convex Conversion
Paper • 2302.03775 • Published
05. Derecho
-
PRISM: A Multi-Dimensional Benchmark for Evaluating LLM Peer Reviewers
Paper • 2605.26730 • Published • 16 -
Off-the-Shelf LLMs as Process Scorers: Training-Free Alternative to PRMs for Mathematical Reasoning
Paper • 2606.01682 • Published • 7 -
LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs
Paper • 2606.06286 • Published • 8 -
Skip a Layer or Loop It? Learning Program-of-Layers in LLMs
Paper • 2606.06574 • Published • 22
03. AI imagenes
-
Geometry Matters: 3D Foundation Priors for Learning Semantic Correspondence
Paper • 2605.30093 • Published • 15 -
Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs
Paper • 2605.30611 • Published • 195 -
Next Forcing: Causal World Modeling with Multi-Chunk Prediction
Paper • 2606.11187 • Published • 6
09. Investigación
-
ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research
Paper • 2606.07591 • Published • 91 -
SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research
Paper • 2606.09730 • Published • 50 -
DeNovoSWE: Scaling Long-Horizon Environments for Generating Entire Repositories from Scratch
Paper • 2606.10728 • Published • 33 -
Towards Diverse Scientific Hypothesis Search with Large Language Models
Paper • 2606.10587 • Published • 2
07. IA agentica
-
Multi-Agent Computer Use
Paper • 2606.01533 • Published • 7 -
OpenSkill: Open-World Self-Evolution for LLM Agents
Paper • 2606.06741 • Published • 27 -
Socratic-SWE: Self-Evolving Coding Agents via Trace-Derived Agent Skills
Paper • 2606.07412 • Published • 12 -
Bayesian-Agent: Posterior-Guided Skill Evolution for LLM Agent Harnesses
Paper • 2606.08348 • Published • 14
04. Finance
-
TradingAgents: Multi-Agents LLM Financial Trading Framework
Paper • 2412.20138 • Published • 100 -
Kronos: A Foundation Model for the Language of Financial Markets
Paper • 2508.02739 • Published • 42 -
Absorbing Complexity: An Interaction-Native Knowledge Harness for Financial LLM Agents
Paper • 2606.01886 • Published • 5 -
A Benchmark and Framework for Evaluating Next Action Predictions in Spreadsheets
Paper • 2606.13802 • Published
02. Prompt engineering
-
Verifiable Rewards Beyond Math and Code: Lightweight Corpus-Grounded Process Supervision for Factual Question Answering
Paper • 2605.29648 • Published • 10 -
Why Larger Models Learn More: Effects of Capacity, Interference, and Rare-Task Retention
Paper • 2605.29548 • Published • 11 -
Towards Verifiable Multimodal Deep Research: A Multi-Agent Harness for Interleaved Report Generation
Paper • 2605.29861 • Published • 16 -
COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation
Paper • 2605.31264 • Published • 114
10. Quantum IA
09. Investigación
-
ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research
Paper • 2606.07591 • Published • 91 -
SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research
Paper • 2606.09730 • Published • 50 -
DeNovoSWE: Scaling Long-Horizon Environments for Generating Entire Repositories from Scratch
Paper • 2606.10728 • Published • 33 -
Towards Diverse Scientific Hypothesis Search with Large Language Models
Paper • 2606.10587 • Published • 2
06. Math
-
The Hamilton-Jacobi Theory of Deep Learning
Paper • 2605.28983 • Published • 1 -
Convex Optimization: Algorithms and Complexity
Paper • 1405.4980 • Published -
Introduction to Online Convex Optimization
Paper • 1909.05207 • Published -
Optimal Stochastic Non-smooth Non-convex Optimization through Online-to-Non-convex Conversion
Paper • 2302.03775 • Published
07. IA agentica
-
Multi-Agent Computer Use
Paper • 2606.01533 • Published • 7 -
OpenSkill: Open-World Self-Evolution for LLM Agents
Paper • 2606.06741 • Published • 27 -
Socratic-SWE: Self-Evolving Coding Agents via Trace-Derived Agent Skills
Paper • 2606.07412 • Published • 12 -
Bayesian-Agent: Posterior-Guided Skill Evolution for LLM Agent Harnesses
Paper • 2606.08348 • Published • 14
05. Derecho
-
PRISM: A Multi-Dimensional Benchmark for Evaluating LLM Peer Reviewers
Paper • 2605.26730 • Published • 16 -
Off-the-Shelf LLMs as Process Scorers: Training-Free Alternative to PRMs for Mathematical Reasoning
Paper • 2606.01682 • Published • 7 -
LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs
Paper • 2606.06286 • Published • 8 -
Skip a Layer or Loop It? Learning Program-of-Layers in LLMs
Paper • 2606.06574 • Published • 22
04. Finance
-
TradingAgents: Multi-Agents LLM Financial Trading Framework
Paper • 2412.20138 • Published • 100 -
Kronos: A Foundation Model for the Language of Financial Markets
Paper • 2508.02739 • Published • 42 -
Absorbing Complexity: An Interaction-Native Knowledge Harness for Financial LLM Agents
Paper • 2606.01886 • Published • 5 -
A Benchmark and Framework for Evaluating Next Action Predictions in Spreadsheets
Paper • 2606.13802 • Published
03. AI imagenes
-
Geometry Matters: 3D Foundation Priors for Learning Semantic Correspondence
Paper • 2605.30093 • Published • 15 -
Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs
Paper • 2605.30611 • Published • 195 -
Next Forcing: Causal World Modeling with Multi-Chunk Prediction
Paper • 2606.11187 • Published • 6
02. Prompt engineering
-
Verifiable Rewards Beyond Math and Code: Lightweight Corpus-Grounded Process Supervision for Factual Question Answering
Paper • 2605.29648 • Published • 10 -
Why Larger Models Learn More: Effects of Capacity, Interference, and Rare-Task Retention
Paper • 2605.29548 • Published • 11 -
Towards Verifiable Multimodal Deep Research: A Multi-Agent Harness for Interleaved Report Generation
Paper • 2605.29861 • Published • 16 -
COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation
Paper • 2605.31264 • Published • 114