oguzhanercan's Collections Reasoning
Paper
• 2506.10910
• Published
• 66
Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute
Paper
• 2506.15882
• Published
• 2
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization
Paper
• 2507.14683
• Published
• 134
The Invisible Leash: Why RLVR May Not Escape Its Origin
Paper
• 2507.14843
• Published
• 85
Stabilizing Knowledge, Promoting Reasoning: Dual-Token Constraints for RLVR
Paper
• 2507.15778
• Published
• 21
GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
Paper
• 2507.19457
• Published
• 30
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR
Paper
• 2508.14029
• Published
• 118
Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic
Paper
• 2509.01363
• Published
• 59
Which Heads Matter for Reasoning? RL-Guided KV Cache Compression
Paper
• 2510.08525
• Published
• 23
Paper
• 2510.06557
• Published
• 31
A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning
Paper
• 2510.15444
• Published
• 148
LightMem: Lightweight and Efficient Memory-Augmented Generation
Paper
• 2510.18866
• Published
• 114
Scaling Latent Reasoning via Looped Language Models
Paper
• 2510.25741
• Published
• 229
Reasoning with Sampling: Your Base Model is Smarter Than You Think
Paper
• 2510.14901
• Published
• 48
OpenSIR: Open-Ended Self-Improving Reasoner
Paper
• 2511.00602
• Published
• 21