OpenCompass

community

https://opencompass.org.cn/

OpenCompassX

open-compass

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

dongsheng authored a paper about 12 hours ago

How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity

vansin submitted a paper 16 days ago

End-to-End Video Character Replacement without Structural Guidance

KennyUTC authored a paper 22 days ago

SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence

View all activity

Papers

CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward

Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM

View all Papers

dongsheng

authored a paper about 12 hours ago

How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity

Paper • 2511.08487 • Published Nov 11, 2025 • 3

vansin

submitted a paper to Daily Papers 16 days ago

End-to-End Video Character Replacement without Structural Guidance

Paper • 2601.08587 • Published 16 days ago • 8

KennyUTC

authored a paper 22 days ago

SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence

Paper • 2512.22334 • Published Dec 26, 2025 • 35

Sudanl

updated 2 models 26 days ago

opencompass/CompassVerifier-3B

3B • Updated 26 days ago • 794 • 7

opencompass/CompassVerifier-32B

33B • Updated 26 days ago • 10 • 7

vansin

posted an update about 1 month ago

Post

300

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

ZwwWayne

authored a paper about 2 months ago

Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving

Paper • 2512.10739 • Published Dec 11, 2025 • 47

vanilla1116

authored 4 papers about 2 months ago

CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward

Paper • 2508.03686 • Published Aug 5, 2025 • 39

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 262

OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification

Paper • 2512.10756 • Published Dec 11, 2025 • 35

Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving

Paper • 2512.10739 • Published Dec 11, 2025 • 47

vanilla1116

submitted 3 papers to Daily Papers about 2 months ago

Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning

Paper • 2512.10534 • Published Dec 11, 2025 • 32

Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving

Paper • 2512.10739 • Published Dec 11, 2025 • 47

OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification

Paper • 2512.10756 • Published Dec 11, 2025 • 35

nebulae09

authored 3 papers about 2 months ago

IWR-Bench: Can LVLMs reconstruct interactive webpage from a user interaction video?

Paper • 2509.24709 • Published Sep 29, 2025 • 7

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Paper • 2511.14366 • Published Nov 18, 2025 • 17

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published Dec 4, 2025 • 49

yuhangzang

authored a paper about 2 months ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published Dec 4, 2025 • 49

Sudanl

authored a paper about 2 months ago

How Far Are We from Genuinely Useful Deep Research Agents?

Paper • 2512.01948 • Published Dec 1, 2025 • 56

jnanliu

authored a paper about 2 months ago

How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity

Paper • 2511.08487 • Published Nov 11, 2025 • 3