Liang Siyu

liangsiyu

AI & ML interests

None yet

Organizations

None yet

upvoted a paper about 17 hours ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 4 days ago • 228

liked 2 models 4 days ago

facebook/contriever

Updated Jan 19, 2022 • 7.38M • 77

glides/counterfeit

Text-to-Image • Updated 4 days ago • 271 • 1

liked a model 8 days ago

arithmetic-circuit-overloading/Llama-3.3-70B-Instruct-v2-3d-4M-400K-0.1-reverse-padzero-99-512D-1L-2H-2048I

Text Generation • 3.69M • Updated 7 days ago • 951 • 1

liked a dataset 9 days ago

OpenAssistant/oasst1

Viewer • Updated May 2, 2023 • 88.8k • 11.9k • 1.5k

upvoted 4 papers 11 days ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published 13 days ago • 339

Diffutron: A Masked Diffusion Language Model for Turkish Language

Paper • 2603.20466 • Published 23 days ago • 8

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 23 days ago • 330

Unleashing Spatial Reasoning in Multimodal Large Language Models via Textual Representation Guided Reasoning

Paper • 2603.23404 • Published 19 days ago • 7

upvoted a paper 26 days ago

HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions

Paper • 2603.15612 • Published 27 days ago • 152

upvoted a paper about 1 month ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published Feb 9 • 263

upvoted a paper about 2 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 519

liked a model about 2 months ago

LocoreMind/LocoOperator-4B

Text Generation • 4B • Updated Feb 24 • 1.17k • 209