shareAI (shareAI)

Lie24

authored a paper 29 days ago

OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond

Paper • 2605.19660 • Published May 19 • 40

Lie24

authored a paper 2 months ago

Expert-Choice Routing Enables Adaptive Computation in Diffusion Language Models

Paper • 2604.01622 • Published Apr 2 • 7

chenzongchao

authored 2 papers 4 months ago

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

Paper • 2602.13367 • Published Feb 13 • 36

Measuring and Mitigating Post-hoc Rationalization in Reverse Chain-of-Thought Generation

Paper • 2602.14469 • Published Feb 16 • 3

Evanwu50020

authored a paper 4 months ago

AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State Machines

Paper • 2602.14296 • Published Feb 15 • 51

Evanwu50020

submitted a paper to Daily Papers 4 months ago

AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State Machines

Paper • 2602.14296 • Published Feb 15 • 51

Baicai003

updated a dataset 6 months ago

shareAI/ShareGPT-Chinese-English-90k

Preview • Updated Dec 29, 2025 • 1.21k • 281

Evanwu50020

authored a paper 11 months ago

Trainable Dynamic Mask Sparse Attention

Paper • 2508.02124 • Published Aug 4, 2025 • 19

Lie24

authored 2 papers about 1 year ago

A Technical Study into Small Reasoning Language Models

Paper • 2506.13404 • Published Jun 16, 2025 • 8

ReEx-SQL: Reasoning with Execution-Aware Reinforcement Learning for Text-to-SQL

Paper • 2505.12768 • Published May 19, 2025 • 5

Baicai003

updated a collection about 1 year ago

LLM-methods

Collection

some paper for learn • 3 items • Updated Apr 26, 2025 • 1

Lie24

authored a paper about 1 year ago

SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning

Paper • 2504.08600 • Published Apr 11, 2025 • 33

Evanwu50020

updated a model about 1 year ago

shareAI/gemma3-r1-12b

Updated Apr 1, 2025 • 1

Evanwu50020

published a model about 1 year ago

shareAI/gemma3-r1-12b

Updated Apr 1, 2025 • 1

Lie24

authored a paper over 1 year ago

Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More

Paper • 2502.07490 • Published Feb 11, 2025 • 10

Baicai003

updated a Space over 1 year ago

shareAI

🚀

StarRing2022

updated 3 models over 1 year ago

updated a dataset over 1 year ago

shareAI/Alpaca-Distill-R1-ZH

Viewer • Updated Feb 6, 2025 • 179k • 18 • 15

AI & ML interests

Team members 174

shareAI's activity

shareAI