Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

alibaba

company
https://www.alibabagroup.com/
https://github.com/alibaba
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

KuanCao320  published a dataset 29 days ago
alibabagroup/OmniDoc-TokenBench
KuanCao320  updated a dataset 29 days ago
alibabagroup/OmniDoc-TokenBench
wensheng0503  new activity 2 months ago
alibabagroup/MobiZen-GUI-4B:tokenizer.json文件有问题
View all activity

Papers

How Does Reasoning Flow? Tracing Attention-Induced Information Flow for Targeted RL in LLMs

Why Steering Works: Toward a Unified View of Language Model Parameter Dynamics

View all Papers

lzj's profile picturejinran's profile pictureboxin's profile pictureWensheng's profile pictureli's profile pictureFengYi's profile picture
alibabagroup 's papers 4
Submitted by
Yang Li (SJTU & SII)
6

How Does Reasoning Flow? Tracing Attention-Induced Information Flow for Targeted RL in LLMs

alibabagroup alibaba
2
Submitted by
Ningyu Zhang
13

Why Steering Works: Toward a Unified View of Language Model Parameter Dynamics

alibabagroup alibaba
3
Submitted by
Yang Li (SJTU & SII)
59

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

alibabagroup alibaba
2
Submitted by
Jiaming Wang
68

Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning

alibabagroup alibaba
3 3
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs