AI & ML interests
Efficient and adaptive foundation models across language and multimodal intelligence.
Recent Activity
Papers
Demystifying When Pruning Works via Representation Hierarchies
Making Large Language Models Efficient Dense Retrievers
LLM-Drop
🤗 LLM-Drop hosts research artifacts for efficient foundation models, with a focus on large language models and unified multimodal models.
Our work studies how modern foundation models can be made more efficient while preserving their core capabilities. This page collects model weights, code links, project pages, and related resources from our research projects.
🚀 Projects
🧩 LLM-Drop
Uncovering the Redundancy in Transformers via a Unified Study of Layer Dropping
TMLR 2026
- 📄 Paper: https://openreview.net/forum?id=1I7PCbOPfe
- 💻 Code: https://github.com/CASE-Lab-UMD/LLM-Drop
- 🤗 Models: https://huggingface.co/collections/LLM-Drop/llm-drop
📉 Pruning on Representations
Demystifying When Pruning Works via Representation Hierarchies
- 🌐 Project Page: https://case-lab-umd.github.io/Pruning-on-Representations/
- 📄 Paper: https://arxiv.org/abs/2603.24652
- 💻 Code: https://github.com/CASE-Lab-UMD/Pruning-on-Representations
📊 Sparse Unified Models
Understanding and Harnessing Sparsity in Unified Multimodal Models
- 🌐 Project Page: https://shwai-he.github.io/SparseUnifiedModel/
- 📄 Paper: https://huggingface.co/papers/2512.02351
- 💻 Code: https://github.com/Shwai-He/SparseUnifiedModel
- 🤗 Models:
💬 Contact
For questions or collaborations, please contact:
- Shwai He: shwaihe@umd.edu
- Guoheng Sun: ghsun@umd.edu