Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
11
37
58
Tong Zhu
Spico
Follow
Warrieryes's profile picture
itaowe's profile picture
JusenX's profile picture
19 followers
·
61 following
https://Spico197.github.io
TongZhu197
Spico197
AI & ML interests
Information Extraction, Mixture-of-Experts, LLM
Recent Activity
authored
a paper
about 8 hours ago
LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
authored
a paper
about 8 hours ago
Iterative Value Function Optimization for Guided Decoding
authored
a paper
about 8 hours ago
Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts
View all activity
Organizations
Spico
's models
7
Sort: Recently updated
Spico/LLaMA-MoE-v1-2_8-UniformSFT
Text Generation
•
7B
•
Updated
Feb 28, 2024
•
5
Spico/LLaMA-MoE-v1-2_8-DynamicSFT
Text Generation
•
7B
•
Updated
Feb 28, 2024
•
6
Spico/sheared-llama-2.7b-deita-6k-sft
Text Generation
•
3B
•
Updated
Feb 25, 2024
•
2
•
1
Spico/internlm2-7b-hf-llama
Text Generation
•
Updated
Feb 23, 2024
•
6
Spico/mirror-chinese-mrcqa-alpha
Updated
Dec 4, 2023
Spico/Humback-Myx
Text Generation
•
Updated
Aug 19, 2023
•
25
•
3
Spico/Humback-M0
Text Generation
•
Updated
Aug 18, 2023
•
23
•
3