Hugging Face
Junxiong Wang
PRO
JunxiongWang
16 followers · 3 following
https://www.cs.cornell.edu/~junxiong/
jxiw
AI & ML interests
Attention Free Model / Subquadratic Language Models
Recent Activity
upvoted an article about 2 months ago: Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models
updated a model 4 months ago: JunxiongWang/M1-3B
updated a model 5 months ago: togethercomputer/M1-3B
JunxiongWang's models (51)
JunxiongWang/M1-3B • Text Generation • 3B • Updated Sep 2, 2025 • 12 • 2
JunxiongWang/M1-3B-SFT • Text Generation • 3B • Updated Apr 16, 2025 • 68 • 1
JunxiongWang/MambaInLlama1B_SFT_MATH • 1B • Updated Feb 11, 2025 • 6
JunxiongWang/MambaInLlama3B_SFT_MATH • 3B • Updated Feb 7, 2025 • 8
JunxiongWang/MambaInLlama3B_DPO2 • 3B • Updated Feb 5, 2025 • 5
JunxiongWang/MambaInLlama3B_DPO1 • 3B • Updated Feb 5, 2025 • 5
JunxiongWang/MambaInLlama3B_Distill_MATH • 3B • Updated Jan 27, 2025 • 8
JunxiongWang/MambaInLlama3B_v3 • 3B • Updated Jan 25, 2025 • 5
JunxiongWang/MambaInLlama1B_Distill_MATH • 1B • Updated Jan 23, 2025 • 8
JunxiongWang/mamba_0_5_distill • Updated Dec 25, 2024 • 2
JunxiongWang/Llama3.2-Mamba-3B-dpo • Updated Nov 17, 2024 • 5
JunxiongWang/Llama3.2-Mamba-3B-distill • Updated Nov 17, 2024 • 11
JunxiongWang/Llama3.2-Mamba2-3B-distill • Updated Nov 17, 2024 • 13
JunxiongWang/Llama3.2-Mamba2-3B-dpo • Updated Nov 17, 2024 • 11
JunxiongWang/Llama3.1-Mamba2-8B-dpo • Updated Nov 17, 2024 • 7
JunxiongWang/Llama3.1-Mamba-8B-dpo • Updated Nov 17, 2024 • 11
JunxiongWang/Llama3.1-Mamba2-8B-distill • Updated Nov 17, 2024 • 257
JunxiongWang/Llama3.1-Mamba-8B-distill • Updated Nov 17, 2024 • 8
JunxiongWang/MambaByte_Stories • Text Generation • Updated Sep 9, 2024 • 20 • 1
JunxiongWang/MambaByte_Arxiv • Text Generation • Updated Sep 9, 2024 • 10 • 3
JunxiongWang/MambaByte_PG19_353M • Text Generation • Updated Sep 9, 2024 • 9
JunxiongWang/MambaByte_Books • Text Generation • Updated Sep 9, 2024 • 10 • 2
JunxiongWang/MambaByte_Code • Text Generation • Updated Sep 9, 2024 • 14 • 2
JunxiongWang/MambaByte_PG19_972M • Text Generation • Updated Sep 9, 2024 • 11
JunxiongWang/Mamba2InLlama_1 • Updated Sep 2, 2024 • 6 • 1
JunxiongWang/Mamba2InLlama_0_50 • Updated Sep 2, 2024 • 8
JunxiongWang/Mamba2InLlama_0_75 • Updated Sep 2, 2024 • 5
JunxiongWang/MambaInLlama_0_50 • Updated Sep 2, 2024 • 26
JunxiongWang/MambaInLlama_0_75 • Updated Sep 2, 2024 • 5
JunxiongWang/MambaInLlama_0_875 • Updated Sep 2, 2024 • 7