Models: 152

Active filters: quark

superbigtree/Mistral-Nemo-Instruct-2407-FP8_aq

12B • Updated Apr 22, 2025 • 23

aigdat/Llama-3.2-1B-Instruct-awq-uint4-float16

0.4B • Updated Apr 29, 2025 • 1

aigdat/Llama-3.2-3B-Instruct-awq-uint4-float16

0.8B • Updated Apr 24, 2025 • 1

aigdat/Phi-3.5-mini-instruct-awq-uint4-float16

0.6B • Updated Apr 29, 2025 • 2

aigdat/DeepSeek-R1-Distill-Qwen-1.5B_quantized_int4_bfloat16

0.4B • Updated Apr 29, 2025 • 1

aigdat/Qwen3-0.6B_quantized_int4_float16

0.2B • Updated Apr 30, 2025 • 4

aigdat/Arch-Function-Chat-3B_quantized_int4_float16

0.7B • Updated May 5, 2025 • 3

aigdat/DeepCoder-14B-Preview_quantized_int4_float16

3B • Updated May 5, 2025 • 7

aigdat/Qwen2.5-Coder-1.5B-Instruct_quantized_int4_bfloat16

0.4B • Updated May 5, 2025 • 1

aigdat/Qwen2.5-Coder-7B-Instruct_quantized_int4_bfloat16

1B • Updated May 6, 2025 • 2

aigdat/Qwen2.5-3B-Instruct_quantized_int4_bfloat16

0.7B • Updated May 8, 2025 • 3

aigdat/Qwen2.5-Coder-32B-Instruct_quantized_int4_bfloat16

5B • Updated May 9, 2025 • 1

aigdat/Llama-xLAM-2-8b-fc-r_quantized_int4_bfloat16

2B • Updated May 9, 2025

fxmarty/qwen_1.5-moe-a2.7b-mxfp4

8B • Updated May 13, 2025 • 6.18k

amd/Llama-3.3-70B-Instruct-MXFP4-Preview

38B • Updated Nov 6, 2025 • 4.37k • 2

fxmarty/deepseek_r1_3_layers_mxfp4

8B • Updated May 15, 2025 • 109 • 1

fxmarty/Llama-4-Scout-17B-16E-Instruct-2-layers-mxfp4

5B • Updated Oct 6, 2025 • 3.32k • 1

amd/DeepSeek-R1-MXFP4

371B • Updated 8 days ago • 98.5k • 5

mohitsha/Llama-2-7b-hf-w_mx_fp4_per_group_sym

4B • Updated May 23, 2025

amd/Llama-3.1-405B-Instruct-MXFP4-Preview

218B • Updated Nov 6, 2025 • 447 • 1

amd/DeepSeek-R1-MXFP4-ASQ

363B • Updated Nov 6, 2025 • 3k • 1

haoyang-amd/qwen1.5-0.5B-ptpc

0.5B • Updated Jul 1, 2025 • 1

amd/DeepSeek-R1-0528-MXFP4

356B • Updated Feb 26 • 19.2k • 1

fxmarty/Llama-3.1-70B-Instruct-2-layers-mxfp6

3B • Updated Jul 9, 2025 • 4.38k

fxmarty/qwen1.5_moe_a2.7b_chat_w_fp4_a_fp6_e2m3

8B • Updated Jul 11, 2025 • 5.19k

fxmarty/qwen1.5_moe_a2.7b_chat_w_fp6_e2m3_a_fp6_e2m3

11B • Updated Jul 11, 2025 • 2

fxmarty/qwen1.5_moe_a2.7b_chat_w_fp6_e3m2_a_fp6_e3m2

11B • Updated Jul 11, 2025 • 6.13k

amd/Llama-2-70b-chat-hf-WMXFP4-AMXFP4-KVFP8-Scale-UINT8-MLPerf-GPTQ

37B • Updated Aug 5, 2025 • 5

sudhab1988/rakuten-7b-awq-g128-int4-asym-fp16-hf

1B • Updated Jul 15, 2025 • 1

matmelis/Llama_3.2_1B_w_uint4_gptq

0.4B • Updated Jul 16, 2025 • 6