Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
inference-optimization
's Collections
HIGGS-stiched
HIGGS-per-tensor
HIGGS
test-models
HIGGS-stiched
updated
1 day ago
Stitched HIGGS Llama3 8B mixed-precision model variants.
Upvote
-
meta-llama/Llama-3.1-8B-Instruct
Text Generation
•
8B
•
Updated
Sep 25, 2024
•
9.55M
•
•
5.82k
RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic
Text Generation
•
8B
•
Updated
Mar 19
•
65.9k
•
9
RedHatAI/Llama-3.1-8B-Instruct-NVFP4
Text Generation
•
5B
•
Updated
Nov 21, 2025
•
18k
•
1
Qwen/Qwen3-8B
Text Generation
•
8B
•
Updated
Jul 26, 2025
•
11.5M
•
•
1.09k
RedHatAI/Qwen3-8B-FP8-dynamic
Text Generation
•
8B
•
Updated
14 days ago
•
34.9k
•
12
RedHatAI/Qwen3-8B-NVFP4
Text Generation
•
5B
•
Updated
Nov 21, 2025
•
3.38k
•
2
inference-optimization/llama3_8b_5.0_bits_mode_heuristic_stiched
5B
•
Updated
5 days ago
•
24
inference-optimization/llama3_8b_5.0_bits_mode_hybrid_stiched
5B
•
Updated
5 days ago
•
24
inference-optimization/llama3_8b_5.0_bits_mode_noise_stiched
5B
•
Updated
5 days ago
•
21
inference-optimization/llama3_8b_5.5_bits_mode_heuristic_stiched
6B
•
Updated
5 days ago
•
20
inference-optimization/llama3_8b_5.5_bits_mode_hybrid_stiched
6B
•
Updated
5 days ago
•
26
inference-optimization/llama3_8b_5.5_bits_mode_noise_stiched
6B
•
Updated
5 days ago
•
28
inference-optimization/llama3_8b_6.0_bits_mode_heuristic_stiched
6B
•
Updated
5 days ago
•
22
inference-optimization/llama3_8b_6.0_bits_mode_hybrid_stiched
6B
•
Updated
5 days ago
•
47
inference-optimization/llama3_8b_6.0_bits_mode_noise_stiched
6B
•
Updated
5 days ago
•
28
inference-optimization/llama3_8b_6.5_bits_mode_heuristic_stiched
7B
•
Updated
5 days ago
•
22
inference-optimization/llama3_8b_6.5_bits_mode_hybrid_stiched
7B
•
Updated
5 days ago
•
24
inference-optimization/llama3_8b_6.5_bits_mode_noise_stiched
7B
•
Updated
5 days ago
•
25
inference-optimization/llama3_8b_7.0_bits_mode_heuristic_stiched
7B
•
Updated
5 days ago
•
24
inference-optimization/llama3_8b_7.0_bits_mode_hybrid_stiched
7B
•
Updated
5 days ago
•
28
inference-optimization/llama3_8b_7.0_bits_mode_noise_stiched
7B
•
Updated
5 days ago
•
27
inference-optimization/qwen3_8b_5.0_bits_mode_heuristic_stiched
6B
•
Updated
5 days ago
•
25
inference-optimization/qwen3_8b_5.0_bits_mode_hybrid_stiched
6B
•
Updated
5 days ago
•
28
inference-optimization/qwen3_8b_5.0_bits_mode_noise_stiched
6B
•
Updated
5 days ago
•
26
inference-optimization/qwen3_8b_5.5_bits_mode_heuristic_stiched
6B
•
Updated
5 days ago
•
26
inference-optimization/qwen3_8b_5.5_bits_mode_hybrid_stiched
6B
•
Updated
5 days ago
•
27
inference-optimization/qwen3_8b_5.5_bits_mode_noise_stiched
6B
•
Updated
5 days ago
•
23
inference-optimization/qwen3_8b_6.0_bits_mode_heuristic_stiched
6B
•
Updated
5 days ago
•
26
inference-optimization/qwen3_8b_6.0_bits_mode_hybrid_stiched
6B
•
Updated
5 days ago
•
21
inference-optimization/qwen3_8b_6.0_bits_mode_noise_stiched
6B
•
Updated
5 days ago
•
27
inference-optimization/qwen3_8b_6.5_bits_mode_heuristic_stiched
7B
•
Updated
5 days ago
•
27
inference-optimization/qwen3_8b_6.5_bits_mode_hybrid_stiched
7B
•
Updated
5 days ago
•
24
inference-optimization/qwen3_8b_6.5_bits_mode_noise_stiched
7B
•
Updated
5 days ago
•
29
inference-optimization/qwen3_8b_7.0_bits_mode_heuristic_stiched
7B
•
Updated
5 days ago
•
27
inference-optimization/qwen3_8b_7.0_bits_mode_hybrid_stiched
7B
•
Updated
5 days ago
•
27
inference-optimization/qwen3_8b_7.0_bits_mode_noise_stiched
7B
•
Updated
5 days ago
•
27
Upvote
-
Share collection
View history
Collection guide
Browse collections