Instructions to use Reza2kn/Higgs-Audio-v3-TTS-4bit-MLX with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Reza2kn/Higgs-Audio-v3-TTS-4bit-MLX with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-to-speech", model="Reza2kn/Higgs-Audio-v3-TTS-4bit-MLX")# Load model directly from transformers import AutoModelForSeq2SeqLM model = AutoModelForSeq2SeqLM.from_pretrained("Reza2kn/Higgs-Audio-v3-TTS-4bit-MLX", dtype="auto") - MLX
How to use Reza2kn/Higgs-Audio-v3-TTS-4bit-MLX with MLX:
# Download the model from the Hub pip install huggingface_hub[hf_xet] huggingface-cli download --local-dir Higgs-Audio-v3-TTS-4bit-MLX Reza2kn/Higgs-Audio-v3-TTS-4bit-MLX
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- LM Studio
Higgs-Audio-v3-TTS-4bit-MLX
4-bit MLX 4-bit artifact for bosonai/higgs-audio-v3-tts-4b.
Scope
This is a transformer-body quantized artifact, not a complete drop-in runtime yet. It quantizes body.layers.* attention/MLP 2D weights and preserves the Higgs audio tokenizer/vocoder, fused modality embedding/head, norms, biases, and non-2D tensors.
Higgs Audio v3 TTS uses a custom HiggsMultimodalQwen3ForConditionalGeneration architecture with 8 audio codebooks, delayed multi-codebook generation, and waveform decode. Current vanilla Transformers in the tested environment does not instantiate this architecture, so runtime integration must be done through SGLang-Omni or a custom loader.
Quantization Report
- Quantized tensors:
252 - Quantized parameter fraction seen:
0.7805 - Mean relative L2:
0.066956 - Max relative L2:
0.103319 - Max absolute error:
0.075130
See:
quantization_config.jsonquant_error_report.jsontensor_manifest.json
License
Released under the upstream Boson Higgs Audio v3 research and non-commercial license. Production, hosted APIs, or revenue-generating use requires a separate commercial license from Boson AI.
- Downloads last month
- 39
4-bit
Model tree for Reza2kn/Higgs-Audio-v3-TTS-4bit-MLX
Base model
bosonai/higgs-audio-v3-tts-4b