NVFP4 MLX
#3
by sm54 - opened
Hi,
I wonder if you can quantise this using nvfp4, I would do it myself but my internet connection is too slow to download the bf16 model for quantisation.
Thanks,
sure
Thank you, that's great
pcuenq changed discussion status to closed