NVFP4 MLX

#3
by sm54 - opened

Hi,

I wonder if you could quantise this model using NVFP4? I would do it myself, but my internet connection is too slow to download the bf16 model for quantisation.

Thanks,

MLX Community org

sure

Thank you, that's great

pcuenq changed discussion status to closed
