Finally Qwen3.5-122B-A10B-NVFP4 working on Thor!

#1
by pastoriomarco - opened

Thank you, it works on my Thor too! And thank you for uploading the resharded model.

I actually ran it in another container and got it up to 256k context window, and there's still room. I basically need a single request at a time and prefer higher capability so context window over concurrent requests!

The commands I used are here:
https://github.com/pastoriomarco/thor_llm/tree/main/models/qwen3.5-122b-a10b-nvfp4-resharded

Sign up or log in to comment