Guidance Needed: GPT-OSS 20B Fine-Tuning with Unsloth → GGUF → Ollama → Triton (vLLM / TensorRT-LLM)
#9 opened about 1 month ago by GauravEA
Fix generation config
2
#7 opened 3 months ago by markian-rybchuk
Chat template differs from OpenAI's. Is it expected?
3
#6 opened 6 months ago by gentry1337
now working on vllm
🔥 1
4
#5 opened 7 months ago by heiyan2024
Quant with Llamacpp?
1
#4 opened 7 months ago by DavidAU
not working on A10G
#2 opened 7 months ago by abhisskk