srivatsa92/Qwen2.5-3B-Instruct-GSM8K-Reasoning-v1-grpo Text Generation • 3B • Updated Feb 16, 2025 • 7
srivatsa92/llama3.1_fineTomeAlpaca_modified_aligned Text Generation • 8B • Updated Dec 10, 2024 • 6