shopifyinterngrinder/sidekick-autocomplete-1.7b
Fine-tuned from Qwen/Qwen3-1.7B using TRL SFT.
Training Details
| Parameter |
Value |
| Base Model |
Qwen/Qwen3-1.7B |
| Dataset |
shopifyinterngrinder/sidekick-autocomplete-data @ main |
| Training Examples |
900 |
| Validation Examples |
101 |
| Epochs |
3 |
| Learning Rate |
2e-05 |
| Batch Size (per device) |
1 |
| Gradient Accumulation |
2 |
| Max Sequence Length |
512 |
| Precision |
bf16 |
| Optimizer |
adamw_torch_fused |
| Warmup Steps |
50 |
| Weight Decay |
0.01 |
| LR Scheduler |
cosine |
| Packing |
Enabled |
| Dataset Format |
chat |
Framework Versions
| Library |
Version |
| Transformers |
4.57.6 |
| TRL |
0.29.0 |
| PyTorch |
2.8.0+cu128 |
| Datasets |
3.6.0 |
| Accelerate |
1.13.0 |