shopifyinterngrinder/sidekick-autocomplete-1.7b

Fine-tuned from Qwen/Qwen3-1.7B with supervised fine-tuning (SFT) using TRL.

Training Details

| Parameter | Value |
| --- | --- |
| Base Model | Qwen/Qwen3-1.7B |
| Dataset | shopifyinterngrinder/sidekick-autocomplete-data @ main |
| Training Examples | 900 |
| Validation Examples | 101 |
| Epochs | 3 |
| Learning Rate | 2e-05 |
| Batch Size (per device) | 1 |
| Gradient Accumulation | 2 |
| Max Sequence Length | 512 |
| Precision | bf16 |
| Optimizer | adamw_torch_fused |
| Warmup Steps | 50 |
| Weight Decay | 0.01 |
| LR Scheduler | cosine |
| Packing | Enabled |
| Dataset Format | chat |
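
The settings above can be restated as a plain Python dictionary, which also makes the effective batch size easy to work out. This is a minimal sketch, not the training script used for this model. The key names follow the argument names commonly used by `TrainingArguments` and TRL's `SFTConfig`, but they are assumptions here (the sequence-length argument in particular has been renamed across TRL releases) and should be checked against the installed version. The device count is not stated in this card, so `NUM_DEVICES = 1` is also an assumption.

```python
import math

# Hyperparameters copied from the table above.
# Key names are assumed to match SFTConfig / TrainingArguments; verify them
# against the installed TRL version before passing them on.
hparams = {
    "num_train_epochs": 3,
    "learning_rate": 2e-5,
    "per_device_train_batch_size": 1,
    "gradient_accumulation_steps": 2,
    "max_length": 512,  # "Max Sequence Length" in the table (assumed key name)
    "bf16": True,
    "optim": "adamw_torch_fused",
    "warmup_steps": 50,
    "weight_decay": 0.01,
    "lr_scheduler_type": "cosine",
    "packing": True,
}

NUM_TRAIN_EXAMPLES = 900  # from the table
NUM_DEVICES = 1           # assumption: the card does not state the device count


def effective_batch_size(hp, num_devices):
    """Sequences consumed per optimizer step across all devices."""
    return (
        hp["per_device_train_batch_size"]
        * hp["gradient_accumulation_steps"]
        * num_devices
    )


def steps_without_packing(num_examples, hp, num_devices):
    """Optimizer steps over all epochs if every example were its own sequence."""
    per_epoch = math.ceil(num_examples / effective_batch_size(hp, num_devices))
    return per_epoch * hp["num_train_epochs"]


if __name__ == "__main__":
    print("effective batch size:", effective_batch_size(hparams, NUM_DEVICES))
    print(
        "steps without packing:",
        steps_without_packing(NUM_TRAIN_EXAMPLES, hparams, NUM_DEVICES),
    )
```

Because packing is enabled, several short examples can share one 512-token sequence, so the real number of optimizer steps was probably lower than this unpacked estimate. The exact count depends on the token lengths in the dataset, which this card does not report.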

Framework Versions

| Library | Version |
| --- | --- |
| Transformers | 4.57.6 |
| TRL | 0.29.0 |
| PyTorch | 2.8.0+cu128 |
| Datasets | 3.6.0 |
| Accelerate | 1.13.0 |
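
To compare a local environment against these versions, something like the following could be used. It is a small sketch based on the standard-library `importlib.metadata` module. The helper name `find_mismatches` is hypothetical, and the package names are the usual PyPI distribution names for each library (for example `torch` for PyTorch).

```python
from importlib.metadata import PackageNotFoundError, version

# Versions copied from the table above, keyed by PyPI distribution name.
EXPECTED = {
    "transformers": "4.57.6",
    "trl": "0.29.0",
    "torch": "2.8.0+cu128",
    "datasets": "3.6.0",
    "accelerate": "1.13.0",
}


def find_mismatches(expected, get_version=version):
    """Return {package: (wanted, installed)} for every package that differs.

    `installed` is None when the package is not installed at all.
    `get_version` is injectable so the check can be exercised without
    touching the real environment.
    """
    mismatches = {}
    for package, wanted in expected.items():
        try:
            installed = get_version(package)
        except PackageNotFoundError:
            installed = None
        if installed != wanted:
            mismatches[package] = (wanted, installed)
    return mismatches


if __name__ == "__main__":
    for package, (wanted, installed) in find_mismatches(EXPECTED).items():
        print(f"{package}: expected {wanted}, found {installed}")
```

An exact string match is deliberately strict: a CPU-only or differently built PyTorch wheel reports a different local version suffix than `+cu128` and will be flagged, even though it may work fine for inference.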