sai_reddy
saireddy
AI & ML interests
None yet
Organizations
insights on comparisons with Qwen/Qwen3-Next-80B-A3B-Instruct ?
➕
6
#14 opened 2 months ago
by
saireddy
function calling
#4 opened 3 months ago
by
saireddy
possible to extend context to 1m tokens ?
#5 opened 5 months ago
by
saireddy
RuntimeError: Index put requires the source and destination dtypes match, got BFloat16 for the destination and Float for the source.
➕
4
13
#24 opened over 1 year ago
by
saireddy
model.generate is throwing AttributeError: 'HybridCache' object has no attribute 'float'
7
#18 opened over 1 year ago
by
saireddy
base vs instruct model
1
#17 opened over 1 year ago
by
saireddy
Inference error
9
#20 opened over 1 year ago
by
gsasikiran
8-bit precision error
17
#32 opened almost 2 years ago
by
saireddy
ValueError with multi A100 GPUS
2
#28 opened almost 2 years ago
by
saireddy
Base vs instruct
5
#17 opened over 1 year ago
by
saireddy
Could not find GemmaForCausalLM neither in <module 'transformers.models.gemma'
6
#36 opened almost 2 years ago
by
chenwei1984