ILSP Greek Evaluation Suite • Collection • A collection of test sets for evaluating base and chat LLMs (incl. VLMs) on Greek generation and understanding capabilities • 23 items • Updated about 19 hours ago • 6
Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention • Paper • 2605.22791 • Published 26 days ago • 31
🤏 Smol-Data • Collection • Tried and tested mixes for strong pretraining. Inspired by https://huggingface.co/blog/codelion/optimal-dataset-mixing • 14 items • Updated Mar 2 • 12
Granite 4.0 Nano Language Models • Collection • Ultra-compact language models designed for the edge and on-device deployment. • 9 items • Updated Apr 29 • 103