view article Article Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler +3 ariG23498, sayakpaul, sergiopaniego, ror, pcuenq • 19 days ago • 114
Gemma 4 QAT Collection Gemma 4 QAT (Quantization-Aware Training) for 3x less memory use and near original accuracy. • 16 items • Updated 1 day ago • 89
gliner2 family Collection GLiNER2 extends the original GLiNER architecture to support multi-task information extraction with a schema-driven interface. • 7 items • Updated May 16 • 51
Tfree-HAT-7b-pretrained Collection Tokenizer free models based on Hierarchical Autoregressive Transformer (https://arxiv.org/abs/2501.10322) trained from scratch. • 2 items • Updated Aug 1, 2025 • 11
NVIDIA Nemotron V2 Collection Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated 5 days ago • 106
view article Article Introducing Command A Vision: Multimodal AI built for Business CohereLabs • Jul 31, 2025 • 64
GLM-4.5 Collection GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 8 items • Updated Mar 2 • 254
GLiCLass-V3 Collection Models for zero-shot text classification that are up to 50 times faster than Cross-Encoders and show the same or higher accuracy. • 8 items • Updated Jan 29 • 21
Seed-X Collection A powerful open-source multilingual translation language model series, including instruction and reasoning models. • 8 items • Updated Aug 22, 2025 • 69
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads Paper • 2401.10774 • Published Jan 19, 2024 • 60
💧 LFM2 Collection LFM2 is a new generation of hybrid models, designed for on-device deployment. • 28 items • Updated 10 days ago • 154