Community Blog & Articles

Community Articles

The Optimal Architecture for Small Language Models

Deriving the PPO Loss from First Principles

LLM based Audio models

Skill is All You Need: Lessons from Building Marketing Agents at Noumena

Continuity as a First-Class System Property in Artificial Intelligence

about 15 hours ago

Encoding the World's Medical Knowledge into 970K

Uncensor any LLM with abliteration

What makes good reasoning data

Qwen-Image-i2L: Training Strategies for Image-to-LoRA Generation

Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models

Why You Should Care About Partial Differential Equations (PDEs)

Code a simple RAG from scratch

KV Caching Explained: Optimizing Transformer Inference Efficiency

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Make and publish your Reachy Mini App

cua-bench: A Framework for Benchmarking, Training Data, and RL Environments for Computer-Use Agents

Deriving the DPO Loss from First Principles

about 9 hours ago

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Small Language Models (SLM): A Comprehensive Overview

System Prompt Learning: Teaching LLMs to Learn Problem-Solving Strategies from Experience

AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems

December 23, 2025

tokenizers, transformers, open-source

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

December 18, 2025

The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator

December 17, 2025

CUGA on Hugging Face: Democratizing Configurable AI Agents

December 15, 2025

New in llama.cpp: Model Management

December 11, 2025

llm, fine-tuning, open-source

Codex is Open Sourcing AI models

December 11, 2025

swift, hub, open-source

Introducing swift-huggingface: The Complete Swift Client for Hugging Face

December 5, 2025

llm, reasoning, agents

DeepMath: A lightweight math reasoning Agent with smolagents

December 4, 2025

llm, fine-tuning, open-source

We Got Claude to Fine-Tune an Open Source LLM

December 4, 2025

transformers, v5, community

Transformers v5: Simple model definitions powering the AI ecosystem

December 1, 2025

diffusers, flux, quantization

Diffusers welcomes FLUX-2

November 25, 2025

transformers, pytorch, optimization

Continuous batching from first principles

November 25, 2025

Building Deep Research: How we Achieved State of the Art

November 24, 2025

OVHcloud on Hugging Face Inference Providers 🔥

November 24, 2025

Community Articles

NEW Articles from Team or Enterprise organizations will get promoted to the main section.
