mltrials (mltrials)

upvoted 2 articles 3 months ago

Article

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

+4

ggerganov, ngxson, allozaur, lysandre, victor, julien-c

•

Feb 20

• 505

Article

Train AI models with Unsloth and Hugging Face Jobs for FREE

+4

burtenshaw, danielhanchen, shimmyshimmer, mlabonne, davanstrien, evalstate

•

Feb 20

• 100

upvoted an article 5 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

burtenshaw, evalstate

•

Dec 4, 2025

• 624

upvoted a collection 8 months ago

Granite Docling

Collection

Models for parsing complex PDFs and structured documents, designed to complement Docling. • 4 items • Updated 12 days ago • 62

upvoted an article 8 months ago

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

+4

tomaarsen, Xenova, alvarobartt, ariG23498, pcuenq, sergiopaniego

•

Sep 4, 2025

• 273

upvoted 2 papers 9 months ago

Prompt Orchestration Markup Language

Paper • 2508.13948 • Published Aug 19, 2025 • 48

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6, 2025 • 129

upvoted an article 10 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 773

upvoted a paper 11 months ago

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published Jun 15, 2025 • 64

upvoted 3 papers over 1 year ago

upvoted 5 articles over 1 year ago

Article

🌁#81: Key AI Concepts to Follow in 2025

Kseniase

•

Dec 23, 2024

• 24

Article

Fine-tune ModernBERT for text classification using synthetic data

davidberenstein1957

•

Dec 30, 2024

• 39

Article

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

wolfram

•

Jan 2, 2025

• 41

Article

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

davidberenstein1957

•

Jan 3, 2025

• 38

Article

Accelerating Language Model Inference with Mixture of Attentions

hba123

•

Jan 7, 2025

• 24

mltrials

AI & ML interests

Organizations

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

Train AI models with Unsloth and Hugging Face Jobs for FREE

We Got Claude to Fine-Tune an Open Source LLM

Granite Docling

Welcome EmbeddingGemma, Google's new efficient embedding model

Prompt Orchestration Markup Language

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

SmolLM3: smol, multilingual, long-context reasoner

Scaling Test-time Compute for LLM Agents

Tensor Product Attention Is All You Need

The Lessons of Developing Process Reward Models in Mathematical Reasoning

MiniMax-01: Scaling Foundation Models with Lightning Attention

🌁#81: Key AI Concepts to Follow in 2025

Fine-tune ModernBERT for text classification using synthetic data

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

Accelerating Language Model Inference with Mixture of Attentions

mltrials

AI & ML interests

Organizations

mltrials's activity

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

Train AI models with Unsloth and Hugging Face Jobs for FREE

We Got Claude to Fine-Tune an Open Source LLM

Welcome EmbeddingGemma, Google's new efficient embedding model

SmolLM3: smol, multilingual, long-context reasoner

🌁#81: Key AI Concepts to Follow in 2025

Fine-tune ModernBERT for text classification using synthetic data

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

Accelerating Language Model Inference with Mixture of Attentions