Building on HF

15 726 285

Taufiq Dwi Purnomo

taufiqdp

https://taufiqdp.com

AI & ML interests

SLM, VLM

Recent Activity

upvoted a paper 2 days ago

mHC: Manifold-Constrained Hyper-Connections

liked a model 2 days ago

MiniMaxAI/MiniMax-M2.1

upvoted a paper 6 days ago

Qwen3-VL Technical Report

View all activity

Organizations

upvoted a paper 2 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 4 days ago • 176

liked a model 2 days ago

MiniMaxAI/MiniMax-M2.1

Text Generation • 229B • Updated 8 days ago • 188k • • 827

upvoted a paper 6 days ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 148

liked a model 17 days ago

google/functiongemma-270m-it

Text Generation • 0.3B • Updated 17 days ago • 49.9k • 737

upvoted an article 17 days ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

18 days ago

•

liked a model 18 days ago

apple/Sharp

Image-to-3D • Updated 17 days ago • 5.38k • 297

upvoted a collection 20 days ago

NVIDIA Nemotron v3

Collection

Open, Production-ready Enterprise Models • 6 items • Updated 4 days ago • 110

upvoted an article 21 days ago

Article

New in llama.cpp: Model Management

24 days ago

•

103

upvoted a paper about 1 month ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 244

liked a model about 1 month ago

deepseek-ai/DeepSeek-V3.2

Text Generation • 685B • Updated Dec 1, 2025 • 115k • • 1.07k

upvoted 2 collections about 1 month ago

Ministral 3

Collection

A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 135

Mistral Large 3

Collection

A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 81

upvoted an article about 1 month ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

Dec 1, 2025

•

265

upvoted a paper about 1 month ago

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published Nov 20, 2025 • 125

liked a model about 2 months ago

facebook/sam3

Mask Generation • 0.9B • Updated Nov 20, 2025 • 1.33M • 1.27k

upvoted 2 papers about 2 months ago

DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published Nov 12, 2025 • 95

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 132

liked a model about 2 months ago

moonshotai/Kimi-K2-Thinking

Text Generation • Updated Nov 8, 2025 • 337k • • 1.59k

upvoted 2 papers 2 months ago

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published Oct 29, 2025 • 77

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation

Paper • 2510.22115 • Published Oct 25, 2025 • 83

Taufiq Dwi Purnomo

AI & ML interests

Recent Activity

Organizations

taufiqdp's activity

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

New in llama.cpp: Model Management

Transformers v5: Simple model definitions powering the AI ecosystem