Phú Võ

phuvo

phuvo

AI & ML interests

None yet

Recent Activity

upvoted a paper 28 days ago

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

upvoted a paper 28 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

upvoted a paper 6 months ago

Fast and Simplex: 2-Simplicial Attention in Triton

View all activity

Organizations

None yet

upvoted 2 papers 28 days ago

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 223

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 244

upvoted a paper 6 months ago

Fast and Simplex: 2-Simplicial Attention in Triton

Paper • 2507.02754 • Published Jul 3, 2025 • 25

upvoted 2 papers 7 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16, 2025 • 273

Time Blindness: Why Video-Language Models Can't See What Humans Can?

Paper • 2505.24867 • Published May 30, 2025 • 80

upvoted a paper 8 months ago

BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs

Paper • 2504.18415 • Published Apr 25, 2025 • 47

upvoted a paper 9 months ago

BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published Apr 16, 2025 • 75

liked a model 9 months ago

microsoft/bitnet-b1.58-2B-4T

Text Generation • 0.8B • Updated 19 days ago • 5.68k • 1.23k

upvoted 2 papers 10 months ago

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18, 2025 • 153

LongRoPE2: Near-Lossless LLM Context Window Scaling

Paper • 2502.20082 • Published Feb 27, 2025 • 36

upvoted 2 papers 11 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16, 2025 • 166

NoLiMa: Long-Context Evaluation Beyond Literal Matching

Paper • 2502.05167 • Published Feb 7, 2025 • 15

upvoted a paper about 1 year ago

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1, 2024 • 151

upvoted 5 papers over 1 year ago

Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models

Paper • 2409.18943 • Published Sep 27, 2024 • 28

VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models

Paper • 2409.17066 • Published Sep 25, 2024 • 28

liked a model over 1 year ago

Sao10K/L3.1-70B-Euryale-v2.2

Updated Aug 25, 2024 • 212 • 63

upvoted a paper over 1 year ago

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Paper • 2408.07055 • Published Aug 13, 2024 • 67

Phú Võ

AI & ML interests

Recent Activity

Organizations

phuvo's activity