58 775 827

Sugato Ray PRO

sugatoray

https://linkedin.com/in/sugatoray

AI & ML interests

None yet

Recent Activity

upvoted an article 2 days ago

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

upvoted a collection 2 days ago

Nemotron-Cascade

updated a collection 2 days ago

Papers

View all activity

Organizations

upvoted an article 2 days ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

14 days ago

•

upvoted a collection 2 days ago

Nemotron-Cascade

Collection

Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 17 items • Updated 8 days ago • 40

upvoted a paper 2 days ago

TimeBill: Time-Budgeted Inference for Large Language Models

Paper • 2512.21859 • Published 5 days ago • 17

upvoted a paper 4 days ago

Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers

Paper • 2512.17351 • Published 12 days ago • 24

upvoted a paper 6 days ago

Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation

Paper • 2512.16913 • Published 13 days ago • 33

upvoted a paper 9 days ago

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

Paper • 2512.15603 • Published 14 days ago • 56

upvoted an article 13 days ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

about 1 month ago

•

260

upvoted an article 15 days ago

Article

CUGA on Hugging Face: Democratizing Configurable AI Agents

16 days ago

•

upvoted an article 16 days ago

Article

Codex is Open Sourcing AI models

21 days ago

•

upvoted a collection 21 days ago

GLM-4.6V

Collection

3 items • Updated 23 days ago • 47

upvoted 2 articles 21 days ago

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

22 days ago

•

Article

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

23 days ago

•

upvoted a paper 22 days ago

Mathematical Framing for Different Agent Strategies

Paper • 2512.04469 • Published 27 days ago • 1

upvoted 2 articles 24 days ago

Article

Introducing swift-huggingface: The Complete Swift Client for Hugging Face

27 days ago

•

Article

Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms

Nov 20

•

upvoted a paper 24 days ago

ReviewerToo: Should AI Join The Program Committee? A Look At The Future of Peer Review

Paper • 2510.08867 • Published Oct 9 • 5

upvoted an article 27 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

28 days ago

•

550

upvoted a paper 28 days ago

Evo-Memory: Benchmarking LLM Agent Test-time Learning with Self-Evolving Memory

Paper • 2511.20857 • Published Nov 25 • 2

upvoted a paper 29 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published about 1 month ago • 93

upvoted an article about 1 month ago

Article

Continuous batching from first principles

Nov 25

•

288

Sugato Ray PRO

AI & ML interests

Recent Activity

Organizations

sugatoray's activity

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Transformers v5: Simple model definitions powering the AI ecosystem

CUGA on Hugging Face: Democratizing Configurable AI Agents

Codex is Open Sourcing AI models

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

Introducing swift-huggingface: The Complete Swift Client for Hugging Face

Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms

We Got Claude to Fine-Tune an Open Source LLM

Continuous batching from first principles