Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2410.24164

Physical AI Papers

List of papers we review/read for building physical AI models

about 5 hours ago

ViViT: A Video Vision Transformer

Paper • 2103.15691 • Published Mar 29, 2021 • 4
DINO-Foresight: Looking into the Future with DINO

Paper • 2412.11673 • Published Dec 16, 2024 • 1
Scaling Behavior Cloning Improves Causal Reasoning: An Open Model for Real-Time Video Game Playing

Paper • 2601.04575 • Published Jan 8 • 9
Learning Long-Context Diffusion Policies via Past-Token Prediction

Paper • 2505.09561 • Published May 14, 2025

π_0: A Vision-Language-Action Flow Model for General Robot Control

Paper • 2410.24164 • Published Oct 31, 2024 • 30

π_0: A Vision-Language-Action Flow Model for General Robot Control

Paper • 2410.24164 • Published Oct 31, 2024 • 30

π_0: A Vision-Language-Action Flow Model for General Robot Control

Paper • 2410.24164 • Published Oct 31, 2024 • 30

π_0: A Vision-Language-Action Flow Model for General Robot Control

Paper • 2410.24164 • Published Oct 31, 2024 • 30

Robot Learning: A Tutorial

Paper • 2510.12403 • Published Oct 14, 2025 • 124
π_0: A Vision-Language-Action Flow Model for General Robot Control

Paper • 2410.24164 • Published Oct 31, 2024 • 30

Robotics Collection

π_0: A Vision-Language-Action Flow Model for General Robot Control

Paper • 2410.24164 • Published Oct 31, 2024 • 30

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2, 2025 • 87
Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 108
BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published Apr 16, 2025 • 83
FAST: Efficient Action Tokenization for Vision-Language-Action Models

Paper • 2501.09747 • Published Jan 16, 2025 • 29

π_0: A Vision-Language-Action Flow Model for General Robot Control

Paper • 2410.24164 • Published Oct 31, 2024 • 30
Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published Feb 18, 2025 • 58
Open X-Embodiment: Robotic Learning Datasets and RT-X Models

Paper • 2310.08864 • Published Oct 13, 2023 • 2
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation

Paper • 2502.13143 • Published Feb 18, 2025 • 31

Physical AI Papers

List of papers we review/read for building physical AI models

about 5 hours ago

ViViT: A Video Vision Transformer

Paper • 2103.15691 • Published Mar 29, 2021 • 4
DINO-Foresight: Looking into the Future with DINO

Paper • 2412.11673 • Published Dec 16, 2024 • 1
Scaling Behavior Cloning Improves Causal Reasoning: An Open Model for Real-Time Video Game Playing

Paper • 2601.04575 • Published Jan 8 • 9
Learning Long-Context Diffusion Policies via Past-Token Prediction

Paper • 2505.09561 • Published May 14, 2025

Robot Learning: A Tutorial

Paper • 2510.12403 • Published Oct 14, 2025 • 124
π_0: A Vision-Language-Action Flow Model for General Robot Control

Paper • 2410.24164 • Published Oct 31, 2024 • 30

π_0: A Vision-Language-Action Flow Model for General Robot Control

Paper • 2410.24164 • Published Oct 31, 2024 • 30

Robotics Collection

π_0: A Vision-Language-Action Flow Model for General Robot Control

Paper • 2410.24164 • Published Oct 31, 2024 • 30

π_0: A Vision-Language-Action Flow Model for General Robot Control

Paper • 2410.24164 • Published Oct 31, 2024 • 30

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2, 2025 • 87
Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 108
BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published Apr 16, 2025 • 83
FAST: Efficient Action Tokenization for Vision-Language-Action Models

Paper • 2501.09747 • Published Jan 16, 2025 • 29

π_0: A Vision-Language-Action Flow Model for General Robot Control

Paper • 2410.24164 • Published Oct 31, 2024 • 30

π_0: A Vision-Language-Action Flow Model for General Robot Control

Paper • 2410.24164 • Published Oct 31, 2024 • 30
Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published Feb 18, 2025 • 58
Open X-Embodiment: Robotic Learning Datasets and RT-X Models

Paper • 2310.08864 • Published Oct 13, 2023 • 2
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation

Paper • 2502.13143 • Published Feb 18, 2025 • 31

π_0: A Vision-Language-Action Flow Model for General Robot Control

Paper • 2410.24164 • Published Oct 31, 2024 • 30

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs