vbdai's Collections
CPPO
Multi Agent System Optimization
Vision-Language Reasoning
AI4Math
Object Detection
Federated Learning
Graph Neural Networks
Data / Model Search
Trustworthy AI
Inference Optimization

Vision-Language Reasoning

updated 10 days ago
Upvote
3

  • Spatial Reasoning with Vision-Language Models in Ego-Centric Multi-View Scenes

    Paper • 2509.06266 • Published Sep 8, 2025 • 12

  • From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model

    Paper • 2512.05277 • Published Dec 4, 2025 • 6

  • CPPO: Contrastive Perception for Vision Language Policy Optimization

    Paper • 2601.00501 • Published Jan 1, 2026 • 7

  • vbdai/CPPO-3B

    4B • Updated 10 days ago • 36 • 2

  • vbdai/CPPO-7B

    8B • Updated 10 days ago • 19 • 1