PedSense-AI: YOLO26 Pedestrian Intent Predictor

PedSense-AI is a computer vision framework for predicting pedestrian crossing intent. This repository hosts the YOLO26 End-to-End Detector, optimized for real-time inference and high-FPS applications.

Architectural Approach: YOLO26

Unlike temporal models, this version of YOLO26 is fine-tuned to classify crossing vs. not-crossing intent directly within the detection head.

  • Type: End-to-end single-stage detector.
  • Input: Single RGB frames.
  • Best For: Real-time edge deployment where low latency is critical.

Performance (10-Epoch Benchmark)

Trained on the JAAD (Joint Attention in Autonomous Driving) dataset, the model achieves the following validation metrics:

Metric Score Definition
mAP@50 0.631 Mean Average Precision at 0.5 Intersection over Union (IoU).
mAP@50-95 0.453 Average mAP across IoU thresholds from 0.5 to 0.95.
Precision 0.684 Probability that a detection is a true pedestrian.
Recall 0.600 Probability that a true pedestrian is detected.

Training Dynamics

The model showed rapid convergence, with Classification Loss dropping from 2.25 to 0.74 over 10 epochs. This indicates a strong ability to distinguish intent categories even without temporal data.

Usage

from ultralytics import YOLO

# Load from Hugging Face
model = YOLO("hf://JcProg/pedsense-yolo26-jaad-10e.pt")

# Predict intent (Class 0: Not-Crossing, Class 1: Crossing)
results = model.predict("street_scene.jpg")
results[0].show()
Downloads last month
2
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support