MMDetection: Open MMLab Detection Toolbox and Benchmark Paper • 1906.07155 • Published Jun 17, 2019
HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction Paper • 2412.13187 • Published Dec 17, 2024
One-Minute Video Generation with Test-Time Training Paper • 2504.05298 • Published Apr 7, 2025 • 110
GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond Paper • 1904.11492 • Published Apr 25, 2019
GroupViT: Semantic Segmentation Emerges from Text Supervision Paper • 2202.11094 • Published Feb 22, 2022
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models Paper • 2303.04803 • Published Mar 8, 2023 • 1