Advanced computer vision for extracting georeferenced vehicle trajectories from drone imagery Paper • 2411.02136 • Published Nov 4, 2024 • 1
Multi-Source Urban Traffic Flow Forecasting with Drone and Loop Detector Data Paper • 2501.03492 • Published Jan 7, 2025 • 1
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2, 2025 • 148
Music ControlNet: Multiple Time-varying Controls for Music Generation Paper • 2311.07069 • Published Nov 13, 2023 • 45
DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation Paper • 2405.20289 • Published May 30, 2024 • 11
Presto! Distilling Steps and Layers for Accelerating Music Generation Paper • 2410.05167 • Published Oct 7, 2024 • 18
Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation Paper • 2408.00205 • Published Aug 1, 2024 • 5
DITTO: Diffusion Inference-Time T-Optimization for Music Generation Paper • 2401.12179 • Published Jan 22, 2024 • 21