Bridging Your Imagination with Audio-Video Generation via a Unified Director Paper • 2512.23222 • Published Dec 29, 2025 • 6
Is Nano Banana Pro a Low-Level Vision All-Rounder? A Comprehensive Evaluation on 14 Tasks and 40 Datasets Paper • 2512.15110 • Published Dec 17, 2025 • 10
REASONEDIT: Towards Reasoning-Enhanced Image Editing Models Paper • 2511.22625 • Published Nov 27, 2025 • 47
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation Paper • 2511.20635 • Published Nov 25, 2025 • 32
MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent Paper • 2502.03207 • Published Feb 5, 2025 • 1
In-Context Learning with Unpaired Clips for Instruction-based Video Editing Paper • 2510.14648 • Published Oct 16, 2025
RegionE: Adaptive Region-Aware Generation for Efficient Image Editing Paper • 2510.25590 • Published Oct 29, 2025 • 28
WithAnyone: Towards Controllable and ID Consistent Image Generation Paper • 2510.14975 • Published Oct 16, 2025 • 85
OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation Paper • 2506.07977 • Published Jun 9, 2025 • 41
Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers Paper • 2506.03065 • Published Jun 3, 2025 • 27