docling-project/SmolDocling-256M-preview
Image-Text-to-Text • 0.3B • Updated • 25.8k • 1.61k
Generate a talking-head video from a photo and audio
Generate any application by Vibe Coding it
Generate virtual try‑on images of a person wearing a chosen garment
Blazingly Fast and Embarrassingly Simple Song Generation