MeiGen-MultiTalk Demo

This is a demo of MeiGen-MultiTalk, an audio-driven multi-person conversational video generation model.

Features

  • ๐Ÿ’ฌ Generate videos of people talking from still images and audio
  • ๐Ÿ‘ฅ Support for both single-person and multi-person conversations
  • ๐ŸŽฏ High-quality lip synchronization
  • ๐Ÿ“บ Support for 480p and 720p resolution
  • โฑ๏ธ Generate videos up to 15 seconds long

How to Use

  1. Upload a reference image (photo of person(s) who will be speaking)
  2. Upload an audio file
  3. Enter a prompt describing the desired video
  4. Click "Generate Video" to process

Tips

  • Use clear, front-facing photos for best results
  • Ensure good audio quality without background noise
  • Keep prompts clear and specific
  • Supported formats: PNG, JPG, JPEG for images; MP3, WAV, OGG for audio

Limitations

  • Generation can take several minutes
  • Maximum video duration is 15 seconds
  • Best results with clear, well-lit reference images
  • Audio should be clear and without background noise

Credits

This demo uses the MeiGen-MultiTalk model created by MeiGen-AI.

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support