MeiGen-MultiTalk Demo

This is a demo of MeiGen-MultiTalk, an audio-driven multi-person conversational video generation model.

Features

  • πŸ’¬ Generate videos of people talking from still images and audio
  • πŸ‘₯ Support for both single-person and multi-person conversations
  • 🎯 High-quality lip synchronization
  • πŸ“Ί Support for 480p and 720p resolution
  • ⏱️ Generate videos up to 15 seconds long

How to Use

  1. Upload a reference image (photo of person(s) who will be speaking)
  2. Upload an audio file
  3. Enter a prompt describing the desired video
  4. Click "Generate Video" to process

Tips

  • Use clear, front-facing photos for best results
  • Ensure good audio quality without background noise
  • Keep prompts clear and specific
  • Supported formats: PNG, JPG, JPEG for images; MP3, WAV, OGG for audio

Limitations

  • Generation can take several minutes
  • Maximum video duration is 15 seconds
  • Best results with clear, well-lit reference images
  • Audio should be clear and without background noise

Credits

This demo uses the MeiGen-MultiTalk model created by MeiGen-AI.

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support