MeiGen-MultiTalk Demo
This is a demo of MeiGen-MultiTalk, an audio-driven multi-person conversational video generation model.
Features
- π¬ Generate videos of people talking from still images and audio
- π₯ Support for both single-person and multi-person conversations
- π― High-quality lip synchronization
- πΊ Support for 480p and 720p resolution
- β±οΈ Generate videos up to 15 seconds long
How to Use
- Upload a reference image (photo of person(s) who will be speaking)
- Upload an audio file
- Enter a prompt describing the desired video
- Click "Generate Video" to process
Tips
- Use clear, front-facing photos for best results
- Ensure good audio quality without background noise
- Keep prompts clear and specific
- Supported formats: PNG, JPG, JPEG for images; MP3, WAV, OGG for audio
Limitations
- Generation can take several minutes
- Maximum video duration is 15 seconds
- Best results with clear, well-lit reference images
- Audio should be clear and without background noise
Credits
This demo uses the MeiGen-MultiTalk model created by MeiGen-AI.
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
π
Ask for provider support