rocketmandrey
/

phunter_space

Model card Files Files and versions

MeiGen-MultiTalk Demo

This is a demo of MeiGen-MultiTalk, an audio-driven multi-person conversational video generation model.

Features

💬 Generate videos of people talking from still images and audio
👥 Support for both single-person and multi-person conversations
🎯 High-quality lip synchronization
📺 Support for 480p and 720p resolution
⏱️ Generate videos up to 15 seconds long

How to Use

Upload a reference image (photo of person(s) who will be speaking)
Upload an audio file
Enter a prompt describing the desired video
Click "Generate Video" to process

Tips

Use clear, front-facing photos for best results
Ensure good audio quality without background noise
Keep prompts clear and specific
Supported formats: PNG, JPG, JPEG for images; MP3, WAV, OGG for audio

Limitations

Generation can take several minutes
Maximum video duration is 15 seconds
Best results with clear, well-lit reference images
Audio should be clear and without background noise

Credits

This demo uses the MeiGen-MultiTalk model created by MeiGen-AI.

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support