---
title: MeiGen MultiTalk Demo
emoji: 🎬
colorFrom: red
colorTo: blue
sdk: streamlit
sdk_version: 1.28.1
app_file: app.py
pinned: false
license: apache-2.0
---

# MeiGen-MultiTalk Demo

This is a demo of MeiGen-MultiTalk, an audio-driven multi-person conversational video generation model.

## Features

- 💬 Generate videos of people talking from still images and audio
- 👥 Support for both single-person and multi-person conversations
- 🎯 High-quality lip synchronization
- 📺 Support for 480p and 720p output resolutions
- ⏱️ Generate videos up to 15 seconds long

## How to Use

1. Upload a reference image (a photo of the person or people who will be speaking)
2. Upload an audio file
3. Enter a prompt describing the desired video
4. Click "Generate Video" to process

## Tips

- Use clear, front-facing photos for best results
- Ensure good audio quality without background noise
- Keep prompts clear and specific
- Supported formats: PNG, JPG, JPEG for images; MP3, WAV, OGG for audio
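
If you want to check files before uploading them, the supported formats listed above can be expressed as a small extension check. This is a minimal sketch, not the demo's own validation code: the helper name `classify_upload` and the two constant names are hypothetical, and the check looks only at the file extension, not at the file contents.

```python
from pathlib import Path

# Extensions taken from the "Supported formats" tip above.
# The constant and function names are illustrative, not part of the demo.
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg"}
AUDIO_EXTENSIONS = {".mp3", ".wav", ".ogg"}


def classify_upload(filename: str) -> str:
    """Return "image", "audio", or "unsupported" based on the file extension."""
    suffix = Path(filename).suffix.lower()  # case-insensitive comparison
    if suffix in IMAGE_EXTENSIONS:
        return "image"
    if suffix in AUDIO_EXTENSIONS:
        return "audio"
    return "unsupported"


if __name__ == "__main__":
    for name in ["portrait.PNG", "dialogue.wav", "clip.mp4"]:
        print(name, "->", classify_upload(name))
```

Lowercasing the suffix means files such as `photo.JPG` are accepted alongside `photo.jpg`.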

## Limitations

- Generation can take several minutes
- Maximum video duration is 15 seconds
- Best results with clear, well-lit reference images
- Audio should be clear and without background noise
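
To see in advance whether an audio clip fits within the 15-second limit, you can measure its length locally. The sketch below uses only Python's standard `wave` module, so it handles uncompressed WAV files only (MP3 and OGG would need a third-party decoder). The function names are hypothetical, and it assumes the audio length determines the video length; how the demo itself treats longer audio is not stated here.

```python
import io
import wave

# Limit stated in the Limitations list above; the names below are illustrative.
MAX_DURATION_SECONDS = 15.0


def wav_duration_seconds(data: bytes) -> float:
    """Return the duration of an in-memory WAV file in seconds."""
    with wave.open(io.BytesIO(data), "rb") as wav_file:
        # Duration = number of audio frames / frames per second.
        return wav_file.getnframes() / wav_file.getframerate()


def fits_duration_limit(data: bytes, limit: float = MAX_DURATION_SECONDS) -> bool:
    """Return True if the WAV clip is no longer than the given limit."""
    return wav_duration_seconds(data) <= limit
```

For a file on disk, something like `fits_duration_limit(open("speech.wav", "rb").read())` would perform the check, where `speech.wav` is a placeholder path.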

## Credits

This demo uses the MeiGen-MultiTalk model created by MeiGen-AI.

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference