Spaces:
Running
Running
File size: 1,700 Bytes
5556f2b 5513800 5556f2b 00fa6cd 5556f2b |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 |
---
title: Audio Trabscriber
sdk: docker
emoji: 💻
colorFrom: green
colorTo: blue
short_description: Monophonic audio transcription
---
# Audio to Sheet Music Transcriber
A web application that converts monophonic audio recordings into sheet music using machine learning. This app can transcribe audio files (WAV, MP3) or record live audio and convert it to MIDI and MusicXML formats.
## Features
- Upload audio files (WAV, MP3) for transcription
- Record audio directly in the browser
- Choose between different transcription models
- Download MIDI and MusicXML files
- View basic audio visualizations
## How to Use
1. **Input Audio**:
- Upload an audio file using the file uploader
- OR record audio directly in the browser
2. **Transcription Settings**:
- Select your preferred transcription model
- Adjust audio parameters if needed
3. **Process**:
- Click "Transcribe" to start the transcription
- Wait for the processing to complete
4. **Download**:
- Download the generated MIDI file
- Download the MusicXML file for sheet music
## Models
- **Facebook wav2vec2**: Fast and accurate speech recognition
- **Microsoft SpeechT5**: High-quality speech recognition with better intonation
## Technical Details
This app uses:
- PyTorch and Transformers for audio processing
- Librosa for audio feature extraction
- PrettyMIDI and Music21 for MIDI and MusicXML generation
- Streamlit for the web interface
## Limitations
- Works best with clean, monophonic recordings
- May have difficulty with fast passages or complex articulations
- Performance depends on the quality of the input audio
## License
MIT License - See the [LICENSE](LICENSE) file for details. |