ProsodyLM

This repository contains the model checkpoints and sample training data for
the paper ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models.

πŸ“ Repository structure

  • llm/: ProsodyLM checkpoint and tokenizer
  • tts/: TTS checkpoint and speaker embeddings
  • data/: A small-scale sample dataset (same format as the real training data)

πŸ”— Citation

If you use this resource, please cite the paper above.


License: CC BY-NC 4.0

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support