Real-time video captioning powered by FastVLM
Generate music from text descriptions and optional melodies