nvidia
/

parakeet-tdt-0.6b-v2

@@ -153,6 +153,8 @@ img {
 | [![Model size](https://img.shields.io/badge/Params-0.6B-green#model-badge)](#model-architecture)
 | [![Language](https://img.shields.io/badge/Language-en-orange#model-badge)](#datasets)
 ## <span style="color:#466f00;">Description:</span>
@@ -160,6 +162,8 @@ img {
 This XL variant of the FastConformer [1] architecture integrates the TDT [2] decoder and is trained with full attention, enabling efficient transcription of audio segments up to 24 minutes in a single pass. The model achieves an RTFx of 3380 on the HF-Open-ASR leaderboard with a batch size of 128. Note: *RTFx Performance may vary depending on dataset audio duration and batch size.*
 **Key Features**
 - Accurate word-level timestamp predictions
 - Automatic punctuation and capitalization

 | [![Model size](https://img.shields.io/badge/Params-0.6B-green#model-badge)](#model-architecture)
 | [![Language](https://img.shields.io/badge/Language-en-orange#model-badge)](#datasets)
+> **🎉 NEW: Multilingual Parakeet TDT 0.6B V3 is now available!**
+> 🌍 **25 European Languages** | 🚀 **Enhanced Performance** | 🔗 **[Try it here: nvidia/parakeet-tdt-0.6b-v3](https://huggingface.co/nvidia/parakeet-tdt-0.6b-v3)**
 ## <span style="color:#466f00;">Description:</span>
 This XL variant of the FastConformer [1] architecture integrates the TDT [2] decoder and is trained with full attention, enabling efficient transcription of audio segments up to 24 minutes in a single pass. The model achieves an RTFx of 3380 on the HF-Open-ASR leaderboard with a batch size of 128. Note: *RTFx Performance may vary depending on dataset audio duration and batch size.*
 **Key Features**
 - Accurate word-level timestamp predictions
 - Automatic punctuation and capitalization