nithinraok
commited on
Commit
Β·
4f7f008
1
Parent(s):
f1cd669
add v3
Browse filesSigned-off-by: nithinraok <nithinrao.koluguri@gmail.com>
README.md
CHANGED
|
@@ -153,6 +153,8 @@ img {
|
|
| 153 |
| [](#model-architecture)
|
| 154 |
| [](#datasets)
|
| 155 |
|
|
|
|
|
|
|
| 156 |
|
| 157 |
## <span style="color:#466f00;">Description:</span>
|
| 158 |
|
|
@@ -160,6 +162,8 @@ img {
|
|
| 160 |
|
| 161 |
This XL variant of the FastConformer [1] architecture integrates the TDT [2] decoder and is trained with full attention, enabling efficient transcription of audio segments up to 24 minutes in a single pass. The model achieves an RTFx of 3380 on the HF-Open-ASR leaderboard with a batch size of 128. Note: *RTFx Performance may vary depending on dataset audio duration and batch size.*
|
| 162 |
|
|
|
|
|
|
|
| 163 |
**Key Features**
|
| 164 |
- Accurate word-level timestamp predictions
|
| 165 |
- Automatic punctuation and capitalization
|
|
|
|
| 153 |
| [](#model-architecture)
|
| 154 |
| [](#datasets)
|
| 155 |
|
| 156 |
+
> **π NEW: Multilingual Parakeet TDT 0.6B V3 is now available!**
|
| 157 |
+
> π **25 European Languages** | π **Enhanced Performance** | π **[Try it here: nvidia/parakeet-tdt-0.6b-v3](https://huggingface.co/nvidia/parakeet-tdt-0.6b-v3)**
|
| 158 |
|
| 159 |
## <span style="color:#466f00;">Description:</span>
|
| 160 |
|
|
|
|
| 162 |
|
| 163 |
This XL variant of the FastConformer [1] architecture integrates the TDT [2] decoder and is trained with full attention, enabling efficient transcription of audio segments up to 24 minutes in a single pass. The model achieves an RTFx of 3380 on the HF-Open-ASR leaderboard with a batch size of 128. Note: *RTFx Performance may vary depending on dataset audio duration and batch size.*
|
| 164 |
|
| 165 |
+
|
| 166 |
+
|
| 167 |
**Key Features**
|
| 168 |
- Accurate word-level timestamp predictions
|
| 169 |
- Automatic punctuation and capitalization
|