Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Drishti 's Collections
vlm-unlearning
vlm-unlearning-benchmarks
LLMs
Code models
music generation
OCR/VLMs
biomed ner models + spaces
biomed ner
med benchmarks
medllms
STT
Podcast
Summarizer (Mono + Multi-lingual)
Hugging Face
Meal Planner
Cool chatbots
Social Media
Translate
Personal Stylist + Ecom Assistant
Elsa
Professional Development
Doc/PDF RAG
Consilium
Travel Planner
watch AI learn
Research Co-pilot
multi-agent
Code Agent
GitHub
Search and Monitor Gradio MCP Server + REST API
Environment/Climate/Agriculture
OCR
MCP Router + Customizable MCP Agents
Imp Leaderboards
medical/clinical/health
web search + scrape
TTS
One-stop Knowledge Solution
Intellectual Property One-Stop Solution
VLMs

STT

updated Jun 25
Upvote
-

  • Running on Zero
    988

    Whisper Turbo

    🀯
    988

    Transcribe audio or YouTube videos into text


  • Build error

    Deepspeech_live_speech_to_text

    πŸƒ


  • facebook/wav2vec2-base-960h

    Automatic Speech Recognition β€’ 94.4M β€’ Updated Nov 14, 2022 β€’ 4.2M β€’ 381

  • kyutai/stt-1b-en_fr

    Automatic Speech Recognition β€’ Updated 1 day ago β€’ 96

    Note kyutai released new speech-to-text models that come in 1B & 2B

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs