Sanchit Gandhi

sanchit-gandhi
  • sanchitgandhi99
  • sanchit-gandhi

AI & ML interests

Research Scientist at Mistral. Prev Open-Source at Hugging Face, Masters at University of Cambridge.

Recent Activity

new activity 2 days ago
mistralai/Voxtral-Mini-3B-2507:Clarification regarding model architecture
new activity 2 days ago
mistralai/Voxtral-Mini-3B-2507:Request for open sourcing evaluation code --- at least for librispeech
authored a paper 16 days ago
Magistral

Organizations

  • Whisper fine-tuning sprint
  • XTREME-S
  • ESPnet
  • Centre for Vision, Speech and Signal Processing - University of Surrey
  • Whisper Fine-Tuning Event
  • Speech Recognition Community Event Version 2
  • Internal Data & Models for Speech Recognition Event
  • Speech Seq2Seq Experiments
  • Speechbox
  • SpeechColab
  • Linguistic Data Consortium
  • Whisper Distillation
  • University of Edingburgh - Centre For Speech Technology Research
  • ESC Benchmark
  • End-to-End Speech Benchmark
  • Music Gen Sprint
  • Kakao Enterprise
  • USCD REACH
  • TTS Eval (OLD)
  • diarizers-community
  • TTS AGI
  • Sweet Dream(Booth)s

authored 2 papers 16 days ago

Magistral

Paper • 2506.10910 • Published Jun 12 • 63

Voxtral

Paper • 2507.13264 • Published 20 days ago • 25
authored a paper almost 2 years ago

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 58
authored 3 papers over 2 years ago

ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition

Paper • 2210.13352 • Published Oct 24, 2022 • 3

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 32

AfroDigits: A Community-Driven Spoken Digit Dataset for African Languages

Paper • 2303.12582 • Published Mar 22, 2023 • 20