Hugging Face
Sanchit Gandhi
sanchit-gandhi
367 followers · 13 following
sanchitgandhi99
sanchit-gandhi
AI & ML interests
Open-Source Speech
Articles
TTS Arena: Benchmarking Text-to-Speech Models in the Wild
Feb 27
•
16
Speculative Decoding for 2x Faster Whisper Inference
Dec 20, 2023
•
10
AudioLDM 2, but faster ⚡️
Aug 30, 2023
•
1
A Complete Guide to Audio Datasets
Dec 15, 2022
•
1
Fine-Tune Whisper with 🤗 Transformers
Nov 3, 2022
•
24
Organizations
sanchit-gandhi's activity
New activity in
sweet-dreambooths/musicgen-songstarter-v0.2-hf
10 days ago
Upload processor
#2 opened 10 days ago by
sanchit-gandhi
Upload MusicgenMelodyForConditionalGeneration
#1 opened 10 days ago by
sanchit-gandhi
New activity in
facebook/voxpopuli
21 days ago
Error loading dataset
2
#9 opened 27 days ago by
jorgetebl
New activity in
LIUM/tedlium
21 days ago
FileNotFoundError when loading the LIUM/tedlium data on Windows
4
#4 opened about 2 months ago by
wondav
New activity in
sanchit-gandhi/musicgen-streaming
28 days ago
Song doesn't appear to play (regardless of any browser)
3
#5 opened about 1 month ago by
Nothsa
New activity in
openai/whisper-large-v3
28 days ago
How to get accuracy of transcription from the model?
5
#98 opened about 2 months ago by
Atulad
How we can use this model to achieve a real-time trans?
4
#99 opened about 1 month ago by
Von-violet
New activity in
parler-tts/parler_tts_mini
about 1 month ago
Fixed . on a different line.
1
#2 opened about 1 month ago by
blaise-tk
minor ui fix
1
#4 opened about 1 month ago by
mrfakename
New activity in
parler-tts/parler_tts_mini_v0.1
about 1 month ago
Inference speed
6
#2 opened about 1 month ago by
andreasrath
Link model to the training datasets in metadata
1
#3 opened about 1 month ago by
julien-c
Add training datasets to metadata
1
#5 opened about 1 month ago by
sanchit-gandhi
Update README.md
#4 opened about 1 month ago by
sanchit-gandhi
New activity in
distil-whisper/distil-large-v3
about 1 month ago
Update alignment heads in gen config
#3 opened about 1 month ago by
sanchit-gandhi
New activity in
facebook/voxpopuli
about 1 month ago
LICENSE question
2
#8 opened about 2 months ago by
phoneme
New activity in
sanchit-gandhi/musicgen-streaming
about 1 month ago
Streaming doesn't work yet with gradio 4.0
#4 opened about 1 month ago by
ylacombe
New activity in
distil-whisper/distil-large-v3
about 2 months ago
about multiple languages?
2
#2 opened about 2 months ago by
obtion
New activity in
sanchit-gandhi/whisper-small-hi
about 2 months ago
Adding `safetensors` variant of this model
#17 opened 6 months ago by
SFconvertbot
New activity in
facebook/wav2vec2-lv-60-espeak-cv-ft
about 2 months ago
Adding `safetensors` variant of this model
1
#4 opened 6 months ago by
SFconvertbot
New activity in
facebook/wav2vec2-large-xlsr-53
about 2 months ago
Adding `safetensors` variant of this model
1
#3 opened 3 months ago by
SFconvertbot
New activity in
facebook/wav2vec2-base
about 2 months ago
Adding `safetensors` variant of this model
1
#2 opened 5 months ago by
SFconvertbot
New activity in
distil-whisper/distil-large-v3-ct2
about 2 months ago
Update README.md
3
#2 opened about 2 months ago by
muhtasham
New activity in
distil-whisper/distil-large-v3-ggml
about 2 months ago
is it fp16?
3
#1 opened about 2 months ago by
supercharge19
New activity in
distil-whisper/distil-medium.en
about 2 months ago
Just can't run!
3
#14 opened about 2 months ago by
awesomeandy
New activity in
distil-whisper/distil-large-v3-ct2
about 2 months ago
Update alignment heads
#1 opened about 2 months ago by
sanchit-gandhi
New activity in
distil-whisper/distil-large-v3
about 2 months ago
How to do multilingual transcription?
3
#1 opened about 2 months ago by
emraza110
New activity in
facebook/mms-tts-tao
2 months ago
Reference of the Dataset
1
#1 opened 2 months ago by
ChiaLingWeng
New activity in
openai/whisper-large-v3
2 months ago
How to save the loss value for each step during the training process?
2
#91 opened 2 months ago by
zhouwen999
New activity in
hf-audio/open_asr_leaderboard
2 months ago
[Average WER Calculation] Drop Common Voice WER.
4
#14 opened 2 months ago by
reach-vb
New activity in
openai/whisper-large-v3
2 months ago
Transcript an Spanish audio
3
#86 opened 2 months ago by
Andrews99
New activity in
sanchit-gandhi/whisper-medium-fleurs-lang-id
2 months ago
How do you fine tune Whisper for classification task rather than transcription?
6
#1 opened about 1 year ago by
nkburns
New activity in
openai/whisper-large-v2
3 months ago
Add missing merge to tokenizer
#100 opened 3 months ago by
sanchit-gandhi
New activity in
openai/whisper-large
3 months ago
Add missing merge to tokenizer
#50 opened 3 months ago by
sanchit-gandhi
New activity in
openai/whisper-medium
3 months ago
Add missing merge to tokenizer
#36 opened 3 months ago by
sanchit-gandhi
New activity in
openai/whisper-small
3 months ago
Add missing merge to tokenizer
#38 opened 3 months ago by
sanchit-gandhi
New activity in
openai/whisper-tiny
3 months ago
Add missing merge to tokenizer
#40 opened 3 months ago by
sanchit-gandhi
New activity in
openai/whisper-base
3 months ago
Upload tokenizer
2
#28 opened 5 months ago by
ArthurZ
New activity in
sanchit-gandhi/large-v3-32-2-conditioned-prompt-logic-timestamped-resumed-pt
3 months ago
Update generation_config.json
#2 opened 3 months ago by
sanchit-gandhi
Update generation_config.json
#1 opened 3 months ago by
sanchit-gandhi
New activity in
facebook/s2t-wav2vec2-large-en-de
3 months ago
Updates incorrect tokenizer configuration file
1
#3 opened 3 months ago by
lysandre
New activity in
kakao-enterprise/vits-vctk
3 months ago
List of all available speakers?
2
#2 opened 3 months ago by
Nikerino
New activity in
facebook/mms-tts-eng
3 months ago
What kind of dataset was used?
1
#8 opened 3 months ago by
f0rGoTTen000
New activity in
distil-whisper/whisper-vs-distil-whisper
3 months ago
Distil version does a bad job at Transcribing
3
#2 opened 3 months ago by
arslankas
New activity in
google/gemma-7b-it
3 months ago
error model.generate()
14
#13 opened 3 months ago by
NickyNicky
New activity in
facebook/musicgen-melody
3 months ago
Upload MusicgenMelodyForConditionalGeneration
#8 opened 3 months ago by
ylacombe
Upload processor
#9 opened 3 months ago by
ylacombe
New activity in
facebook/musicgen-stereo-melody
3 months ago
Upload MusicgenMelodyForConditionalGeneration
#2 opened 3 months ago by
ylacombe
Upload processor
#3 opened 3 months ago by
ylacombe
New activity in
facebook/musicgen-stereo-melody-large
3 months ago
Upload MusicgenMelodyForConditionalGeneration
#2 opened 3 months ago by
ylacombe
Upload processor
#3 opened 3 months ago by
ylacombe
New activity in
facebook/musicgen-melody-large
3 months ago
Upload MusicgenMelodyForConditionalGeneration
#3 opened 3 months ago by
ylacombe
Upload processor
1
#4 opened 3 months ago by
ylacombe
New activity in
google/gemma-7b
3 months ago
Upload FlaxGemmaForCausalLM
1
#3 opened 3 months ago by
pcuenq
New activity in
facebook/mms-tts-tam
3 months ago
AttributeError
1
#1 opened 3 months ago by
murthy1998
Fix code examples for transformers
#2 opened 3 months ago by
sanchit-gandhi
New activity in
hf-audio/open_asr_leaderboard
3 months ago
Smaller model sizes lead to worse RTF on Whisper
2
#8 opened 4 months ago by
lorenzopark
Define RTF
#12 opened 3 months ago by
sanchit-gandhi
New activity in
openai/whisper-large-v3
3 months ago
Update forced decoder ids
#79 opened 3 months ago by
sanchit-gandhi
model in closed network
3
#78 opened 3 months ago by
iamwhoiamm
New activity in
sanchit-gandhi/large-v3-32-2-token-ids-freeze-embeds-label-length-448-unshuffled-filtered-conditioned-pt
3 months ago
Update generation_config.json
#1 opened 3 months ago by
sanchit-gandhi