Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Why do you think that https://github.com/dsalnikov/wav2vec is a good alternative to whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Why do you think that https://github.com/dsalnikov/wav2vec is a good alternative to whisper-timestamped