Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Why do you think that https://github.com/FFmpeg/FFmpeg is a good alternative to whisper-timestamped?