faster-whisper vs whisper: PyAV stops during decode, ffmpeg continues #988
Comments
Can you upload the file to test?
This is the Python source code: `import time ... logging.basicConfig() ... model_size = "large-v3" ... def speech2text(fileName): ... if __name__ == "__main__": ...`
This is the python3 output:
This is the output of ffmpeg (it reports an Input/output error but still exports the file to /tmp/1.mp3):
This is the output of whisper:
The audio file is corrupted at the end, so an error is expected during decoding. However, PyAV stops processing as soon as it hits the error, while whisper, which decodes through ffmpeg, processes the file up to the point where the corrupted area is detected.
Expected behavior: PyAV processes the valid part of the file and emits a warning message.
Can we add a parameter to faster-whisper so that it behaves like whisper, i.e. processes the file up to the corrupted part?
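To illustrate the requested behavior, here is a minimal sketch of a wrapper that keeps every frame decoded before the failure and turns the decode error into a warning. `iter_until_error` is a hypothetical helper, not part of faster-whisper or PyAV, and the default exception type (`OSError`) is only an assumption; pass whichever exception classes your PyAV version actually raises.

```python
import warnings
from typing import Iterable, Iterator, Tuple, Type, TypeVar

T = TypeVar("T")


def iter_until_error(
    frames: Iterable[T],
    errors: Tuple[Type[BaseException], ...] = (OSError,),  # assumption, adjust as needed
) -> Iterator[T]:
    """Yield frames until the source is exhausted or a decode error occurs.

    On a decode error, emit a warning and stop instead of propagating the
    exception, so the caller keeps everything decoded so far.
    """
    iterator = iter(frames)
    count = 0
    while True:
        try:
            frame = next(iterator)
        except StopIteration:
            return  # normal end of stream
        except errors as exc:
            warnings.warn(f"decoding stopped after {count} frames: {exc!r}")
            return  # keep the valid part, drop the rest
        yield frame
        count += 1
```

In practice the iterable would be the frame generator produced by the decoder, for example `iter_until_error(container.decode(audio=0), errors=(...))` with a PyAV container.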
Workaround: use the ffmpeg command line to re-export the corrupted file, discarding the invalid data, and then run faster-whisper on the new file produced by ffmpeg. This wastes processing time that a parameter to ignore corrupted content would save.
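The workaround can be scripted roughly as follows. This is a sketch under assumptions: `ffmpeg` is on the `PATH`, the helper names (`build_ffmpeg_command`, `reexport`, `transcribe_with_workaround`) are hypothetical, and the exact ffmpeg options used in the original report are not shown, so only a plain re-export is used here. Because ffmpeg reports an Input/output error on the corrupted tail, the exit code is not checked; the script only verifies that an output file was written.

```python
import os
import subprocess
from typing import List


def build_ffmpeg_command(src: str, dst: str) -> List[str]:
    # -y overwrites dst if it exists; -i names the (possibly corrupted) input.
    return ["ffmpeg", "-y", "-i", src, dst]


def reexport(src: str, dst: str) -> bool:
    # ffmpeg may report an error on the corrupted tail, so do not use check=True.
    subprocess.run(build_ffmpeg_command(src, dst), check=False)
    return os.path.exists(dst) and os.path.getsize(dst) > 0


def transcribe_with_workaround(
    src: str, dst: str = "/tmp/1.mp3", model_size: str = "large-v3"
) -> List[str]:
    # Imported lazily so the helpers above work without faster-whisper installed.
    from faster_whisper import WhisperModel

    if not reexport(src, dst):
        raise RuntimeError(f"ffmpeg produced no output for {src}")
    model = WhisperModel(model_size)
    segments, _info = model.transcribe(dst)
    # segments is a generator; iterating it runs the transcription.
    return [segment.text for segment in segments]
```

The extra decode and encode pass is exactly the overhead the issue asks to avoid.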