-
Notifications
You must be signed in to change notification settings - Fork 73
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to just evaluate a pre-trained network on an audio file? #87
Comments
Hey my bad! Should update the docs sometime :) To do this use the predict script like below:
There are further parameters if you need them, use the |
Thanks for the information! I attempted to run the model, and it blew up with this message:
Any ideas? |
Is it expecting me to pass it a table or a directory with a collection of audio files? |
I tried changing the file and then also the sampling rate, and these were the error messages that I got:
|
Make sure the file is a 16khz wav file, is this the case? I've also added documentation here. |
I'm having the same problem. I downloaded the LibriSpeech pre trained model, am launching with I'm trying to run this against a WAV file I downsampled to 16k mono with I'm getting a very similar error when i try to run predict, If I figure out what I'm doing wrong, I'd be happy to contribute some better documentation or strengthen the input file checking in Predict.lua so it throws actionable errors. |
Hi,
I was able to get my environment setup, and I am wanting to just try evaluating an existing model (such as the LibriSpeech network) to attempt speech-to-text on an audio file. I just want to perform the transcription.
How do I go about this with your library? I am not sure from the documentation what steps are necessary and how much extra development work I will need to do (if any) to perform the transcription task from your library.
The text was updated successfully, but these errors were encountered: