Bug - Offline use of "speechbrain/spkrec-ecapa-voxceleb" does not work #1427
Comments
Thank you for your issue. You might want to check the FAQ if you haven't done so already. Feel free to close this issue if you found an answer in the FAQ. If your issue is a feature request, please read this first and update your request accordingly, if needed. If your issue is a bug report, please provide a minimum reproducible example as a link to a self-contained Google Colab notebook containing everything needed to reproduce the bug:
Providing an MRE will increase your chance of getting an answer from the community (either maintainers or other power users). We also offer paid scientific consulting services around speaker diarization (and speech processing in general).
@asusdisciple any luck? I'm having the same error.
Found the solution at #1294: basically, the embedding path has to have "speechbrain" in it, like
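The example path from that comment is not preserved here. As a rough illustration of the rule it describes, the sketch below assumes the backend is chosen by a plain substring test on the embedding string. The helper name `looks_like_speechbrain` and both directory paths are hypothetical, and this is not pyannote's actual code.

```python
def looks_like_speechbrain(embedding: str) -> bool:
    """Return True if the embedding path or name contains "speechbrain".

    Assumption: the selection is a simple substring check, as the comment
    above suggests. The real logic inside pyannote.audio may differ.
    """
    return "speechbrain" in embedding


# Hypothetical local paths, for illustration only.
path_with_marker = "models/speechbrain/spkrec-ecapa-voxceleb"
path_without_marker = "models/spkrec-ecapa-voxceleb"

print(looks_like_speechbrain(path_with_marker))
print(looks_like_speechbrain(path_without_marker))
```

Under that assumption, keeping "speechbrain" somewhere in the local directory name would be enough for the path to be recognized, while a renamed copy without it would be treated as a regular pyannote checkpoint.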
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
I am trying to use the speaker-diarization pipeline offline. The problem occurs when I try to load the model for speaker embeddings. I found that the problem is that the model comes from SpeechBrain (https://huggingface.co/speechbrain/spkrec-ecapa-voxceleb/tree/main), which is also used in the pyannote speaker-diarization config.yaml on Hugging Face:
I looked into the .pkl file, and it seems that pyannote tags its models with a string that is not found inside the SpeechBrain models when pyannote tries to extract the module name with
module_name: str = loaded_checkpoint["pyannote.audio"]["architecture"]["module"]
So if I use a pyannote speaker embedding model, everything works fine (I tested it), but if I try to run the speaker-diarization pipeline offline with the aforementioned SpeechBrain model, it does not work.
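To make the difference concrete, here is a minimal sketch of the lookup quoted above, applied to two toy dictionaries. The helper name `pyannote_module_name` and the contents of both dictionaries are hypothetical stand-ins, not real checkpoint data. Only the key path `["pyannote.audio"]["architecture"]["module"]` comes from this issue.

```python
from typing import Any, Mapping, Optional


def pyannote_module_name(checkpoint: Mapping[str, Any]) -> Optional[str]:
    """Return the module name stored under the pyannote tag, or None.

    Uses the same key path as the lookup quoted in this issue. A checkpoint
    without the "pyannote.audio" entry yields None instead of raising.
    """
    try:
        return checkpoint["pyannote.audio"]["architecture"]["module"]
    except (KeyError, TypeError):
        return None


# Hypothetical stand-in for a checkpoint saved by pyannote.
pyannote_like = {
    "pyannote.audio": {"architecture": {"module": "some.module", "class": "SomeModel"}},
    "state_dict": {},
}

# Hypothetical stand-in for a SpeechBrain checkpoint: weights only, no tag.
speechbrain_like = {"some_layer.weight": [0.0]}

print(pyannote_module_name(pyannote_like))
print(pyannote_module_name(speechbrain_like))
```

For a real file, the dictionary would presumably come from something like `torch.load(path, map_location="cpu")`. Running a check like this on each downloaded file is one way to see which models carry the pyannote tag before the pipeline tries to load them.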
Maybe you have an idea for a workaround? The error related to this issue is:
My local .yaml file looks like this:
I call the pipeline with
pipeline = Pipeline.from_pretrained("config/config_pyannote.yaml")
All the names are correct and all models were downloaded from Hugging Face. Do you have any idea why this could happen?