Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error when calling local model,(speaker-diarization-3.0) #1508

Closed
Heavenbest opened this issue Oct 20, 2023 · 7 comments
Closed

Error when calling local model,(speaker-diarization-3.0) #1508

Heavenbest opened this issue Oct 20, 2023 · 7 comments

Comments

@Heavenbest
Copy link

When I execute the following code I get an error message。

from pyannote.audio import Pipeline
pipeline = Pipeline.from_pretrained("./models/config.yaml")

send pipeline to GPU (when available)

import torch
pipeline.to(torch.device("cuda"))

apply pretrained pipeline

diarization = pipeline("test.wav")

image

I have downloaded the required model and saved it locally for calling. The specific directory is as follows. When running, an error is reported. The config.yaml is as follows.

image

@github-actions
Copy link

Thank you for your issue.
We found the following entry in the FAQ which you may find helpful:

Feel free to close this issue if you found an answer in the FAQ.

If your issue is a feature request, please read this first and update your request accordingly, if needed.

If your issue is a bug report, please provide a minimum reproducible example as a link to a self-contained Google Colab notebook containing everthing needed to reproduce the bug:

  • installation
  • data preparation
  • model download
  • etc.

Providing an MRE will increase your chance of getting an answer from the community (either maintainers or other power users).

Companies relying on pyannote.audio in production may contact me via email regarding:

  • paid scientific consulting around speaker diarization and speech processing in general;
  • custom models and tailored features (via the local tech transfer office).

This is an automated reply, generated by FAQtory

@ColtonBehannon
Copy link

Same issue being faced here. There exists another thread (#1410) where someone seems to have found a way around it but it's not straightforward. The method you followed here seems like it should work as it follows the directions in the tutorial for VAD (https://github.com/pyannote/pyannote-audio/blob/develop/tutorials/applying_a_pipeline.ipynb).

@hbredin
Copy link
Member

hbredin commented Oct 24, 2023

Renaming speaker-embedding.onnx into something that contains the wespeaker string should do the trick (e.g. wespeaker-embedding.onnx) .

@Heavenbest
Copy link
Author

thanks for your reply. @ColtonBehannon ..
When I verify based on the example of (https://github.com/pyannote/pyannote-audio/blob/develop/tutorials/applying_a_pipeline.ipynb), the error message is as follows:
image
I am running it on Google colab, can you help me find out the reason? Thanks

@Heavenbest
Copy link
Author

@hbredin thanks ,after changing the model name according to the method you mentioned, the program can run normally. Thank you very much.

@hbredin
Copy link
Member

hbredin commented Nov 9, 2023

FYI: #1537

@hbredin
Copy link
Member

hbredin commented Nov 16, 2023

Latest version no longer relies on ONNX runtime.
Please update to pyannote.audio 3.1 and pyannote/speaker-diarization-3.1 (and open new issues if needed).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants