Error when calling local model (speaker-diarization-3.0) #1508

Heavenbest · 2023-10-20T08:31:58Z

When I execute the following code, I get an error message.

from pyannote.audio import Pipeline
pipeline = Pipeline.from_pretrained("./models/config.yaml")

# send pipeline to GPU (when available)
import torch
pipeline.to(torch.device("cuda"))

# apply pretrained pipeline
diarization = pipeline("test.wav")

I have downloaded the required model and saved it locally for calling. The specific directory is as follows. When running, an error is reported. The config.yaml is as follows.

github-actions · 2023-10-20T08:32:23Z

Thank you for your issue.
We found the following entry in the FAQ which you may find helpful:

Does pyannote support streaming speaker diarization?

Feel free to close this issue if you found an answer in the FAQ.

If your issue is a feature request, please read this first and update your request accordingly, if needed.

If your issue is a bug report, please provide a minimum reproducible example as a link to a self-contained Google Colab notebook containing everything needed to reproduce the bug:

installation
data preparation
model download
etc.

Providing an MRE will increase your chance of getting an answer from the community (either maintainers or other power users).

Companies relying on pyannote.audio in production may contact me via email regarding:

paid scientific consulting around speaker diarization and speech processing in general;
custom models and tailored features (via the local tech transfer office).

This is an automated reply, generated by FAQtory

ColtonBehannon · 2023-10-23T14:11:22Z

Same issue being faced here. There exists another thread (#1410) where someone seems to have found a way around it but it's not straightforward. The method you followed here seems like it should work as it follows the directions in the tutorial for VAD (https://github.com/pyannote/pyannote-audio/blob/develop/tutorials/applying_a_pipeline.ipynb).

hbredin · 2023-10-24T06:04:47Z

Renaming speaker-embedding.onnx to something that contains the string "wespeaker" (e.g. wespeaker-embedding.onnx) should do the trick.
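For anyone who wants to script the rename, here is a minimal sketch. The helper name `rename_embedding_model` is hypothetical (it is not part of pyannote.audio), and the `./models` directory is taken from the path used in the original post. The contents of config.yaml are not shown in this thread, so the sketch only renames the file; if your config.yaml refers to the old filename, that entry presumably needs to be updated by hand to match.

```python
from pathlib import Path


def rename_embedding_model(
    models_dir,
    old_name="speaker-embedding.onnx",
    new_name="wespeaker-embedding.onnx",
):
    """Rename the local ONNX embedding file so its name contains "wespeaker".

    Hypothetical helper for illustration only; not a pyannote.audio API.
    Returns the path of the renamed file.
    """
    if "wespeaker" not in new_name:
        raise ValueError('new_name must contain the string "wespeaker"')

    src = Path(models_dir) / old_name
    if not src.exists():
        raise FileNotFoundError(src)

    dst = src.with_name(new_name)
    src.rename(dst)
    return dst


# Example call (assumes the model files live in ./models, as in the post above):
# rename_embedding_model("./models")
```

After renaming, reload the pipeline from the local config.yaml as before.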

Heavenbest · 2023-10-24T08:59:46Z

Thanks for your reply, @ColtonBehannon.
When I follow the example in https://github.com/pyannote/pyannote-audio/blob/develop/tutorials/applying_a_pipeline.ipynb, I get the following error message:

I am running it on Google Colab. Can you help me find the cause? Thanks.

Heavenbest · 2023-10-25T01:33:15Z

@hbredin Thanks. After renaming the model file as you suggested, the program runs normally. Thank you very much.

hbredin · 2023-11-09T12:04:34Z

FYI: #1537

hbredin · 2023-11-16T13:03:19Z

The latest version no longer relies on ONNX runtime.
Please update to pyannote.audio 3.1 and pyannote/speaker-diarization-3.1 (and open new issues if needed).

Heavenbest closed this as completed Oct 25, 2023

hbredin mentioned this issue Nov 9, 2023

Get rid of ONNX WeSpeaker in favor of its pytorch implementation #1537

Closed
