Is Speaker Diarization Available or Planned? #961

Open
Abandersen04 opened this issue Oct 25, 2024 · 2 comments
Labels
enhancement New feature or request

Comments

@Abandersen04

I’m currently using Buzz for transcribing interviews with multiple speakers. However, I’ve noticed that the transcription doesn’t differentiate between different voices or speakers in the audio. Is speaker diarization (speaker identification) available or on the roadmap as a feature?

Additionally, I noticed the "prompt" feature, but it doesn't seem to affect speaker recognition. Could you clarify its purpose and if it might relate to this?

Thanks in advance for your help!

@raivisdejus
Collaborator

raivisdejus commented Oct 25, 2024

Yes, this is an idea that has been requested previously. It is on the list of things that would be nice to have in the future.
It also seems that https://github.com/MahmoudAshraf97/whisper-diarization works quite well, so it could be implemented in Buzz at some point in the future.

The prompt feature of the Whisper models is described here: https://cookbook.openai.com/examples/whisper_prompting_guide
In my testing it has not shown very meaningful results, but others may get better results. Feel free to share feedback on the results of prompting, as it may be useful to others.

@raivisdejus raivisdejus added the enhancement New feature or request label Oct 25, 2024
@adijahangir123

Isn't it possible to use pyannote.audio, since it claims to show good results?
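
For anyone wondering what combining diarization with a transcript might involve, here is a rough sketch. It assumes the diarizer produces speaker turns as `(start, end, speaker)` tuples and the transcriber produces segments as `(start, end, text)` tuples, and it labels each segment with the speaker whose turns overlap it the most. These data shapes and the function names are hypothetical illustrations only; this is not the API of Buzz, whisper-diarization, or pyannote.audio.

```python
from typing import Dict, List, Tuple

# Hypothetical data shapes, chosen only for this illustration.
Segment = Tuple[float, float, str]  # (start_seconds, end_seconds, text)
Turn = Tuple[float, float, str]     # (start_seconds, end_seconds, speaker)


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Return the length in seconds of the intersection of two intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(
    segments: List[Segment],
    turns: List[Turn],
    unknown: str = "UNKNOWN",
) -> List[Tuple[str, str]]:
    """Label each transcript segment with the speaker that overlaps it most."""
    labeled = []
    for seg_start, seg_end, text in segments:
        # Sum overlap per speaker, since one speaker may have several turns.
        totals: Dict[str, float] = {}
        for turn_start, turn_end, speaker in turns:
            shared = overlap(seg_start, seg_end, turn_start, turn_end)
            if shared > 0:
                totals[speaker] = totals.get(speaker, 0.0) + shared
        # Fall back to a placeholder when no turn covers the segment.
        speaker = max(totals, key=totals.get) if totals else unknown
        labeled.append((speaker, text))
    return labeled


if __name__ == "__main__":
    segments = [(0.0, 2.0, "Hello"), (2.0, 5.0, "Hi there"), (9.0, 10.0, "Bye")]
    turns = [(0.0, 2.2, "SPEAKER_00"), (2.2, 6.0, "SPEAKER_01")]
    for speaker, text in assign_speakers(segments, turns):
        print(f"{speaker}: {text}")
```

Overlap-based matching is only one simple option; a real integration would also need to handle segments that span a speaker change, for example by splitting on word-level timestamps.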
