Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Transcription issues. #190

Open
RezaTokhshid opened this issue Apr 12, 2024 · 1 comment
Open

Transcription issues. #190

RezaTokhshid opened this issue Apr 12, 2024 · 1 comment

Comments

@RezaTokhshid
Copy link

I've come across some issues while testing the repo. Have you seen any of these issues or do you have a solution for them?

  • "thank you" or "alright" where it's not said and model stops transcribing after that point
  • Same words at the end of transcription when it's not said (on this one up to that word every thing is transcribed ok)
  • Audio not transcribe. I had a audio that got "So." as transcription. I tested the same with a diff model, your distill large, but it worked ok
  • Last 2-3 words just repeating

Let me know if any of these ring a bell or if you need more info.

@Jrcordal
Copy link

Same, also sometimes a word repeates too much. I was thinking to use regular expression for the strings, did you arrived at a better solution?

I've come across some issues while testing the repo. Have you seen any of these issues or do you have a solution for them?

  • "thank you" or "alright" where it's not said and model stops transcribing after that point
  • Same words at the end of transcription when it's not said (on this one up to that word every thing is transcribed ok)
  • Audio not transcribe. I had a audio that got "So." as transcription. I tested the same with a diff model, your distill large, but it worked ok
  • Last 2-3 words just repeating

Let me know if any of these ring a bell or if you need more info.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants