
Add support for large-v3 #559

Closed · wants to merge 6 commits

Conversation

@turicas commented Nov 12, 2023

In summary:

  • Upgrade ctranslate2 and transformers
  • Add large-v3 to the list of models (currently using turicas/faster-whisper-large-v3)
  • Fix feature_size (n_mels) for large-v3
  • Change the tokenizer to be compatible with large-v3 (the older one cannot be used, since tokens like transcribe and translate were changed)
  • Update README

Note: I'm not sure the way I've implemented this is the best one, so feel free to give feedback. =) One thing that could be optimized is the loading of the new tokenizer, which requires the transformers library when using large-v3.
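For illustration, once these changes land, large-v3 should be selectable like any other model size through the usual faster-whisper API. A minimal sketch, assuming a CUDA-capable machine; the "large-v3" name resolves to the turicas/faster-whisper-large-v3 conversion mentioned above until an official one is published:

```python
# Minimal usage sketch with the standard faster-whisper API.
# "large-v3" only becomes a valid model size with this PR applied.
from faster_whisper import WhisperModel

# device="cuda" assumes a GPU is available; use device="cpu" otherwise.
model = WhisperModel("large-v3", device="cuda", compute_type="float16")

segments, info = model.transcribe("audio.mp3", beam_size=5)
print("Detected language:", info.language)
for segment in segments:
    print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))
```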

Contributor

Shouldn't "large" be the v3 version?

turicas (Author)

Good point! Should large be a shortcut to the latest large model?

@captainyugi00

Hello, I am currently getting this error after downloading the large-v3 model. Is there any way I can fix this?

File "C:\Users\User\OneDrive\Desktop\GitHub-Projects\Proj-B\faster_whisper\transcribe.py", line 149, in init
processor = AutoProcessor.from_pretrained("openai/whisper-large-v3")
File "C:\Users\User.conda\envs\Proj\lib\site-packages\transformers\models\auto\processing_auto.py", line 287, in from_pretrained
return processor_class.from_pretrained(
File "C:\Users\User.conda\envs\Proj\lib\site-packages\transformers\processing_utils.py", line 226, in from_pretrained
args = cls._get_arguments_from_pretrained(pretrained_model_name_or_path, **kwargs)
File "C:\Users\User.conda\envs\Proj\lib\site-packages\transformers\processing_utils.py", line 270, in _get_arguments_from_pretrained
args.append(attribute_class.from_pretrained(pretrained_model_name_or_path, **kwargs))
File "C:\Users\User.conda\envs\Proj\lib\site-packages\transformers\tokenization_utils_base.py", line 1854, in from_pretrained
return cls._from_pretrained(
File "C:\Users\User.conda\envs\Proj\lib\site-packages\transformers\tokenization_utils_base.py", line 2073, in _from_pretrained
raise ValueError(
ValueError: Non-consecutive added token '<|0.02|>' found. Should have index 50365 but has index 50366 in saved vocabulary.

@turicas commented Nov 13, 2023

> Hello, I am currently getting this error after downloading the large-v3 model. Is there any way I can fix this?

File "C:\Users\User\OneDrive\Desktop\GitHub-Projects\Proj-B\faster_whisper\transcribe.py", line 149, in init processor = AutoProcessor.from_pretrained("openai/whisper-large-v3") File "C:\Users\User.conda\envs\Proj\lib\site-packages\transformers\models\auto\processing_auto.py", line 287, in from_pretrained return processor_class.from_pretrained( File "C:\Users\User.conda\envs\Proj\lib\site-packages\transformers\processing_utils.py", line 226, in from_pretrained args = cls._get_arguments_from_pretrained(pretrained_model_name_or_path, **kwargs) File "C:\Users\User.conda\envs\Proj\lib\site-packages\transformers\processing_utils.py", line 270, in _get_arguments_from_pretrained args.append(attribute_class.from_pretrained(pretrained_model_name_or_path, **kwargs)) File "C:\Users\User.conda\envs\Proj\lib\site-packages\transformers\tokenization_utils_base.py", line 1854, in from_pretrained return cls._from_pretrained( File "C:\Users\User.conda\envs\Proj\lib\site-packages\transformers\tokenization_utils_base.py", line 2073, in _from_pretrained raise ValueError( ValueError: Non-consecutive added token '<|0.02|>' found. Should have index 50365 but has index 50366 in saved vocabulary.

Try to upgrade the transformers library.
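For anyone hitting the same ValueError: older transformers releases appear unable to load the added timestamp tokens in the large-v3 vocabulary, so upgrading the package in the environment that runs faster-whisper is usually enough. A typical invocation, assuming pip manages that environment:

```
pip install --upgrade transformers
```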

@captainyugi00


Thanks, that worked 👍

@salahzoubi

Do you intend to add batch_transcribe? It seems like whisper v3 on HF pipelines allows transcribing multiple audio files at once... it would be an amazing feature!
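For context, a sketch of the Hugging Face pipeline batching the comment refers to, assuming the standard transformers pipeline API; the audio file names are placeholders:

```python
# Hugging Face ASR pipeline: pass a list of files to transcribe them in
# one call; batch_size controls how many chunks run through the model at once.
from transformers import pipeline

asr = pipeline(
    "automatic-speech-recognition",
    model="openai/whisper-large-v3",
    device=0,  # assumes a GPU; use device=-1 (or omit) for CPU
)

results = asr(["audio1.wav", "audio2.wav"], batch_size=8)
for r in results:
    print(r["text"])
```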

@charlesmelby

This comment was marked as off-topic.

@ILG2021 commented Nov 23, 2023

I wish the maintainer would merge this soon; it has been delayed for so long.

@EIIisD commented Nov 24, 2023

please please please

@nguyendc-systran (Collaborator)

Thanks for your PR; it seems to be a duplicate of #578, so I'm closing it. Feel free to reopen if needed.
