Add Llama 3.1 support (updated) #92
base: main
Conversation
Add Llama 3.1 in test_decode.py. Set generation_config._eos_token_tensor to None.
Add Llama 3.1 support (updated)
```diff
@@ -118,6 +118,7 @@ def create(
         )
         generation_config.max_length = max_seq_length

+        generation_config._eos_token_tensor = None
```
As said here, could you replace this with the following snippet?

```diff
- generation_config._eos_token_tensor = None
+ generation_config._eos_token_tensor = getattr(generation_config, "_eos_token_tensor", None)
```
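For context, a minimal sketch of why the `getattr` form is safer (assuming transformers' `GenerationConfig`; whether the private `_eos_token_tensor` attribute exists depends on the transformers version). Hard-coding `None` would clobber a value that newer versions may already have set, while the `getattr` fallback only fills the attribute in when it is missing:

```python
from transformers import GenerationConfig

generation_config = GenerationConfig()

# On transformers versions that do not define the private attribute,
# getattr falls back to None and the assignment simply creates it.
# On versions that already set _eos_token_tensor, the existing value
# is preserved instead of being overwritten with None.
generation_config._eos_token_tensor = getattr(
    generation_config, "_eos_token_tensor", None
)
```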
@tengomucho The review above is from the old PR; it has already been addressed in 10f857c. I have replaced the snippet here.
sorry, I somehow missed it, it's fine then!
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Add workaround for eos_token_tensor in jetstream_pt_support
What does this PR do?
This PR is an updated version of a previously closed PR. It includes the recent updates made in the main branch.
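As a rough illustration of where the workaround lands (a hypothetical sketch based on the diff above, not the PR's actual code; the model id and `max_seq_length` value are placeholders), the assignment goes right after the generation config is prepared in `create()`:

```python
# Hypothetical sketch; identifiers other than generation_config and
# max_seq_length are illustrative placeholders, not the PR's code.
from transformers import GenerationConfig

model_id = "meta-llama/Llama-3.1-8B"  # placeholder model id
max_seq_length = 256                  # placeholder value

generation_config = GenerationConfig.from_pretrained(model_id)
generation_config.max_length = max_seq_length

# Workaround from this PR: ensure _eos_token_tensor exists without
# discarding a tensor that newer transformers versions may have set.
generation_config._eos_token_tensor = getattr(
    generation_config, "_eos_token_tensor", None
)
```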