Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

👷 Add integration test to the CI #140

Merged
merged 3 commits into from
Jan 8, 2025

Conversation

baptistecolle
Copy link
Collaborator

@baptistecolle baptistecolle commented Jan 7, 2025

What does this PR do?

This PR adds the TGI docker integration test to the CI.
This is related to issue #124 about the new integration test for TGI.

Link to the new action execution: https://github.com/huggingface/optimum-tpu/actions/runs/12667037809/job/35299701615

Special thanks to @paulinebm for all the help debugging the CI 🙌

@baptistecolle baptistecolle force-pushed the fix-integration-test-tgi-with-ci branch from 66ebdb7 to a7af4f1 Compare January 8, 2025 08:45
@baptistecolle baptistecolle force-pushed the fix-integration-test-tgi-with-ci branch from a7af4f1 to 5187173 Compare January 8, 2025 08:46
@baptistecolle baptistecolle marked this pull request as ready for review January 8, 2025 09:39
HF_HUB_CACHE: /mnt/hf_cache/cache_huggingface
HF_TOKEN: ${{ secrets.HF_TOKEN_OPTIMUM_TPU_CI }}
V5_LITEPOD_8_ENV: ${{ vars.V5_LITEPOD_8_ENV}}

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this does not have container section, does it means it runs on the VM directly?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, indeed, if you run in the container directly, you have networking issues. So we use the VM directly that spawn the container

PJRT_DEVICE: TPU
HF_HUB_CACHE: /mnt/hf_cache/cache_huggingface
HF_TOKEN: ${{ secrets.HF_TOKEN_OPTIMUM_TPU_CI }}
V5_LITEPOD_8_ENV: ${{ vars.V5_LITEPOD_8_ENV}}
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wonder if it might be better to rename this into something mode generic, e.g.: TPU_ENV, the reason being that we will soon need to support v6e too.

tpu-tgi:
docker build --rm -f text-generation-inference/docker/Dockerfile \
--build-arg VERSION=$(VERSION) \
--build-arg TGI_VERSION=$(TGI_VERSION) \
--ulimit nofile=100000:100000 \
$(if $(NETWORK),--network $(NETWORK),) \
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why do we need the network hot at this step?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is needed for the CI. If you build with the default network, some requests during the build process are blocked. For example, the protobuf installation fails.

Copy link
Collaborator

@tengomucho tengomucho left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@baptistecolle baptistecolle merged commit 9c791e0 into main Jan 8, 2025
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants