Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[TRTLLM] Bump up dockerfile #2412

Merged

Conversation

ydm-amazon
Copy link
Contributor

@ydm-amazon ydm-amazon commented Oct 1, 2024

Description

Bumps up Dockerfile to TRTLLM v0.12.

Important information
This bump up is different from earlier ones - there are some things that I could not test yet due to some technical issues with installing tensorrt_llm for breaking changes that require new binaries. It seemed to be missing a binary I could not provide. Hence, due to this change, our nightly container is prematurely sent to v12, so it may have some bugs for around a day or so - I will fix all remaining issues and test as quickly as possible.

To clarify, the bugs left are related to how the code for tp_size, pp_size, etc. is written. It should work regardless, but I just need to double check.

I have found that the other breaking changes including use_custom_all_reduce and max_output_len are non-blocking pending changes (they do not affect functionality but I will be cleaning up the code later).

@ydm-amazon ydm-amazon requested review from zachgk and a team as code owners October 1, 2024 05:54
@ydm-amazon ydm-amazon merged commit 77041b5 into deepjavalibrary:master Oct 1, 2024
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants