feat(docker): add a gpu-trainer dockerfile #563

Merged
merged 1 commit into from
Aug 2, 2023

Conversation

Major-333
Contributor

The existing Dockerfile does not install a CUDA environment, and because the base image's build process is not transparent, it can confuse newcomers.

To address this, I've added an example training image built on a widely used NVIDIA CUDA image.

If you want to build a custom CUDA-based training image, you typically only need to add your own code to the image and install the required Python packages.
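
As a rough illustration of that recipe, a minimal sketch might look like the following. The base image tag, `requirements.txt`, `train.py`, and the `/workspace` directory are assumptions chosen for illustration; they are not taken from the Dockerfile added in this PR.

```dockerfile
# Sketch only: the image tag, paths, and file names below are assumptions,
# not the contents of the Dockerfile added in this PR.
FROM nvidia/cuda:11.8.0-cudnn8-runtime-ubuntu22.04

# Install Python and pip from the Ubuntu package repositories.
RUN apt-get update && \
    apt-get install -y --no-install-recommends python3 python3-pip && \
    rm -rf /var/lib/apt/lists/*

WORKDIR /workspace

# Install the required Python packages first so this layer is cached
# when only the training code changes (hypothetical requirements.txt).
COPY requirements.txt .
RUN python3 -m pip install --no-cache-dir -r requirements.txt

# Add the training code to the image (hypothetical train.py entry point).
COPY . .

CMD ["python3", "train.py"]
```

An image like this would be built with `docker build` and run with GPU access enabled (for example `docker run --gpus all ...`), which requires the NVIDIA Container Toolkit on the host.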

@Antlera
Collaborator

Antlera commented Aug 1, 2023

Thank you so much! The current base image does indeed sometimes run into CUDA driver errors.

@Major-333
Contributor Author

> Thank you so much! The current base image does indeed sometimes run into CUDA driver errors.

I'm glad to help! Going forward, I'll be adding a guide on how to train using GPUs, which will include brief instructions on deploying the NVIDIA device plugin and changing Docker runtime settings.

Major-333 requested a review from samplise August 2, 2023 01:28
Major-333 force-pushed the wzj_docker_gpu_trainer branch 2 times, most recently from c60e476 to 1b2af2e August 2, 2023 07:44
Collaborator

@workingloong workingloong left a comment

LGTM

@Major-333 Major-333 merged commit a7e368e into master Aug 2, 2023
13 checks passed
@workingloong workingloong deleted the wzj_docker_gpu_trainer branch August 12, 2023 08:17
3 participants