feat(docker): add a gpu-trainer dockerfile #563

Major-333 · 2023-08-01T13:02:31Z

The existing Dockerfile does not incorporate the installation of a CUDA-related environment, and the lack of transparency in the base image's build process can potentially confound those new to the field.

To address this, I've created an example of a training image, utilizing a widely-used NVIDIA CUDA image as its foundation.

Should you wish to construct a custom CUDA-based training image, typically, all that's needed is the inclusion of your specific code into the image and the installation of the requisite Python packages.

Antlera · 2023-08-01T13:06:32Z

Thank you so much! The base image now indeed meets Cuda driver error sometimes.

Major-333 · 2023-08-01T13:10:40Z

Thank you so much! The base image now indeed meets Cuda driver error sometimes.

I'm glad to help! Going forward, I'll be adding a guide on how to train using GPUs, which will include brief instructions on deploying the NVIDIA device plugin and changing Docker runtime settings.

docker/pytorch/gpu-trainer.dockerfile

workingloong

LGTM

Major-333 requested review from workingloong and Antlera August 1, 2023 13:02

Major-333 requested a review from samplise August 2, 2023 01:28

workingloong reviewed Aug 2, 2023

View reviewed changes

docker/pytorch/gpu-trainer.dockerfile Show resolved Hide resolved

Major-333 force-pushed the wzj_docker_gpu_trainer branch 2 times, most recently from c60e476 to 1b2af2e Compare August 2, 2023 07:44

Major-333 requested a review from workingloong August 2, 2023 07:59

workingloong reviewed Aug 2, 2023

View reviewed changes

docker/pytorch/gpu-trainer.dockerfile Outdated Show resolved Hide resolved

feat(docker): add a gpu-trainer dockerfile

90df047

Major-333 force-pushed the wzj_docker_gpu_trainer branch from 1b2af2e to 90df047 Compare August 2, 2023 08:31

workingloong approved these changes Aug 2, 2023

View reviewed changes

Major-333 merged commit a7e368e into master Aug 2, 2023
13 checks passed

workingloong deleted the wzj_docker_gpu_trainer branch August 12, 2023 08:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(docker): add a gpu-trainer dockerfile #563

feat(docker): add a gpu-trainer dockerfile #563

Major-333 commented Aug 1, 2023

Antlera commented Aug 1, 2023

Major-333 commented Aug 1, 2023

workingloong left a comment

feat(docker): add a gpu-trainer dockerfile #563

feat(docker): add a gpu-trainer dockerfile #563

Conversation

Major-333 commented Aug 1, 2023

Antlera commented Aug 1, 2023

Major-333 commented Aug 1, 2023

workingloong left a comment

Choose a reason for hiding this comment