This template provides a ready-to-use implementation of PyTorch's DistributedDataParallel (DDP). It is designed to make efficient training across multiple GPUs straightforward, with minimal modifications to your own code.
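For orientation, the snippet below sketches the core pattern that a DDP entry point like `main.py` typically follows: each process joins a process group, pins itself to one GPU, and wraps its model so gradients are synchronized across ranks. This is an illustrative sketch, not the template's actual code, and it assumes a `torchrun`-style launcher that exports `RANK`, `LOCAL_RANK`, and `WORLD_SIZE` for every process.

```python
import os

import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP


def main():
    # The launcher (e.g. torchrun) exports RANK, LOCAL_RANK, and WORLD_SIZE;
    # init_process_group reads them from the environment.
    dist.init_process_group(backend="nccl")
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)

    model = torch.nn.Linear(10, 1).to(local_rank)    # toy stand-in model
    ddp_model = DDP(model, device_ids=[local_rank])  # syncs gradients across ranks

    # ... training loop: forward pass, loss, backward, optimizer step ...

    dist.destroy_process_group()


if __name__ == "__main__":
    main()
```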
- PyTorch
- TensorBoard
- `main.py`: The main script for the DDP implementation. It is recommended not to modify this file.
- `model.py`: Place your model architecture here.
- `util.py`: Define parameters, optimizers, and other utilities.
- `dataloader.py`: Contains the DataLoader and Dataset definitions.
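Because DDP runs one process per GPU, `dataloader.py` typically pairs the `Dataset` with a `DistributedSampler` so that each rank trains on a disjoint shard of the data. The sketch below illustrates the pattern; the `build_dataloader` name, the toy `TensorDataset`, and the batch size are illustrative assumptions, not necessarily the template's actual API.

```python
import torch
from torch.utils.data import DataLoader, TensorDataset
from torch.utils.data.distributed import DistributedSampler


def build_dataloader(batch_size: int = 32) -> DataLoader:
    # Toy dataset; replace with your own Dataset implementation.
    dataset = TensorDataset(torch.randn(1024, 10), torch.randn(1024, 1))

    # DistributedSampler splits the indices across ranks so every
    # GPU sees a different shard of the data each epoch.
    sampler = DistributedSampler(dataset, shuffle=True)

    # Leave DataLoader's shuffle at False; the sampler already shuffles.
    return DataLoader(dataset, batch_size=batch_size, sampler=sampler,
                      num_workers=4, pin_memory=True)
```

Note that the training loop should call `loader.sampler.set_epoch(epoch)` at the start of each epoch so the shuffling order differs between epochs.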
To use this template, follow these steps:
- Model Setup: Update your model architecture in `model.py` (a sketch follows this list).
- Parameter Configuration: Specify your training parameters, optimizer, scheduler, etc., in `util.py` (a sketch follows this list).
- Data Preparation: Modify `dataloader.py` to suit your dataset and data loading strategy (see the sampler sketch above).
- Run the Script: Use the following command to run the training script:

  ```bash
  sbatch main.slurm
  ```
- TensorBoard: Use the following command to launch TensorBoard (a logging sketch follows this list):

  ```bash
  ./bd.sh
  ```
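For the Model Setup step, `model.py` only needs to expose an ordinary `nn.Module`; the DDP wrapping happens in `main.py`. A minimal sketch with a hypothetical `Net` class (your architecture will differ):

```python
import torch.nn as nn


class Net(nn.Module):
    # Illustrative architecture; replace with your own model.
    def __init__(self, in_dim: int = 10, out_dim: int = 1):
        super().__init__()
        self.layers = nn.Sequential(
            nn.Linear(in_dim, 64),
            nn.ReLU(),
            nn.Linear(64, out_dim),
        )

    def forward(self, x):
        return self.layers(x)
```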
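For the Parameter Configuration step, `util.py` is where hyperparameters, the optimizer, and the scheduler live. A sketch under the assumption that they are built by plain helper functions; the names `get_optimizer` and `get_scheduler` and the values shown are illustrative, not the template's:

```python
import torch

# Hypothetical hyperparameters; tune these for your task.
LR = 1e-3
EPOCHS = 100
BATCH_SIZE = 32


def get_optimizer(model: torch.nn.Module) -> torch.optim.Optimizer:
    return torch.optim.AdamW(model.parameters(), lr=LR)


def get_scheduler(optimizer: torch.optim.Optimizer):
    # Cosine decay over the full run; swap in any scheduler you prefer.
    return torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=EPOCHS)
```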
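TensorBoard is listed as a prerequisite because training metrics are typically written with `torch.utils.tensorboard.SummaryWriter`. A minimal sketch; writing from rank 0 only is a common convention so that processes do not clobber each other's logs, and the `runs/` directory is an assumption about where `bd.sh` points TensorBoard:

```python
import torch.distributed as dist
from torch.utils.tensorboard import SummaryWriter


def make_writer(log_dir: str = "runs"):
    # Requires an initialized process group; only rank 0 writes logs.
    if dist.get_rank() == 0:
        return SummaryWriter(log_dir=log_dir)
    return None


# Inside the training loop (step is the global iteration counter):
# if writer is not None:
#     writer.add_scalar("train/loss", loss.item(), step)
```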