
Question about batch size #23

Closed
gyxxyg opened this issue Apr 15, 2024 · 1 comment

Comments


gyxxyg commented Apr 15, 2024

Hello, I attempted to replicate the results from the paper using the specified settings. According to the paper, the experiments were conducted on a single server with 8 V100 GPUs, and the total batch size was 32. Consequently, the batch size for each GPU should be 4. However, when I used this value, the training consistently failed.

Could you please provide the training scripts that were used in the paper? I would greatly appreciate your assistance.

@RenShuhuai-Andy (Owner)

Hi, what does "the training consistently failed" mean?

Do you mean GPU out-of-memory? If so, please refer to #10 (comment)
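If the issue is out-of-memory at the paper's per-GPU batch size of 4, a common workaround is to shrink the per-GPU micro-batch and accumulate gradients so the effective batch size stays at 32. This is only a generic PyTorch sketch with a hypothetical toy model, not the repo's actual training script; the real script may handle this via a config flag instead.

```python
# Sketch: preserve the paper's effective batch size of 32 on 8 GPUs by
# gradient accumulation when the per-GPU batch of 4 does not fit in memory.
# Toy model and data are placeholders, not the project's actual setup.
import torch
import torch.nn as nn

total_batch_size = 32      # total batch size reported in the paper
num_gpus = 8               # 8 x V100 server in the paper
per_gpu_micro_batch = 1    # reduced from 4 to lower peak memory
accum_steps = total_batch_size // (num_gpus * per_gpu_micro_batch)  # = 4

model = nn.Linear(16, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
loss_fn = nn.MSELoss()

optimizer.zero_grad()
for step in range(accum_steps):
    x = torch.randn(per_gpu_micro_batch, 16)
    y = torch.randn(per_gpu_micro_batch, 1)
    # Divide by accum_steps so accumulated gradients average over the
    # full effective batch rather than summing.
    loss = loss_fn(model(x), y) / accum_steps
    loss.backward()
optimizer.step()  # one optimizer update per 32 effective samples across GPUs
```

With `per_gpu_micro_batch = 1` and `accum_steps = 4`, each GPU still contributes 4 samples per optimizer step, matching the paper's setting at a quarter of the peak activation memory.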

@gyxxyg gyxxyg closed this as completed Apr 16, 2024