Saving a 70B checkpoint takes ~1000 s during full finetuning on 8 GPUs #1735
Labels: better engineering, discussion
Repro:

```shell
tune run --nproc_per_node 8 full_finetune_distributed --config llama3_1/70B_full max_steps_per_epoch=20
```
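To check how much of the wall time is pure serialization (as opposed to gathering sharded state across ranks), one can time `torch.save` on a state dict in isolation. This is a minimal sketch, not part of the torchtune recipe; the tensor size and output path are placeholders (a real 70B bf16 state dict is on the order of 140 GB, so disk bandwidth alone can account for hundreds of seconds).

```python
import time

import torch

# Stand-in for a large model state dict; scale the tensor up to
# approximate real checkpoint sizes when profiling on actual hardware.
state = {"layer.weight": torch.randn(2048, 2048)}

start = time.perf_counter()
torch.save(state, "/tmp/ckpt_probe.pt")
elapsed = time.perf_counter() - start
print(f"torch.save took {elapsed:.3f}s")
```

Comparing this number against the full checkpoint step in the recipe helps separate disk-bound time from the cost of all-gathering FSDP shards to rank zero before saving.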