
Multiple GPU low performance #1734

Open
jetstudio-io opened this issue Oct 1, 2024 · 3 comments

Labels
question Further information is requested

Comments

@jetstudio-io

Hello,
I have an issue with multi-GPU performance.

  • Using the recipe lora_finetune_single_device with the config mini_lora_single_device.yaml on one 6000 Ada, I get ~5 it/s.
  • Using the recipe lora_finetune_distributed with the config mini_lora.yaml on 2 x 6000 Ada, I get 1.5 s/it.

The dataset I am fine-tuning on is HuggingFaceFW/fineweb-edu-score-2.
How can I improve performance on multiple GPUs?
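For reference, the two runs were launched roughly as follows (exact flags approximate):

tune run lora_finetune_single_device --config phi3/mini_lora_single_device

tune run --nnodes 1 --nproc_per_node 2 lora_finetune_distributed --config phi3/mini_lora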
@RdoubleA
Contributor

RdoubleA commented Oct 1, 2024

Hi @jetstudio-io, thanks for the question. The it/s or sec/it metric is not a great indicator of performance here. Instead, I would check the logs for tokens per second to do a better comparison. For example:

$ cat /tmp/full-llama3.2-finetune/log_1727815865.txt
Step 1 | loss:2.7667150497436523 lr:2e-05 tokens_per_second_per_gpu:766.5005627443386
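To pull that metric for every step out of the text log, a plain grep works (same log path as above):

$ grep tokens_per_second_per_gpu /tmp/full-llama3.2-finetune/log_1727815865.txt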

Or you can track these metrics over time if you log with WandB (and set log_peak_memory_stats=True in your launch command to also capture memory usage).
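A rough config sketch for that, assuming the standard torchtune recipe config layout (the project name is just a placeholder):

metric_logger:
  _component_: torchtune.training.metric_logging.WandBLogger
  project: phi3-mini-lora   # placeholder project name
log_every_n_steps: 1
log_peak_memory_stats: True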

Many factors can impact raw seconds per iteration, gradient accumulation in particular, and it is not necessarily indicative of how quickly training converges. That being said, there are still other ways to improve performance. You can check our documentation page on the memory/performance features you can enable for some ideas (cc @felipemello1): https://pytorch.org/torchtune/main/tutorials/memory_optimizations.html.

A very direct way to improve throughput is to enable packing in your dataset. If you are using the torchtune dataset builder functions, you can simply pass packed=True in your config or launch command.
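For example, a rough config sketch, assuming the dataset is loaded with torchtune's text_completion_dataset builder (packing also needs tokenizer.max_seq_len to be set):

dataset:
  _component_: torchtune.datasets.text_completion_dataset
  source: HuggingFaceFW/fineweb-edu-score-2
  packed: True
tokenizer:
  max_seq_len: 2048   # required for packing; the rest of the tokenizer config stays as-is

Or, equivalently, pass dataset.packed=True tokenizer.max_seq_len=2048 on the command line.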

@felipemello1
Contributor

felipemello1 commented Oct 1, 2024

As @RdoubleA said, the config has "gradient_accumulation_steps: 16", which means that one step is actually 16 forward/backward passes.

Maybe try the following:

tune run lora_finetune_single_device --config phi3/mini_lora_single_device \
compile=True \
dataset.packed=True \
tokenizer.max_seq_len=2048 \
batch_size=4 \
gradient_accumulation_steps=2 \
enable_activation_checkpointing=False \
log_every_n_steps=1 \
metric_logger._component_=torchtune.training.metric_logging.WandBLogger \
log_peak_memory_stats=True

If you are running out of memory, set enable_activation_checkpointing=True.
Otherwise, increase batch_size.

You can see your memory usage on the Weights & Biases website.

Also, use the torchtune/PyTorch nightlies for maximum performance: https://github.com/pytorch/torchtune#install-nightly-release

@jetstudio-io
Author

Thanks for your advice, I'll try measuring tokens/s.

@joecummings added the question (Further information is requested) label on Oct 2, 2024