When running set_finetuning_example.ipynb on a MacBook on the mps backend I seem to be hitting an OOM, is this expected? #67
Comments
Have you tried reducing the batch size to reduce the memory footprint?
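One common way to shrink the per-step memory footprint without changing the training dynamics is to cut the per-device batch size and compensate with gradient accumulation. A minimal sketch (the concrete numbers are illustrative, not the notebook's actual settings; `per_device_train_batch_size`, `gradient_accumulation_steps`, and `max_seq_length` are the usual `transformers`/`trl` config names):

```python
def effective_batch_size(per_device_batch: int, grad_accum_steps: int) -> int:
    """Number of samples contributing to each optimizer step."""
    return per_device_batch * grad_accum_steps

# Hypothetical replacement config: instead of a single batch of 8,
# process 2 samples at a time and accumulate gradients over 4 steps.
config_kwargs = dict(
    per_device_train_batch_size=2,   # was e.g. 8
    gradient_accumulation_steps=4,   # 2 * 4 = same effective batch of 8
    max_seq_length=512,              # shorter sequences also cut activation memory
)

# The optimizer still sees the same effective batch size:
assert effective_batch_size(2, 4) == effective_batch_size(8, 1)
```

Activation memory scales roughly linearly with batch size and sequence length, so both knobs help; only peak memory changes, not the gradient the optimizer ultimately applies.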
Hello, yes, it does reduce the overall memory footprint. I reduced the batch size and the sequence length and got it to run for around 1000 iterations. However, the total memory isn't really what I was wondering about; it's more that the total memory usage keeps increasing, almost as though there's a memory leak in the trainer when it saves the batch statistics. Actually, I guess it's less a memory leak and more that, since Apple uses unified memory, there is effectively less memory available than on a typical Nvidia GPU setup, where the system RAM is separate and can hold those values. Hence why I wanted to know whether this is expected behaviour on Macs with this particular model.
Thanks, I understand the question now. Your intuition makes sense. If you want to discuss the topic further, you could join the Discord; there are also MLX pros over there. smol-course Discord channel: https://discord.com/channels/879548962464493619/1313889336907010110
Oh hey, thank you for the link, I will check it out. Also, thanks for the responses!
I ran into a similar issue during fine-tuning:
Is it expected for the notebook to use up to 30 GB of the shared memory? I checked my Activity Monitor and the memory usage hits 30 GB before it crashes. I just wanted to know whether this is expected for this particular notebook, since it seems to be a small model ("HuggingFaceTB/SmolLM2-135M") and a small dataset ("HuggingFaceTB/smoltalk").
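One way to tell a leak-like climb apart from a one-time plateau is to log the process's peak resident memory every few steps and watch its trend during training. A minimal stdlib sketch (the helper names and logging interval are my own; `ru_maxrss` is only available on Unix-like systems, including macOS):

```python
import resource
import sys

def peak_rss_mb() -> float:
    """Peak resident set size of this process in MiB.

    ru_maxrss is reported in kilobytes on Linux and in bytes on macOS.
    """
    rss = resource.getrusage(resource.RUSAGE_SELF).ru_maxrss
    if sys.platform == "darwin":
        return rss / (1024 * 1024)
    return rss / 1024

def log_memory(step: int, every: int = 100) -> None:
    """Print peak memory every `every` steps, e.g. from a training callback."""
    if step % every == 0:
        print(f"step {step}: peak RSS ~ {peak_rss_mb():.0f} MiB")
```

If the logged peak keeps growing linearly with the step count rather than leveling off after the first few batches, that points at something being retained per step (cached batch statistics, logged tensors that still hold graph references, etc.) rather than at the model's steady-state footprint.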