SegResNet crashes mid training #7283
LOL1234LOL
started this conversation in
General
Replies: 1 comment
-
Hi @LOL1234LOL, according to the error message, the error was raised by LoadImage, possibly because of a data preparation issue. I recommend loading the data directly, without auto3dseg, to determine whether there is a problem with the data. Hope it helps, thanks!
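A minimal sketch of that check, in case it helps: it walks the datalist and tries to load every file on its own, collecting the ones that fail. The helper names `collect_image_paths` and `find_unreadable_files` are hypothetical. The datalist layout (a `training` list whose entries have `image` and `label` keys, with paths relative to `dataroot`) is an assumption about your setup, so adjust it to match your files. Only `monai.transforms.LoadImage` is a MONAI call.

```python
import json
from pathlib import Path


def collect_image_paths(datalist_file, dataroot, sections=("training",), keys=("image", "label")):
    """Return every file path referenced by the datalist (layout is assumed, see above)."""
    with open(datalist_file) as f:
        datalist = json.load(f)
    paths = []
    for section in sections:
        for entry in datalist.get(section, []):
            for key in keys:
                value = entry.get(key)
                if value is None:
                    continue
                # A key may hold one path or a list of paths (e.g. multi-modal input).
                items = [value] if isinstance(value, str) else value
                for item in items:
                    paths.append(str(Path(dataroot) / item))
    return paths


def find_unreadable_files(paths, loader=None):
    """Try to load each path and return a list of (path, error) for the failures."""
    if loader is None:
        # Imported lazily so the helper can also be exercised with a stand-in loader.
        from monai.transforms import LoadImage
        loader = LoadImage(image_only=True)
    failures = []
    for path in paths:
        try:
            loader(path)
        except Exception as exc:  # report every failure instead of stopping at the first
            failures.append((path, repr(exc)))
    return failures


# Example usage (file names are placeholders for your own datalist and data root):
#   paths = collect_image_paths("datalist.json", "/path/to/dataroot")
#   for path, error in find_unreadable_files(paths):
#       print(path, error)
```

If this reports a file, that file is worth inspecting (missing, truncated, or in an unexpected format) before rerunning the training. If nothing is reported, the data can at least be read outside auto3dseg, and the cause is more likely elsewhere.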
-
I am using auto3dseg under WSL on a custom dataset. Training with DiNTS worked, but when I run SegResNet it appears to work for a while and then crashes, leaving behind an error I don't quite understand.
The command:
python3 -m monai.apps.auto3dseg AutoRunner run --input="input.yaml"
It trains for a while:
And then it crashes:
I don't know where the error lies. I tried changing:
os.environ["PYTORCH_CUDA_ALLOC_CONF"] = "max_split_size_mb:8192"
from 2048 to 8192 in segmenter.py, which seems to extend the time it can run before crashing, but it never reaches the end of an epoch. Please help, thank you.