-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GPU idioms with cross-validation #133
Comments
Similar to #126, but resolving this issue may not fully resolve that one. |
The default in |
I think that makes sense to me. @kadri-nizam , is there any reason that you chose CPU as the default? |
Jeff is right; the default argument should be None instead of CPU. I misunderstood the copying of the input tensors in the constructor when I added the default argument. @jeffjennings Shall I revert the changes or will you push it in a PR you're working on? |
If it's convenient for you to do, that would be great! If not, I'll have time later this afternoon. |
Reverting default device in dataset.py
The device args should be fixed by #170 thanks Kadri! |
Yes, thank you @kadri-nizam ! |
The GriddedDataset now inherits from I think what type of cross-validation will depend on whether the dataset is a
Do we need to think about using Ray Tune or Lightning to solve this issue? For a |
@jeffjennings is this still an issue for you after the updates to |
Feel free to close! |
Thanks :) |
Is your feature request related to a problem or opportunity? Please describe.
Ensure the routines called in a full fit pipeline can be run on a (single) GPU without error.
Ensure these routines don't overload VRAM. E.g. in the cross-val loop, Ian noted "the eventual model that we should adopt is to send one dataset to the GPU at a time, do the training for that, send the results back tothe CPU, and then repeat for the next dataset subset. Because
test_train_datasets
is now a list of a bunch of K copies of the dataset, there's a good chance that it won't fit on GPU memory all at once especially if the original dataset is large."Describe the solution you'd like
Add
.to(<device>)
calls in the fit runner script, only where necessary and starting as early in the pipeline as possible, to integrate GPU support.Add GPU-specific tests for core routines.
Additional context
A further improvement would be to handle multiple GPU workflows.
The text was updated successfully, but these errors were encountered: