Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add warnings when input has too much / too little data #216

Open
mastoffel opened this issue Jun 7, 2024 · 0 comments
Open

Add warnings when input has too much / too little data #216

mastoffel opened this issue Jun 7, 2024 · 0 comments

Comments

@mastoffel
Copy link
Collaborator

mastoffel commented Jun 7, 2024

  • models might overfit when there are not enough datapoints relative to the number of variables. We should check good rules of thumb and give a warning if this is the case
  • some models, in particular GP's, get very slow and compute intensive with too many samples. We should give a warning when the data is too big. Even better, we could provide a drop in replacement for the scikit-learn GP that deals with larger datasets (probably at the cost of accuracy). This is an active area of research but there might be solutions.

Thanks to @MaxBalmus for the feedback.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant