Add random train-test splitting #28

Open
abbradar opened this issue May 13, 2016 · 1 comment
Comments

@abbradar

abbradar commented May 13, 2016

It can be implemented via the `sample` family of functions from StatsBase. An example implementation with an sklearn-like interface is here. If that's okay, I can make a PR; what holds me back is that I'm a newcomer and may simply have missed an existing, obvious way to do it.

EDIT: Another nice addition would be support for splitting several arrays simultaneously. I'll work on this if it's considered useful.
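
For illustration, here is a minimal sketch of what such a split could look like, including the several-arrays case. It is not the linked implementation: `train_test_split`, `test_size`, and `rng` are hypothetical names chosen to mirror sklearn, and the shuffle uses `randperm` from the standard library rather than StatsBase's `sample`, so the sketch has no package dependencies. It assumes observations lie along the first dimension of every array.

```julia
using Random

# Hypothetical helper (not part of any package): split one or more arrays
# into train/test parts along their first dimension, using one shared shuffle
# so that rows of different arrays stay aligned.
function train_test_split(arrays::AbstractArray...; test_size::Real=0.25,
                          rng::AbstractRNG=Random.default_rng())
    isempty(arrays) && throw(ArgumentError("at least one array is required"))
    0 < test_size < 1 || throw(ArgumentError("test_size must be in (0, 1)"))
    n = size(first(arrays), 1)
    all(a -> size(a, 1) == n, arrays) ||
        throw(DimensionMismatch("all arrays must share the same first dimension"))

    n_test = round(Int, n * test_size)
    perm = randperm(rng, n)          # random permutation of 1:n
    test_idx = perm[1:n_test]
    train_idx = perm[n_test+1:end]

    # Index the first dimension and keep every trailing dimension whole.
    pick(a, idx) = a[idx, ntuple(_ -> Colon(), ndims(a) - 1)...]
    return map(a -> (pick(a, train_idx), pick(a, test_idx)), arrays)
end
```

Usage would look like `(Xtr, Xte), (ytr, yte) = train_test_split(X, y; test_size=0.3)`. Returning one `(train, test)` pair per input array is a design choice made for this sketch; sklearn instead returns a flat list.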

@bobbywlindsey

I think the `sample` function in StatsBase doesn't let the user specify which dimension to sample along, so in practice it's only useful for 1-dimensional arrays.
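
One possible workaround, sketched under assumptions: sample indices rather than elements, then use them to index the chosen dimension. `sample_along` and its `dim` keyword are hypothetical names, and `randperm` from the standard library stands in for the index draw here; drawing the indices with StatsBase's `sample(1:n, k; replace=false)` should work the same way.

```julia
using Random

# Hypothetical helper: draw `n` slices without replacement along dimension `dim`
# by sampling indices into that dimension instead of sampling elements.
function sample_along(A::AbstractArray, n::Integer; dim::Integer=1,
                      rng::AbstractRNG=Random.default_rng())
    1 <= dim <= ndims(A) || throw(ArgumentError("dim must be in 1:ndims(A)"))
    0 <= n <= size(A, dim) || throw(ArgumentError("n must be in 0:size(A, dim)"))

    idx = randperm(rng, size(A, dim))[1:n]   # n distinct indices into `dim`
    # Sampled indices for `dim`, a colon for every other dimension.
    inds = ntuple(k -> k == dim ? idx : Colon(), ndims(A))
    return A[inds...]
end
```

For a matrix with observations in columns, `sample_along(A, 2; dim=2)` would return two randomly chosen columns with all rows intact.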
