Cannot reproduce results #13

RomuloPaiva01 · 2020-08-18T21:01:32Z

Every time I call fit_transform I get different results.

I noticed that np.random.permutation changes the random_state, so I used np.random.RandomState(seed=seed).permutation() to solve.

I also noticed that np.random.seed(i) is used in run_select_features, but it changes the random state in the same way, so I can always convert back to the random_state that I had.

Even with those changes, and always getting the same random_state after calling fit_transform, I always end up with different results.

cod3licious · 2020-08-21T10:55:09Z

Yes, randomness is used in a lot of places in the code, both explicitly in places you've mentioned as well as internally (e.g. in some of the models). And it is crucial for the feature selection to use lots of randomness everywhere to make sure a robust subset of features is selected.

If you find a way to catch all instances where randomness is used and make it possible to pass a single random seed to the model to make the results reproducible, I'd love to accept a pull request! :)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cannot reproduce results #13

Cannot reproduce results #13

RomuloPaiva01 commented Aug 18, 2020 •

edited

Loading

cod3licious commented Aug 21, 2020

Cannot reproduce results #13

Cannot reproduce results #13

Comments

RomuloPaiva01 commented Aug 18, 2020 • edited Loading

cod3licious commented Aug 21, 2020

RomuloPaiva01 commented Aug 18, 2020 •

edited

Loading