
Support for ensemble out-of-bag prediction #25

Open · tecosaur opened this issue Nov 25, 2022 · 3 comments

@tecosaur

In addition to out_of_bag_measure, I think it could be rather helpful to be able to obtain the ensemble's overall out-of-bag prediction.

In my own code (which I'd like to convert to use MLJEnsembles when possible), I'm currently building a Matrix with one column of predictions per model, with missing entries where a model made no out-of-bag prediction, and then aggregating the result by taking row-wise means.
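A minimal sketch of that aggregation, assuming a toy predictions matrix (the data and the oob name below are illustrative only):

```julia
using Statistics

# Toy predictions matrix: one column per atomic model, with `missing`
# marking observations that were in-bag for that model (so no
# out-of-bag prediction exists).
preds = [1.0      missing  1.2
         missing  0.9      1.1
         0.8      1.0      missing]

# Row-wise mean over the non-missing entries gives the ensemble's
# overall out-of-bag prediction for each observation.
oob = [mean(skipmissing(row)) for row in eachrow(preds)]
```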

@ablaom
Member

ablaom commented Nov 27, 2022

Sounds like a good suggestion to me. The usual way to expose something like this would be to return the out-of-bag predictions as part of the model report (the last item returned by MLJModelInterface.fit). For example, outlier detection models return training scores that way, and MLJFlux models return training losses that way. Happy to support a PR.
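A minimal sketch of that mechanism, assuming a hypothetical toy bagged regressor (the ToyBaggedMean name and the oob_predictions report key are illustrative, not part of MLJEnsembles):

```julia
import MLJModelInterface as MMI
using Statistics

# Hypothetical toy regressor: each "atomic model" is just the mean of a
# bootstrap sample of y. Only the reporting mechanism is the point here.
mutable struct ToyBaggedMean <: MMI.Deterministic
    n::Int   # number of bags
end

function MMI.fit(model::ToyBaggedMean, verbosity, X, y)
    nobs = length(y)
    preds = Matrix{Union{Float64,Missing}}(missing, nobs, model.n)
    for j in 1:model.n
        bag = rand(1:nobs, nobs)                        # bootstrap resample
        preds[setdiff(1:nobs, bag), j] .= mean(y[bag])  # predict on OOB rows
    end
    # Aggregate row-wise; rows that were in-bag for every model stay missing.
    oob = map(eachrow(preds)) do row
        vals = collect(skipmissing(row))
        isempty(vals) ? missing : mean(vals)
    end
    fitresult = mean(y)
    cache = nothing
    report = (oob_predictions = oob,)   # exposed via the model report
    return fitresult, cache, report
end

MMI.predict(::ToyBaggedMean, fitresult, Xnew) = fill(fitresult, MMI.nrows(Xnew))
```

After fitting a machine wrapping such a model, the out-of-bag predictions would then presumably be accessible as report(mach).oob_predictions.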

I should say, MLJEnsembles is some of the oldest MLJ code and it may be that a rethink is worthwhile, if someone had the resources. See JuliaAI/MLJ.jl#363 for some old related discussion.

@tecosaur
Author

Thanks for that link. A generalized blend of MLJEnsembles and SampleFitCombine seems like it would be quite good, but I'd think some breaking changes would be required to do this nicely.

@ablaom
Member

ablaom commented Nov 29, 2022

If we get a better design, that would be fine by me, as long as we don't need breaking changes to the basic MLJ model interface. I see SampleFitCombine.jl looks abandoned and was never registered, so we may want to be cautious about what we take from there.

I did meet with the author at that time, and I think his main use case was mixture models: creating an ensemble of probability distributions, which in MLJ we treat as supervised learners with empty input X; see here. We haven't actually implemented any of those, although I don't see any immediate problem. So, for example, one could wrap distributions from Distributions.jl that way.
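For concreteness, a sketch of what such a wrapper might look like, assuming a hypothetical DistributionFitter type (the name and all interface details below are illustrative; nothing here is confirmed MLJ functionality):

```julia
import MLJModelInterface as MMI
using Distributions

# Hypothetical wrapper: a Distributions.jl family as a probabilistic MLJ
# model whose input X carries no information (the "empty X" convention).
mutable struct DistributionFitter{D<:Distribution} <: MMI.Probabilistic
    family::Type{D}
end

function MMI.fit(model::DistributionFitter, verbosity, X, y)
    fitresult = Distributions.fit(model.family, y)  # e.g. MLE for Normal
    return fitresult, nothing, NamedTuple()
end

# Every new observation receives the same fitted distribution:
MMI.predict(::DistributionFitter, fitresult, Xnew) = fill(fitresult, MMI.nrows(Xnew))
```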

But this may not be relevant, or may be out of scope. I'm just trying to recall what I remember from our conversations, and I haven't yet reviewed the discussion linked above myself.
