-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Pipeline - Parameterize transforms of MLDataset or xarray.Dataset #10
Comments
Sample implementation:https://github.com/ContinuumIO/xarray_filters/blob/param-mlds/notebooks/Parameterize-MLDataset.ipynb Notes:"Pipeline compatibility" docIllustrates how to write custom Estimators/Transformers to pass into a sklearn Pipeline: http://scikit-learn.org/stable/developers/contributing.html#pipeline-compatibility Reusing sklearn.pipeline.PipelineIn order to reuse as much of the sklearn.pipeline.Pipeline class as possible, the sample implementation (above) sets the final estimator to
Pipelining example from sklearnReal-world example of Pipeline being used with GridSearchCV: http://scikit-learn.org/stable/auto_examples/plot_digits_pipe.html#sphx-glr-auto-examples-plot-digits-pipe-py
sklearn.pipeline.FeatureUnionSimilar to Pipeline, but parallelizes the computations over each feature, allowing for optional weighting: http://scikit-learn.org/stable/modules/generated/sklearn.pipeline.FeatureUnion.html |
A couple points:
|
Thanks for the notes @PeterDSteinberg . I haven't looked much into FeatureUnion yet but I intend to soon. I added a unit test to confirm that I'm now adding builtin transformers from |
PR looks good - adding |
This issue is about creating
Pipeline
to control parameterization of transformations ofMLDataset
orDataset
objects. When this issue is done, we should be able to do the following:See this example notebook and also look at the documentation on custom estimators in
sklearn
.The text was updated successfully, but these errors were encountered: