Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Transformers and transformer pipelines #17

Open
jnothman opened this issue Aug 25, 2015 · 3 comments
Open

Transformers and transformer pipelines #17

jnothman opened this issue Aug 25, 2015 · 3 comments

Comments

@jnothman
Copy link

A PMML representation of basic scikit-learn transformers and transformer pipelines / featureunions is needed for this to be more broadly useful.

@NeverNude
Copy link
Contributor

Definitely agree. What transformers were you thinking of and where would be a good place to start?

@jnothman
Copy link
Author

jnothman commented Sep 7, 2015

I'd consider a feature selector initially, as it should be fairly trivial to just ignore some features in PMML, but correctly interpreting the pipeline object may involve some work. Otherwise, something involving projections like PCA or random projection, although I'm not sure that these are often used with forests.

@alex-pirozhenko
Copy link
Owner

I think the most natural way to implement pipelines in PMML would be using nested models. That will allow transformers to define their own LocalTransformations, replace the schemas and pass the adjusted context to the nested model.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants