Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add StandardScaler Estimator #2307

Open
icfaust opened this issue Feb 10, 2025 · 0 comments
Open

Add StandardScaler Estimator #2307

icfaust opened this issue Feb 10, 2025 · 0 comments
Labels
good first issue Good for newcomers hacktoberfest help wanted Extra attention is needed

Comments

@icfaust
Copy link
Contributor

icfaust commented Feb 10, 2025

The StandardScaler estimator scales the data to zero mean and unit variance. Use the IncrementalBasicStatistics estimator
to generate the mean and variance to scale the data. Investigate where the new implementation may be low performance and
include guards in the code to use Scikit-learn as necessary. The final deliverable would be to add this estimator to the 'spmd'
interfaces which are effective on MPI-enabled supercomputers, this will use the underlying MPI-enabled mean and variance
calculators in IncrementalBasicStatistics. This is an easy difficulty project, and would be a medium time commitment
when combined with other pre-processing projects.

https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.StandardScaler.html#sklearn.preprocessing.StandardScaler

@icfaust icfaust added good first issue Good for newcomers hacktoberfest help wanted Extra attention is needed labels Feb 10, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers hacktoberfest help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

1 participant