You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What's wrong?
Normalization is really slow. I did normalization and PCA on the 100000x1000 data set, and PCA was much faster than normalization. Here is what it looks like (memory use/against time from [ENH] Domain transformations in batches for less memory use #5218):
Orange's normalization, which, by default only needs the mean and standard deviation, computes these very inefficiently. As intermediate result it builds a distribution for each variable, which is, for continuous variables, mostly a sorted list of values.
Because normalization is the default for some learners and unsupervised methods, we should make it faster. This should be huge speedup for some Orange's learners, k-means and PCA.
The text was updated successfully, but these errors were encountered:
markotoplak
added
bug report
Bug is reported by user, not yet confirmed by the core team
bug
A bug confirmed by the core team
and removed
bug report
Bug is reported by user, not yet confirmed by the core team
labels
Jan 28, 2021
Normalization is really slow. I did normalization and PCA on the 100000x1000 data set, and PCA was much faster than normalization. Here is what it looks like (memory use/against time from [ENH] Domain transformations in batches for less memory use #5218):
Orange's normalization, which, by default only needs the mean and standard deviation, computes these very inefficiently. As intermediate result it builds a distribution for each variable, which is, for continuous variables, mostly a sorted list of values.
Because normalization is the default for some learners and unsupervised methods, we should make it faster. This should be huge speedup for some Orange's learners, k-means and PCA.
The text was updated successfully, but these errors were encountered: