sem is an unfortunate name for "standard error of the mean" #305

randyzwitch · 2017-09-21T00:51:49Z

(continued from Slack)

I was looking to see if Julia had any work around structural equation models, of which the R package is sem. In StatsBase, sem refers to standard error of the mean, which feels not easily Google-able and possibly unnecessary to have as a function altogether.

sem within StatsBase is only used in defining the function and documentation, but I haven't evaluated whether other packages use it. Does it make sense to rename, remove, or leave alone as too trivial to worry about?

The text was updated successfully, but these errors were encountered:

nalimilan · 2017-09-21T09:12:36Z

Renaming it would be fine if we can find a good alternative. What names do other software use?

randyzwitch · 2017-09-21T13:50:09Z

scipy uses sem
Base R doesn't define it (plotrix package in R uses std.error, rapport packagerp.se.mean)
MATLAB doesn't provide a function, according to this blog post

So scipy does lend support to having the name remain as sem, though I feel like semean or similar would be much less ambiguous.

andreasnoack · 2017-09-22T08:17:08Z

I don't think we need this function. It is not as common as other statistics and it is really simple to compute so I'd be in favor of just getting rid of the definition here to free the name down stream.

diegozea · 2017-09-22T09:52:14Z

I'm in favor of keeping the sem functionality, because the simplest version would be std(x)/sqrt(length(x)). So, the naive implementation is slower than the sqrt(var(x)/length(x)) of this definition. However, StatsBase exports stderr and sem. Could sem be renamed to stderr?

andreasnoack · 2017-09-22T11:06:23Z

I'd be fine with using stderr(Vector) for the mean implicitly. It couldn't really be anything else, could it?

nalimilan · 2017-09-22T11:48:06Z

I'd rather keep sem than using stderr for that. That's really a different operation from what stderr currently does (returning the standard errors of coefficients of a model), and the standard error of the mean is just one of the possible standard errors one can compute for a vector.

andreasnoack · 2017-09-22T11:58:13Z

That's really a different operation from what stderr currently does

No. It is exactly the same thing. It returns the estimated standard ~~error~~deviation of an estimator

is just one of the possible standard errors one can compute for a vector

Really? Which? The standard error of the variance? It is as implicit as stderr(RegressionModel) where it returns the estimated variance of the coefficients. It could also have been the standard error of the error variance.

jerlich · 2024-05-20T13:36:30Z

7 years later, I found out the name of the function because of this issue.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sem is an unfortunate name for "standard error of the mean" #305

sem is an unfortunate name for "standard error of the mean" #305

randyzwitch commented Sep 21, 2017

nalimilan commented Sep 21, 2017

randyzwitch commented Sep 21, 2017

andreasnoack commented Sep 22, 2017

diegozea commented Sep 22, 2017 •

edited

Loading

andreasnoack commented Sep 22, 2017

nalimilan commented Sep 22, 2017

andreasnoack commented Sep 22, 2017 •

edited

Loading

jerlich commented May 20, 2024

sem is an unfortunate name for "standard error of the mean" #305

sem is an unfortunate name for "standard error of the mean" #305

Comments

randyzwitch commented Sep 21, 2017

nalimilan commented Sep 21, 2017

randyzwitch commented Sep 21, 2017

andreasnoack commented Sep 22, 2017

diegozea commented Sep 22, 2017 • edited Loading

andreasnoack commented Sep 22, 2017

nalimilan commented Sep 22, 2017

andreasnoack commented Sep 22, 2017 • edited Loading

jerlich commented May 20, 2024

diegozea commented Sep 22, 2017 •

edited

Loading

andreasnoack commented Sep 22, 2017 •

edited

Loading