use improved Rhat to implement ESS-bulk and ESS-tail #3299

mitzimorris · 2024-07-12T21:28:42Z

Summary:

PR #3266 implemented the rank-normalization and folding needed for Rhat-bulk and Rhat-tail.
We should also add logic for ESS-bulk and ESS-tail, per Vehtari et al https://arxiv.org/abs/1903.08008.

Description:

Bulk ESS and tail ESS are available in R, but not in CmdStan or CmdStanPy. We need to add this to Stan so that all interfaces are providing the same estimates given a sample.

Bulk ESS can be computed from the rank normalized draws, per section 4.1

We will use the term bulk effective sample size (bulk-ESS or bulk-Seff ) to refer to the effective sample size based on the rank normalized draws.

Tail ESS is defined in section 4.3

To get a better sense of the sampling efficiency in the distributions’ tails, we propose to compute the minimum of the effective sample sizes of the 5% and 95% quantiles, which we will call tail effective sample size (tail-ESS or tail-Seff ).

Current Version:

v2.35.0

mitzimorris · 2024-07-12T21:31:46Z

@avehtari @jgabry do we also want to implement the MCSE described in section 4.4 ? has this already been done in R? which package and where?

mitzimorris · 2024-07-12T21:39:42Z

also pings to @aleksgorica and @SteveBronder.

plugging this in to CmdStan's stansummary is trivial, cf: stan-dev/cmdstan@develop...feature/1263-new-rhat-summary

jgabry · 2024-07-14T00:44:59Z

@avehtari @jgabry do we also want to implement the MCSE described in section 4.4 ? has this already been done in R? which package and where?

Yeah bulk and tail ESS are included in the standard diagnostics computed by the posterior package:

https://github.com/stan-dev/posterior/blob/master/R/convergence.R

avehtari · 2024-07-16T08:50:46Z

@avehtari @jgabry do we also want to implement the MCSE described in section 4.4 ? has this already been done in R? which package and where?

Yeah bulk and tail ESS are included in the standard diagnostics computed by the posterior package:

And the implementation for the MCSE for quantiles (Section 4.4 in the paper) is in .mcse_quantile function
https://github.com/stan-dev/posterior/blob/18c915e540e7578bb69830f3d544e1cfcae26b72/R/convergence.R#L418

mitzimorris · 2024-07-16T18:51:11Z

is current ESS computation using autocorrelation? comment from @syclik, added 12 years ago remains:

stan/src/stan/mcmc/chains.hpp

Line 561 in cb7fe1f

// FIXME: reimplement using autocorrelation.

the answer is yes it is. comment should be removed.

jgabry · 2024-07-16T20:34:28Z

I don't know about the C++ version, but the ESS calculations in the posterior package that @avehtari and I linked to definitely do use it.

…

On Tue, Jul 16, 2024 at 12:51 PM Mitzi Morris ***@***.***> wrote: is current ESS computation using autocorrelation? comment from @syclik <https://github.com/syclik>, added 12 years ago remains: https://github.com/stan-dev/stan/blob/cb7fe1f6c4fdabb03d56b299c1424ef3f847a26b/src/stan/mcmc/chains.hpp#L561 — Reply to this email directly, view it on GitHub <#3299 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AB3PQQ2C2JZOQ6OKZDTQTV3ZMVTTNAVCNFSM6AAAAABKZTNB7CVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDEMZRGYYDKMZRGE> . You are receiving this because you were mentioned.Message ID: ***@***.***>

avehtari · 2024-07-17T12:21:56Z

I agree that that FIXME comment should be removed

mitzimorris · 2024-07-24T16:40:09Z

since we don't know how much of the API in chains.hpp is being used elsewhere, I think that we should re-implement it as a new class chainset.hpp, which takes as its underlying data structure a std::vector<Eigen MatrixXd> to hold the chains, where all chains are the same size and shape, with matching column names, and do not contain any warmup draws. enforcing these constraints at object instantiation will allow us to get rid of a lot of repeated code across the various functions which do these checks. at some future point, we could deprecate chains.hpp.

further refactoring of the code in stan/analyze/mcmc: the rank-normalized split rhat added to compute_potential_scale_reduction.hpp should be split out into a separate files for rank-normalization, bulk and tail rhat, and bulk and tail ess.

@WardBrian, @SteveBronder thoughts?

WardBrian · 2024-07-24T16:47:17Z

I'm still not sure we need something like chains.hpp at all. If we had analysis functions that took as an argument std::vector<Eigen::MatrixXd> chains, I suspect that is all we really need. Adding this object whose only job is to call the other functions seems like more code for little benefit, unless we use the object abstraction in some other way as well that I've missed

mitzimorris · 2024-07-24T16:58:16Z

I'm still not sure we need something like chains.hpp at all. If we had analysis functions that took as an argument std::vector<Eigen::MatrixXd> chains, I suspect that is all we really need.

generally agree. the added functionality needed is consistency checking across chains, indexing into this object by column name, and ensuring that variance calculations are done on columns of finite, non-identical values. but yes, the analysis functions should take as arguments std::vector<Eigen::MatrixXd> chains plus column index/name and return the computed statistic/diagnostic for that column across all chains.

mitzimorris added the feature label Jul 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

use improved Rhat to implement ESS-bulk and ESS-tail #3299

use improved Rhat to implement ESS-bulk and ESS-tail #3299

mitzimorris commented Jul 12, 2024

mitzimorris commented Jul 12, 2024

mitzimorris commented Jul 12, 2024

jgabry commented Jul 14, 2024

avehtari commented Jul 16, 2024

mitzimorris commented Jul 16, 2024 •

edited

Loading

jgabry commented Jul 16, 2024 via email

avehtari commented Jul 17, 2024

mitzimorris commented Jul 24, 2024 •

edited

Loading

WardBrian commented Jul 24, 2024

mitzimorris commented Jul 24, 2024 •

edited

Loading

use improved Rhat to implement ESS-bulk and ESS-tail #3299

use improved Rhat to implement ESS-bulk and ESS-tail #3299

Comments

mitzimorris commented Jul 12, 2024

Summary:

Description:

Current Version:

mitzimorris commented Jul 12, 2024

mitzimorris commented Jul 12, 2024

jgabry commented Jul 14, 2024

avehtari commented Jul 16, 2024

mitzimorris commented Jul 16, 2024 • edited Loading

jgabry commented Jul 16, 2024 via email

avehtari commented Jul 17, 2024

mitzimorris commented Jul 24, 2024 • edited Loading

WardBrian commented Jul 24, 2024

mitzimorris commented Jul 24, 2024 • edited Loading

mitzimorris commented Jul 16, 2024 •

edited

Loading

mitzimorris commented Jul 24, 2024 •

edited

Loading

mitzimorris commented Jul 24, 2024 •

edited

Loading