-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improvements to ess_rhat #53
Conversation
Pull Request Test Coverage Report for Build 3690426417Warning: This coverage report may be inaccurate.This pull request's base commit is no longer the HEAD commit of its target branch. This means it includes changes from outside the original pull request, including, potentially, unrelated coverage changes.
Details
💛 - Coveralls |
Codecov ReportBase: 94.91% // Head: 95.19% // Increases project coverage by
Additional details and impacted files@@ Coverage Diff @@
## main #53 +/- ##
==========================================
+ Coverage 94.91% 95.19% +0.27%
==========================================
Files 10 10
Lines 669 708 +39
==========================================
+ Hits 635 674 +39
Misses 34 34
Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here. ☔ View full report at Codecov. |
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I haven't checked the PR in detail yet (since I'm on my phone and tests seem to fail) but one immediate suggestion is to avoid the JLD dependency and the JLD file. Either just use a text file or even better create the data in the tests without storing it.
yes, the failures are in
Generating in the tests is cumbersome. When I tried sampling with Turing, I gave up after nearly an hour of sampling. StanSample can do it within a few minutes, but then we need CmdStan installed to run the tests locally and globally. Plus I wouldn't assume the tests would pass for all possible seeds to the sampler, so if CmdStan ever made a change to its sampler, our tests could suddenly begin failing. I'll add these draws as a text file. |
Co-authored-by: David Widmann <[email protected]>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
I've implemented all requested changes. Only the windows tests fail, but that's expected. (see #56 (comment)) The references now look a little strange, but it seems to be the best we can do right now: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Very nice, great PR! Looks good to me once the Linux test passes.
Thanks for the review! Can you merge? The repo settings require all tests pass before merge, and I don't have permissions to override. |
I changed the settings, now PRs can be merged (from everyone with merge permissions) even if Windows tests fail. |
Thanks! |
This PR implements the nonbreaking parts of #22 (specifically, the proposal in #22 (comment)):
estimator
to compute an estimate-specific ESS (and R-hat too, though I don't think this is commonly used)ess_rhat_bulk
to compute rank-normalized ESS and R-hatess_tail
andrhat_tail
(the two cannot be simultaneously computed since they use different transformations)Currently this is marked as a draft while I write more tests and try to figure out how improve type-inferrability for utility functions.