Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Special case Geometric(OneHalf()) #1934

Merged
merged 3 commits into from
Jan 8, 2025

Conversation

LilithHafner
Copy link
Contributor

@LilithHafner LilithHafner commented Dec 19, 2024

Before

julia> @b Geometric() mean
2.097 ns

julia> @b Geometric() rand
10.762 ns

julia> @b Geometric() var
2.102 ns

julia> @b Geometric() entropy
8.543 ns

julia> @b Geometric() rand(_, 1000)
9.264 μs (3 allocs: 7.875 KiB)

1331649

julia> @b Geometric() mean
1.199 ns

julia> @b Geometric() rand
2.409 ns

julia> @b Geometric() var
1.195 ns

julia> @b Geometric() entropy
1.197 ns

julia> @b Geometric() rand(_, 1000)
1.118 μs (3 allocs: 7.875 KiB)

After

julia> @b Geometric() mean
1.990 ns

julia> @b Geometric() rand
3.516 ns

julia> @b Geometric() var
2.047 ns

julia> @b Geometric() entropy
8.339 ns

julia> @b Geometric() rand(_, 1000)
1.193 μs (3 allocs: 7.875 KiB)

In the PR, rand uses a substantially different (and simpler) algorithm proposed in #1933 for the p=0.5 case. mean, var, and entropy constant propagate. This latter impact is more of a convenient result than an intended design as those methods should never be bottlenecks for Geometric()

Comment on lines 37 to 38
struct OneHalf <: Real end
Geometric() = Geometric{OneHalf}(OneHalf())
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The common approach in Distributions is to branch depending on the values of the parameters if a faster or more accurate sampler exists. That is, one would just check if p == 1//2 or something similar inside of rand and sampler (if it is specialized).

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There's some runtime cost there but if you'd prefer runtime branching over compile time branching I can switch the implementation. Here's an idea of the runtime cost:

julia> using Random, Chairmarks

julia> current(p) = floor(Int,-randexp() / log1p(-p))
current (generic function with 1 method)

julia> pr() = leading_zeros(rand(UInt))
pr (generic function with 1 method)

julia> proposed(p) = p == 1//2 ? pr() : current(p)
proposed (generic function with 1 method)

julia> @b .2 current,proposed
(8.764 ns, 9.627 ns)

julia> @b .5 current,proposed
(8.582 ns, 2.584 ns)

julia> @b pr
2.280 ns

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think the runtime cost is fine (overhead of ~0.3 ns it seems?). It's still a significant improvement and it would be consistent with existing samplers and rand implementations in Distributions which would also be a bit simpler for users I assume.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done. The runtime comparison was pretty substantial in bulk generation but inlining allows the compiler to hoist it, giving another 3x speedup:

julia> using Distributions

julia> @b Geometric() rand(_, 100)
891.906 ns (2 allocs: 928 bytes)

julia> @eval Distributions function rand(rng::AbstractRNG, d::Geometric)
           if d.p == 0.5
               leading_zeros(rand(rng, UInt)) # This branch is a performance optimization
           else
               floor(Int,-randexp(rng) / log1p(-d.p))
           end
       end
rand (generic function with 181 methods)

julia> @b Geometric() rand(_, 100)
342.896 ns (2 allocs: 928 bytes)

julia> @eval Distributions @inline function rand(rng::AbstractRNG, d::Geometric)
           if d.p == 0.5
               leading_zeros(rand(rng, UInt)) # This branch is a performance optimization
           else
               floor(Int,-randexp(rng) / log1p(-d.p))
           end
       end
rand (generic function with 181 methods)

julia> @b Geometric() rand(_, 100)
110.390 ns (2 allocs: 928 bytes)

(@v1.11) pkg> st Distributions
Status `~/.julia/environments/v1.11/Project.toml`
  [31c24e10] Distributions v0.25.115

@codecov-commenter
Copy link

codecov-commenter commented Dec 20, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 86.02%. Comparing base (ceb6343) to head (55be7f6).
Report is 1 commits behind head on master.

Additional details and impacted files
@@           Coverage Diff           @@
##           master    #1934   +/-   ##
=======================================
  Coverage   86.01%   86.02%           
=======================================
  Files         144      144           
  Lines        8696     8699    +3     
=======================================
+ Hits         7480     7483    +3     
  Misses       1216     1216           

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Copy link
Member

@devmotion devmotion left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Did you check whether both branches are covered by existing tests? Or whether more tests should be added?

@LilithHafner
Copy link
Contributor Author

codecov says

All modified and coverable lines are covered by tests ✅

But I don't see where when looking in test/. I can add some (possibly duplicate) tests on the statistical properties of samples or simply that they are sensible if you would like.

@devmotion
Copy link
Member

I checked and both cases (p=0.5 and p!=0.5) are covered by the reference tests:

{
"expr": "Geometric()",
"dtype": "Geometric",
"minimum": 0,
"maximum": "inf",
"properties": {
"succprob": 0.5,
"failprob": 0.5,
"mean": 1,
"var": 2,
"skewness": 2.12132034355964,
"kurtosis": 6.5,
"entropy": 1.38629436111989
},
"points": [
{ "x": 0, "pdf": 0.5, "logpdf": -0.693147180559945, "cdf": 0.5 },
{ "x": 1, "pdf": 0.25, "logpdf": -1.38629436111989, "cdf": 0.75 },
{ "x": 2, "pdf": 0.125, "logpdf": -2.07944154167984, "cdf": 0.875 },
{ "x": 3, "pdf": 0.0625, "logpdf": -2.77258872223978, "cdf": 0.9375 },
{ "x": 4, "pdf": 0.03125, "logpdf": -3.46573590279973, "cdf": 0.96875 },
{ "x": 5, "pdf": 0.015625, "logpdf": -4.15888308335967, "cdf": 0.984375 },
{ "x": 6, "pdf": 0.0078125, "logpdf": -4.85203026391962, "cdf": 0.9921875 }
],
"quans": [
{ "q": 0.10, "x": 0 },
{ "q": 0.25, "x": 0 },
{ "q": 0.50, "x": 0 },
{ "q": 0.75, "x": 1 },
{ "q": 0.90, "x": 3 }
]
},
{
"expr": "Geometric(0.02)",
"dtype": "Geometric",
"minimum": 0,
"maximum": "inf",
"properties": {
"succprob": 0.02,
"failprob": 0.98,
"mean": 49,
"var": 2450,
"skewness": 2.00010203821338,
"kurtosis": 6.00040816326531,
"entropy": 4.9019556639866
},
"points": [
{ "x": 0, "pdf": 0.02, "logpdf": -3.91202300542815, "cdf": 0.02 },
{ "x": 5, "pdf": 0.018078415936, "logpdf": -4.01303654201574, "cdf": 0.114157619136 },
{ "x": 11, "pdf": 0.0160146270149959, "logpdf": -4.13425278592086, "cdf": 0.2152832762652 },
{ "x": 17, "pdf": 0.0141864353236129, "logpdf": -4.25546902982598, "cdf": 0.304864669142967 },
{ "x": 25, "pdf": 0.0120692945955779, "logpdf": -4.41709068836613, "cdf": 0.408604564816681 },
{ "x": 34, "pdf": 0.0100627473595526, "logpdf": -4.59891505422381, "cdf": 0.506925379381922 },
{ "x": 45, "pdf": 0.00805755728546851, "logpdf": -4.82114483471652, "cdf": 0.605179693012043 },
{ "x": 59, "pdf": 0.00607251311616573, "logpdf": -5.10398273716179, "cdf": 0.702446857307879 },
{ "x": 79, "pdf": 0.00405405816493961, "logpdf": -5.50803688351218, "cdf": 0.801351149917959 },
{ "x": 113, "pdf": 0.00203974815473594, "logpdf": -6.19492893230784, "cdf": 0.900052340417939 },
{ "x": 227, "pdf": 0.000203868054202685, "logpdf": -8.49803756650506, "cdf": 0.990010465344068 }
],
"quans": [
{ "q": 0.10, "x": 5 },
{ "q": 0.25, "x": 14 },
{ "q": 0.50, "x": 34 },
{ "q": 0.75, "x": 68 },
{ "q": 0.90, "x": 113 }
]
},
{
"expr": "Geometric(0.1)",
"dtype": "Geometric",
"minimum": 0,
"maximum": "inf",
"properties": {
"succprob": 0.1,
"failprob": 0.9,
"mean": 9,
"var": 90,
"skewness": 2.00277585143997,
"kurtosis": 6.01111111111111,
"entropy": 3.25082973391448
},
"points": [
{ "x": 0, "pdf": 0.1, "logpdf": -2.30258509299405, "cdf": 0.1 },
{ "x": 2, "pdf": 0.081, "logpdf": -2.5133061243097, "cdf": 0.271 },
{ "x": 3, "pdf": 0.0729, "logpdf": -2.61866663996752, "cdf": 0.3439 },
{ "x": 4, "pdf": 0.06561, "logpdf": -2.72402715562535, "cdf": 0.40951 },
{ "x": 6, "pdf": 0.0531441, "logpdf": -2.934748186941, "cdf": 0.5217031 },
{ "x": 8, "pdf": 0.043046721, "logpdf": -3.14546921825666, "cdf": 0.612579511 },
{ "x": 11, "pdf": 0.031381059609, "logpdf": -3.46155076523013, "cdf": 0.717570463519 },
{ "x": 15, "pdf": 0.0205891132094649, "logpdf": -3.88299282786144, "cdf": 0.814697981114816 },
{ "x": 21, "pdf": 0.0109418989131512, "logpdf": -4.5151559218084, "cdf": 0.901522909781639 },
{ "x": 43, "pdf": 0.00107752636643058, "logpdf": -6.83308726628058, "cdf": 0.990302262702125 }
],
"quans": [
{ "q": 0.10, "x": 0 },
{ "q": 0.25, "x": 2 },
{ "q": 0.50, "x": 6 },
{ "q": 0.75, "x": 13 },
{ "q": 0.90, "x": 21 }
]
},
{
"expr": "Geometric(0.5)",
"dtype": "Geometric",
"minimum": 0,
"maximum": "inf",
"properties": {
"succprob": 0.5,
"failprob": 0.5,
"mean": 1,
"var": 2,
"skewness": 2.12132034355964,
"kurtosis": 6.5,
"entropy": 1.38629436111989
},
"points": [
{ "x": 0, "pdf": 0.5, "logpdf": -0.693147180559945, "cdf": 0.5 },
{ "x": 1, "pdf": 0.25, "logpdf": -1.38629436111989, "cdf": 0.75 },
{ "x": 2, "pdf": 0.125, "logpdf": -2.07944154167984, "cdf": 0.875 },
{ "x": 3, "pdf": 0.0625, "logpdf": -2.77258872223978, "cdf": 0.9375 },
{ "x": 4, "pdf": 0.03125, "logpdf": -3.46573590279973, "cdf": 0.96875 },
{ "x": 5, "pdf": 0.015625, "logpdf": -4.15888308335967, "cdf": 0.984375 },
{ "x": 6, "pdf": 0.0078125, "logpdf": -4.85203026391962, "cdf": 0.9921875 }
],
"quans": [
{ "q": 0.10, "x": 0 },
{ "q": 0.25, "x": 0 },
{ "q": 0.50, "x": 0 },
{ "q": 0.75, "x": 1 },
{ "q": 0.90, "x": 3 }
]
},
{
"expr": "Geometric(0.9)",
"dtype": "Geometric",
"minimum": 0,
"maximum": "inf",
"properties": {
"succprob": 0.9,
"failprob": 0.1,
"mean": 0.111111111111111,
"var": 0.123456790123457,
"skewness": 3.47850542618522,
"kurtosis": 14.1,
"entropy": 0.361203303768276
},
"points": [
{ "x": 0, "pdf": 0.9, "logpdf": -0.105360515657826, "cdf": 0.9 },
{ "x": 1, "pdf": 0.09, "logpdf": -2.40794560865187, "cdf": 0.99 }
],
"quans": [
{ "q": 0.10, "x": 0 },
{ "q": 0.25, "x": 0 },
{ "q": 0.50, "x": 0 },
{ "q": 0.75, "x": 0 },
{ "q": 0.90, "x": 0 }
]
},
for c in ["discrete",
"continuous"]
title = string(uppercase(c[1]), c[2:end])
println(" [$title]")
println(" ------------")
jsonfile = joinpath(@__DIR__, "ref", "$(c)_test.ref.json")
verify_and_test_drive(jsonfile, ARGS, 10^6)
println()
end

@devmotion devmotion merged commit e767d3e into JuliaStats:master Jan 8, 2025
14 checks passed
@LilithHafner LilithHafner deleted the lh/Geometric-05 branch January 8, 2025 13:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants