Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

generate floats using lexographically ordered encoding #49

Open
wants to merge 17 commits into
base: main
Choose a base branch
from

Conversation

raineszm
Copy link

@raineszm raineszm commented Jul 16, 2024

PR Description

This implements an encoding of floating point numbers which is more amenable to nice shrinking as discussed in #22.

The basic idea is to reinterpret unsigned integers in a manner somewhat similar to the IEE754 encoding, but modified such that smaller UInts correspond to "simpler" floats.

The encoding is a port of the one used in Hypothesis

PR Checklist

  • Tests written
  • Documentation expanded
  • Feature ready for review (if false, please mark the PR as Draft!)

This uses the same encoding as `hypothesis` to give floats
better shrinking properties.
@raineszm
Copy link
Author

A good starting point for tests may be to port some of checks from Hypothesis regarding the properties of the encoding.

raineszm added 2 commits July 17, 2024 09:26
This just ports the tests that check the ordering properties of the
encoding and fixes the bugs that were found by the tests.
@raineszm raineszm force-pushed the lexographically-ordered-floats branch from 30533a7 to e58ac23 Compare July 17, 2024 17:24
@raineszm
Copy link
Author

I've implemented the new encoding of floating point numbers and ported the tests from hypothesis for the ordering properties of the encoding.

This means that that the shrinking to smaller UInts should correspond to shrinking toward positive floats of order 1. Nothing seems to have been broken by the change.

I think this is probably ready for at least a preliminary review.

@raineszm raineszm marked this pull request as ready for review July 17, 2024 17:31
@raineszm
Copy link
Author

It seems the Julia 1.8 test timed out when trying to fetch the package list?

Copy link
Owner

@Seelengrab Seelengrab left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Alright, thanks for the PR! I think it's mostly good, just some things around docstrings & preventing erroring paths.

I'm not 100% sure about putting everything in a dedicated submodule (I did mention that I wanted to refactor it, but I was referring to all of data.jl, not just the floating point parts), but with it we might as well go all the way and put the floating point utilities from util.jl here as well. They're currently unused anyway.

src/float.jl Outdated Show resolved Hide resolved
src/float.jl Outdated Show resolved Hide resolved
src/float.jl Outdated Show resolved Hide resolved
src/float.jl Outdated Show resolved Hide resolved
src/float.jl Outdated Show resolved Hide resolved
src/float.jl Outdated Show resolved Hide resolved
src/util.jl Outdated Show resolved Hide resolved
src/float.jl Outdated Show resolved Hide resolved
src/float.jl Outdated Show resolved Hide resolved
src/float.jl Outdated Show resolved Hide resolved
@Seelengrab
Copy link
Owner

Seelengrab commented Jul 18, 2024

I've also come across HypothesisWorks/hypothesis#816 and HypothesisWorks/hypothesis#469, which may be helpful in terms of adding provenance/tests to this! If the issues mentioned there hit Supposition.jl too, it's possible that we'll need our own version of NASTY_FLOATS.

@raineszm raineszm requested a review from Seelengrab July 18, 2024 17:43
Copy link
Owner

@Seelengrab Seelengrab left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Alright, thanks! I've added some minor wording changes so we don't unintentionally imply an operation that doesn't happen, but otherwise this looks good to go! I'll play around a bit with some properties and see how they shrink with this, but after that I'll probably tag a release.

src/float.jl Outdated Show resolved Hide resolved
src/float.jl Outdated Show resolved Hide resolved
src/float.jl Outdated Show resolved Hide resolved
src/float.jl Show resolved Hide resolved
src/float.jl Outdated Show resolved Hide resolved
src/float.jl Outdated Show resolved Hide resolved
src/float.jl Outdated Show resolved Hide resolved
src/float.jl Outdated Show resolved Hide resolved
src/float.jl Outdated Show resolved Hide resolved
src/float.jl Outdated Show resolved Hide resolved
raineszm and others added 3 commits July 19, 2024 08:08
The test was previously failing because NaNs were being generated with
the signbit set, but lex_to_float only generates floats with an
unset signbit.

Refactored the tests to test for all "positive" NaNs, separately
from other positive float.
test/runtests.jl Outdated Show resolved Hide resolved
Copy link
Owner

@Seelengrab Seelengrab left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yup, that should do the trick! No need to worry about doing manual bitshifts either, the "fancy" version compiles away completely:

julia> function foo(f)
           _, expo, frac = tear(f)
           assemble(Float64, zero(UInt), expo, frac)
       end
foo (generic function with 1 method)

julia> @code_llvm debuginfo=:none dump_module=false foo(1.0)
; Function Signature: foo(Float64)
define double @julia_foo_3453(double %"f::Float64") #0 {
top:
  %bitcast_coercion = bitcast double %"f::Float64" to i64
  %0 = and i64 %bitcast_coercion, 9223372036854775807
  %bitcast_coercion7 = bitcast i64 %0 to double
  ret double %bitcast_coercion7
}

@Seelengrab
Copy link
Owner

Seelengrab commented Jul 24, 2024

Alright, I experimented with this a bit and found this when trying to check the generated docs:

julia> rand_goal = rand()
0.93768949423463

julia> @check verbose=true function israndgoal(f=Data.Floats{Float64}())
           # negative absolute distance, because we want to _minimize_ the distance
           target!(-abs(rand_goal - f))
           event!(f)
           f != rand_goal
       end
Test passed
  Context: israndgoal

Score: -0.48696306540036444

  Events:
    UNLABELED_EVENT_0
        0.45072642883426556

This should fail though, as the searched-for target is actually found on main at the moment. We have perfect knowledge of when an example will fail, after all.

In essence, it seems like the changed IR resulted in the naive hill-climbing currently used with target! regressing when it comes to handling floating point data :/ I don't have a direct solution for that, unfortunately.. Making the hill-climbing aware of floating point would be one option worth trying, I guess?

@raineszm
Copy link
Author

That's unfortunate. I think I need to dig into the testcase code to better understand what's going on before I can say anything substantial.

I'll also look into how hypothesis does float shrinking a bit to see if there's any other options there.

Another option would be to add the new float encoding as a feature flag since it seems there are cases where the new encoding is more beneficial and some cases where the old encoding is still better.

Since, it may be still some time before this PR can be merged, would it be possible to cut a new release with the floating point clamping while this is WIP?

Thanks for all your feedback and for taking on the task of bringing proptests back to julia.

@Seelengrab
Copy link
Owner

I'll also look into how hypothesis does float shrinking a bit to see if there's any other options there.

As far as I understand it, they're doing a kind of more limited version of what I would ultimately like to do for #25. The shrinking itself shouldn't be an issue here though - it's the targeting that's unaware of the structure, so it never ends up finding a counterexample to shrink.

That gives me an idea though - if we move the fancy encoding from generation to the shrinking logic, we should be able to circumvent this entirely 🤔 E.g. by first shrinking the mantissa until the number is an integer, then the exponent.. I'll have to think about that. This would keep the target! functionality working (the hill climbing still works), and make shrinks pretty too.

Another option would be to add the new float encoding as a feature flag since it seems there are cases where the new encoding is more beneficial and some cases where the old encoding is still better.

Well, it's not like the old encoding is fundamentally broken; the counterexamples just shrink a bit differently by default, and don't shrink to "pretty" integers first.

Since, it may be still some time before this PR can be merged, would it be possible to cut a new release with the floating point clamping while this is WIP?

Yeah, definitely - I just wanted to let you know that the new approach does have some downsides too. It's a bit unfortunate, but that's how this goes sometimes :/

Still, thank you very much for putting in the effort!

@raineszm
Copy link
Author

Utilizing the new encoding in the shrinking step, while keeping the simple encoding for the generation and target steps seems like a good possibility.

AFAICT right now though there isn't any place in the code that simultaneously knows that shrinking is occurring and that a particular choice represents a float. I guess maybe #25 needs to be implemented first?

@Seelengrab
Copy link
Owner

AFAICT right now though there isn't any place in the code that simultaneously knows that shrinking is occurring and that a particular choice represents a float.

Well.. sort of 😅

function shrink_float(ts::TestState, attempt::Attempt)
new = Attempt(copy(attempt.choices), attempt.generation, attempt.max_generation)
for idx in eachindex(new.choices)
old = new.choices[idx]
old_float = reinterpret(Float64, old)
if isnan(old_float)
n_val = copysign(Inf, old_float)
new.choices[idx] = reinterpret(UInt64, n_val)
if consider(ts, new)
return Some(new)
end
end
end
nothing
end
is the function currently responsible for shrinking floats, but as you say, it just attempts to do that in every place and hopes that works out. #25 is indeed one way that would change, by associating an area in the recorded IR with a given shrinker object. I do have a prototype of that locally, but it ended up regressing shrinking of vectors severely and I haven't yet looked into why.

Nevertheless, if you want to give it a try, throwing the "inverted encoding" into shrink_float is what I'd try first.

@raineszm
Copy link
Author

Nevertheless, if you want to give it a try, throwing the "inverted encoding" into shrink_float is what I'd try first.

I gave quick try to using the lexical encoding in shrink float, but because it is applied to all choices it screws up shrinking of integers.

I did this for a quick and dirty test

function shrink_float(ts::TestState, attempt::Attempt)
    new = Attempt(copy(attempt.choices), attempt.generation, attempt.max_generation)
    for idx in eachindex(new.choices)
        old = new.choices[idx]
        old_float = reinterpret(Float64, old)
        if isnan(old_float)
            n_val = copysign(Inf, old_float)
            new.choices[idx] = reinterpret(UInt64, n_val)
            if consider(ts, new)
                return Some(new)
            end
        end

        lex = FloatEncoding.float_to_lex(old_float)
        res = bin_search_down(0, lex, n -> begin
            new_float = FloatEncoding.lex_to_float(Float64, n)
            new.choices[idx] = reinterpret(UInt64, copysign(old_float, new_float))
            consider(ts, new)
        end)

        new.choices[idx] = @something res Some(new.choices[idx])
    end
    if new.choices == attempt.choices
        return nothing
    else
        return Some(new)
    end
end

while reverting produce! to its old behavior and three tests immediately fail

@testset "finds small list" begin

@testset "finds small list even with bad lists" begin

@testset "reduces additive pairs" begin

because the code now shrinks to counterintuitive examples for integers.

@Seelengrab
Copy link
Owner

Haah.. that's unfortunate :( Guess this really does need #25 ...

Thanks for giving it a try though, much appreciated! I'll probably cut a release with the current state of main tomorrow so that the minimum/maximum change is available.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants