-
-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix and test lcm([1//2, 1//2]) == 1//1 #56423
Conversation
@test gcd(Int[]) === 0 | ||
@test lcm(Int[]) === 1 | ||
@test gcd(Rational{Int}[]) === 0//1 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These three tests are unnecessarily strict.
@test gcd(Int[]) === 0 | |
@test lcm(Int[]) === 1 | |
@test gcd(Rational{Int}[]) === 0//1 | |
@test gcd(Int[]) == 0 | |
@test gcd(Rational{Int}[]) == 0 |
Relaxing them should help avoid spurious test failures in the future.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If someone changes the return type of gcd(Int[])
or gcd(Rational{Int}[])
, I want a test to fail. Similarly, if someone changes the return type or value of lcm(Int[])
, I want a test to fail. Those would be breaking changes that should not be made accidentally.
I'm adding them because I noticed that #56113 would break all three of these new tests but doesn't break any existing tests; I want to make sure that behavior change is only made if folks are aware that we are changing existing behavior.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Changing the return type isn't breaking, as long as the value is correct and the type subtypes the correct abstract type.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Making a mathematical operation on Int
s that used to return Int
s instead return a different type could introduce errors or correctness bugs based on overflow behavior changes or failed conversions and could also introduce significant performance issues through the introduction of type instability in user code.
Changing 1+1
to return an Int8 is breaking even through Int8 has all the same abstract supertypes as Int. A similar change to lcm
or gcd
would be much less impactful because +
is so much more commonly used but would still be breaking.
@@ -191,6 +191,13 @@ end | |||
|
|||
@test lcm(T[2, 4, 6]) ⟷ T(12) | |||
end | |||
|
|||
# Issue #55379 | |||
@test lcm([1//2; 1//2]) === lcm([1//2, 1//2]) === lcm(1//2, 1//2) === 1//2 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
IMO, this is wrong. Rational numbers are a field, and as such to not have an LCM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
xref #56166
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think it is perfectly sensible to define lcm on rationals, provided we choose a definition for "multiple" that is based on integers. A definition that is mathematically sensible and in line with current behavior (except for #55379) is
For a signed ring R
, m
in R
is an lcm of a_1, a_2, ... a_n in R
iff m
is positive and is a multiple of a_1
, a_2
, ... and a_n
and for all other m'
in R
that are multiples of a_1
, a_2
, ... and a_n
, m'
is a multiple of m
.
And we define "multiple" as
a
is a multiple of b
iff there exists an integer n
such that b * n = a
where *
refers to multiplication in the ring of integers, not in the ring of integers mod 2^64.
While there is certainly a case to be made that lcm
should throw on rationals, now is not the time to make that case. That time was #33910. lcm
has supported rationals under the definition above since 1.4.
I am not aware of any definitions of lcm
over the integers or the rationals that would result in returning different answers; though some definitions should throw errors:
Using "m <= m'
" instead of "m'
is a multiple of m
" produces the same results on rationals and integers.
Defining "multiple" based on the existence of a ring element that takes b
to a
instead of based on an integer that takes b
to a
is equivalent for the ring of integers. For the rationals, every positive rational is the lcm
of every set of rationals which doesn't seem like a particularly useful definition. Folks sometimes (e.g. #27039 (comment), #56166 (comment)) say that lcm(a,b) == 1
for a field which, if we are defining multiple as "all field elements are multiples of all other field elements", is not wrong but we could just as correctly say that lcm(a,b) == 17
for all field elements a
and b
under that definition which means the right answer for a programming language is an error, not the number 1 nor the number 17.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I took a different route to obtain the same answer here #55379 (comment)
with the FTA based definition, lcm(1//2, 1//2) == 1//2
. in my opinion it is perfectly clean & consistent this way.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah yes, the FTA definition is nice. AFAICT these two definitions agree in all cases.
For the empty case over the rationals the definition I listed indicates there is no LCM. Using the FTA definition extended to rationals requires computing max
over an empty set of integers which also does not exist.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
as counterpoint to my interpretation, it would lead to gcd([]) = 1
as well, which is normatively worse than gcd([]) = 0
to be honest though, I think all these choices are basically just convention and I don't think it matters that much that there is some underlying purity determining the convention as long as what we end up with makes sense.
I think the current state of this PR makes good choices for each convention, and afaict it looks like the court of public opinion --- which seems to arise from the recursive definitions --- agrees https://math.stackexchange.com/questions/1755266/gcd-of-an-empty-set https://www.reddit.com/r/learnmath/comments/v9vmfm/whats_the_empty_lcm/
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@adienes, please take a look at #56166
all these choices are basically just convention and I don't think it matters that much that there is some underlying purity determining the convention as long as what we end up with makes sense
The only way to end up with something that makes sense is to respect the math ("underlying purity").
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
respectfully, I don't think "the math" makes any particularly consistent demands about what to do in these edge cases. and any choice made is essentially just convention (which can and does vary among authors)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As described in the linked issue, there indeed are multiple possible choices here. That doesn't imply that anything goes, though.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think a more charitable interpretation of my comments (rather than "anything" goes) is "a lot of things could go, and I think the choices made here are good ones"
but anyway I guess let's just leave to to triage. I don't typically attend triage but if the discussion comes up, I would upvote the implementation in this PR.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Seems like there should be a decision on the desired semantics, no point in making PRs before that.
We decided on semantics in #27039 and #33910. Were this a PR that changed existing non-buggy behavior, I would agree that we need to come to a decision about desired semantics before merging. As is, it's a simple bugfix: julia> lcm(1//6, 1//3)
1//3 => 1//3
julia> lcm(1//6, 2//3)
2//3 => 2//3
julia> lcm([1//6, 1//3])
1//1 => 1//3
julia> lcm([1//6, 2//3])
2//1 => 2//3
julia> lcm(Rational[])
1//1 => 1//1 We should not wait for a decision on revising previously decided semantics before merging bugfixes for existing semantics. |
The solution seems simple, make empty LCM throw, do a PkgEval. |
just to be clear
so Julia would certainly not be alone in this convention and I think calling it "doubling down on a bug" is unnecessarily inflammatory. There certainly are many languages which choose to error --- it seems that MATLAB (& Octave), R, GMP, numpy, Magma, GAP, probably many more error. but crucially as far as I can see none of these define so I repeat: it's not a "bug" it's a convention we can choose. and if we choose to define the empty |
This PR doesn't change the behavior of |
This PR introduces dubious special casing to fix the behavior of LCM for nonintegers. Doing that before good semantics are decided upon could make it more difficult or impossible to fix these semantics in the future. Right now LCM for nonintegers is incorrect anyway, so behavioral changes are more easily possible. |
For Mathematica, it is left undefined apparently:
|
To add another data point, Maple accepts
Regarding Mathematica, I'm not sure if In[1]:= LCM[List[]]
Out[1]:= {} |
For two argument julia> lcm(1//0, 3//2)
3//2 # Should throw
julia> lcm(1//0, 1//0)
1//0 # Correct
julia> lcm(0//1, 2//3)
0//1 # Correct
julia> lcm(1//0, -1//0)
1//0 # Correct
Other than the non-integer empty case, this pull request correctly generalizes Action steps:
|
This now implements triage's proposed semantics and is ready for review. With the semantics change requested by triage, this becomes slightly breaking so I removed the backport labels. |
Some folks define `lcm(x::T,y::T)` as any `z::T` such that there exists `a::T, b::T` with `a*x==z` and `b*y==z` and for all `zʹ::T` such that there exist `a::T, b::T` with `a*x==zʹ` and `b*y==zʹ`, there also exists `c::T` with `z*c==zʹ`. This is a reasonable definition, but not what we use. Notably, it makes `lcm(x::Rational, y::Rational) = z::Rational` true for all finite, nonzero `x`, `y`, and `z`. The definition we use requires `a`, `b`, and `c` to all be _integers_, not rationals in the case of `lcm(x::Rational, y::Rational)`. This clarifies what we mean when we define `lcm(x::Rational, y::Rational)` and also how the generic function should be extended. See [this thread](#56423 (comment)) for more discussion
Fixes #55379
An alternative to #56113 that does not include minor breaking changes to
gcd
orlcm(::AbstractArray{<:Integer})
.