adding reduce to Ops.jl #840

tharittk · 2025-03-03T14:49:34Z

I deliberately did not test on non-commutative operators—it will fail when tested against Julia's reduce. I read the discussion on the internet that it is supposed to be so, i.e., reduce should take a commutative operator; otherwise, the result will be non-deterministic, especially in a parallel system.

giordano

I deliberately did not test on non-commutative operators—it will fail when tested against Julia's reduce.

Does that mean that the reduce method introduced here has different semantics than Julia's reduce? That'd be unideal, asi it can lead to hard-to-track bugs when people write generic code and Reactant suddenly changes their meaning (similar for example to #755)

src/Ops.jl

mofeing · 2025-03-03T19:37:35Z

one thing to note is that methods in Ops should replicate the semantics of StableHLO (and possibly other MLIR dialects), not Julia.

the adaptation from StableHLO to Julia semantics is done in the method specializations outside the Ops module.

giordano · 2025-03-03T19:55:04Z

one thing to note is that methods in Ops should replicate the semantics of StableHLO (and possibly other MLIR dialects), not Julia.

Ah, yes, that's a good point. This is not extending Base.reduce, so that's fine (although the same name is slightly confusing 😅).

mofeing · 2025-03-03T21:56:58Z

... (although the same name is slightly confusing 😅).

yep 😅, but it's because it wraps the stablehlo.reduce op. as a rule of thumb, you should not import Ops but call them explicitly like Ops.reduce to avoid confusion.

src/Ops.jl

avik-pal · 2025-03-03T22:31:20Z

src/Ops.jl

@@ -2313,4 +2313,50 @@ Produces a [`Reactant.MLIR.Dialects.sdy.sharding_constraint`](@ref) operation wi
    end
 end

+@noinline function reduce(


Can you follow up with a PR to update the mapreduce impl to call this function

Sure I'll do that

avik-pal · 2025-03-04T01:37:02Z

the cuda reduce tests are failing

tharittk · 2025-03-04T03:30:00Z

Based on my initial investigation, it seems to be about how the init_values is handled. In case of CPU - via Reactant.set_default_backend("cpu"), there is no issue.

But when using GPU, if the init_values is set to be 1 for * and 0 for +, for example, it will be OK. Other than that, it looks like the GPU applies init_values multiple times.

For example, if we start with
A = [1 3; 2 4;;; 5 7; 6 8;;; 9 11; 10 12]

with init_values = 2, and dim = [3] then
we have [(15+2) (21+2); (18+2) (24+2)] = [17 23; 20 26] -- both CPU and GPU version agree

with init_values = 2, and dim = [1, 3] then
We should have [(33 + 2) (45+2)] -- init_value only applied once (this is what Julia's reduce() gives also)
but with stablehlo + GPU, we get [(17+20) (23+26)] -- effectively init_values is applied twice

I am not sure if this is a sematic issue or something else.

What I think happens is stablehlo broadcasts the init_values at the beginning and do reduce while Julia's reduce does the reduce and then broadcasts the init_values to whatever size is after reduce at the end

tharittk · 2025-03-04T14:16:05Z

Ok I was not reading the specification carefully. Here what it says

Semantics
Applies a reduction function body to inputs and init_values along the dimensions and produces results tensors.
The order of reductions is implementation-defined, which means that body and init_values must form a monoid to
guarantee that the operation produces the same results for all inputs on all implementations. However, this condition
doesn't hold for many popular reductions. E.g. floating-point addition for body and zero for init_values don't actually form
a monoid because floating-point addition is not associative.

So it is the word monoid. I honestly google that word and, to my understanding, it means the operation must be associative and the init_values has to be the identity of that operation to guarantee the same result across all implementation.

giordano · 2025-03-04T14:22:35Z

Here what it says

For the benefit of readers, this is the source: https://openxla.org/stablehlo/spec#semantics_71

avik-pal · 2025-03-05T21:32:02Z

Run JuliaFormatter on the code to fix the formatter ci

codecov · 2025-03-05T23:54:13Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 39.82%. Comparing base (b6ffc96) to head (a18e826).
Report is 538 commits behind head on main.

Additional details and impacted files

@@             Coverage Diff             @@
##             main     #840       +/-   ##
===========================================
+ Coverage   21.66%   39.82%   +18.15%     
===========================================
  Files          46      104       +58     
  Lines        8048    16919     +8871     
===========================================
+ Hits         1744     6738     +4994     
- Misses       6304    10181     +3877

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Pangoraw · 2025-03-06T08:27:52Z

Could you add a docstring explaining the difference with Julia's reduce?

tharittk · 2025-03-07T16:31:55Z

hmm.. the latest update from upstream branch seems to break this, specifically for macOS. Anyone has any suggestion?

src/Ops.jl

giordano reviewed Mar 3, 2025

View reviewed changes

src/Ops.jl Outdated Show resolved Hide resolved

avik-pal reviewed Mar 3, 2025

View reviewed changes

src/Ops.jl Outdated Show resolved Hide resolved

avik-pal reviewed Mar 3, 2025

View reviewed changes

avik-pal force-pushed the ops-reduce branch from 6fc44ef to b7ec728 Compare March 5, 2025 23:26

avik-pal reviewed Mar 7, 2025

View reviewed changes

src/Ops.jl Outdated Show resolved Hide resolved

tharittk and others added 6 commits March 7, 2025 21:59

adding reduce to Ops.jl

6c799bf

Update src/Ops.jl

b2b1e32

change Ops.reduce test case to reflect stablehlo semantics

7592eff

Run through formatter

a999a96

add docstring as comments suggest

85388b0

Update src/Ops.jl

45ebd5e

avik-pal force-pushed the ops-reduce branch from b14a1e1 to 45ebd5e Compare March 8, 2025 02:59

avik-pal approved these changes Mar 8, 2025

View reviewed changes

tharittk mentioned this pull request Mar 8, 2025

Shorten mapreduce by using Ops.reduce #858

Draft

avik-pal reviewed Mar 8, 2025

View reviewed changes

src/Ops.jl Outdated Show resolved Hide resolved

Update src/Ops.jl

a18e826

avik-pal merged commit 5f9d523 into EnzymeAD:main Mar 8, 2025
37 of 40 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

adding reduce to Ops.jl #840

adding reduce to Ops.jl #840

tharittk commented Mar 3, 2025

giordano left a comment

mofeing commented Mar 3, 2025

giordano commented Mar 3, 2025

mofeing commented Mar 3, 2025

avik-pal Mar 3, 2025

tharittk Mar 3, 2025

avik-pal commented Mar 4, 2025

tharittk commented Mar 4, 2025 •

edited

Loading

tharittk commented Mar 4, 2025 •

edited

Loading

giordano commented Mar 4, 2025

avik-pal commented Mar 5, 2025

codecov bot commented Mar 5, 2025 •

edited

Loading

Pangoraw commented Mar 6, 2025

tharittk commented Mar 7, 2025

adding reduce to Ops.jl #840

adding reduce to Ops.jl #840

Conversation

tharittk commented Mar 3, 2025

giordano left a comment

Choose a reason for hiding this comment

mofeing commented Mar 3, 2025

giordano commented Mar 3, 2025

mofeing commented Mar 3, 2025

avik-pal Mar 3, 2025

Choose a reason for hiding this comment

tharittk Mar 3, 2025

Choose a reason for hiding this comment

avik-pal commented Mar 4, 2025

tharittk commented Mar 4, 2025 • edited Loading

tharittk commented Mar 4, 2025 • edited Loading

giordano commented Mar 4, 2025

avik-pal commented Mar 5, 2025

codecov bot commented Mar 5, 2025 • edited Loading

Codecov Report

Pangoraw commented Mar 6, 2025

tharittk commented Mar 7, 2025

tharittk commented Mar 4, 2025 •

edited

Loading

tharittk commented Mar 4, 2025 •

edited

Loading

codecov bot commented Mar 5, 2025 •

edited

Loading