
Possible performance issue in MPolyAnyMap / ring flattening #4112

Open

fingolfin opened this issue Sep 19, 2024 · 2 comments
Labels: optimization (Simpler/more performant code or more/better tests)

Comments


fingolfin commented Sep 19, 2024

During the "Julia performance tuning crash course" I gave at this week's OSCAR workshop, we looked at some of the "big" test suites in our test setup, those which use 60 GB of RAM and more, and found various sources of the high RAM usage.

One culprit that was prominent across the board was the evaluate function for MPolyAnyMap, which is used e.g. for ring flattening.

The problem: it seems that images are constructed using + and ^. So if you map e.g. x^2*y + 5*y^3 to an isomorphic polynomial ring, it gradually builds x^2, x^2*y, y^3, 5*y^3 and finally x^2*y + 5*y^3, i.e. a ton of intermediate objects.

Of course in general, this is the best we can do.

But here the codomain is a polynomial ring.
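
For illustration, here is a minimal way to reproduce the pattern, assuming an Oscar session; the rings, variable names and the particular hom are made up for this example:

```julia
# A sketch, not Oscar internals: construct an MPolyAnyMap between two
# isomorphic polynomial rings and apply it to a small polynomial.
using Oscar

R, (x, y) = polynomial_ring(QQ, [:x, :y])
S, (X, Y) = polynomial_ring(QQ, [:X, :Y])

phi = hom(R, S, [X, Y])        # an MPolyAnyMap
@time phi(x^2*y + 5*y^3)       # image is assembled via + and ^, term by term
```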

There are multiple ways I can think of to address this, not necessarily mutually exclusive, e.g.:

  1. We could perhaps add a special case to MPolyAnyMap for when the codomain is an MPolyRing? Ideally it would then use (the equivalent of) an MPolyBuildCtx.

  2. Perhaps we need additional kinds of maps that are optimized for certain setups?

  3. Perhaps the ring flattening code should simply not use MPolyAnyMap in the first place, but do its job differently?

  4. And perhaps we are not using MPolyAnyMap optimally in some cases?

  5. ... something else?

Here is a very simple way in which R = k[x...], S = R[y...] could be mapped more efficiently to U = k[x...,y...]: consider a term c * Y in S, where c is in R and Y is a monomial y^e with e=(e1,...,en). To map it, we can iterate over the terms of c, which have the form d * x^f for some d in k and exponent vector f. For each of these terms, we just need to push (d, vcat(f,e)) into the MPolyBuildCtx (a sketch follows below).

Of course this only works if we map variables to variables, and if the variables are arranged in just this way. But it is my understanding that this is what we have in ring flattening situations anyway.

The process can also be adjusted to more than two "levels" (i.e. k[x...][y...][z...]) if necessary.
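
A minimal sketch of this idea, assuming Oscar/AbstractAlgebra and handling only the two-level case; flatten_terms is a hypothetical helper name, and it is assumed that the x-variables come before the y-variables in U:

```julia
using Oscar

# Hypothetical helper: map f in S = R[y...] with R = k[x...] into
# U = k[x..., y...], pushing one term per (x-term, y-monomial) pair.
function flatten_terms(f, U)
    B = MPolyBuildCtx(U)
    for (c, e) in zip(coefficients(f), exponent_vectors(f))
        # c lies in R = k[x...], e is the exponent vector of the y-monomial
        for (d, fv) in zip(coefficients(c), exponent_vectors(c))
            # d lies in k, fv is the exponent vector of the x-monomial
            push_term!(B, d, vcat(fv, e))
        end
    end
    return finish(B)
end

# Usage with made-up rings:
R, (x1, x2) = polynomial_ring(QQ, [:x1, :x2])
S, (y1,) = polynomial_ring(R, [:y1])
U, _ = polynomial_ring(QQ, [:x1, :x2, :y1])
flatten_terms(x1^2*y1 + S(5*x2^3), U)
```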

To go beyond MPolyRing codomains, some more thought is required, but I am confident we can do something.

fingolfin added the optimization label on Sep 19, 2024

thofma commented Sep 19, 2024

Just a few comments:

  1. Since MPolyAnyMap is just a fancy wrapper around evaluate, the problem should lie there. Indeed it seems to be the case that evaluate(f::MPolyRingElem, values::Vector) does not do clever things in case the target is a polynomial ring itself. Interestingly, partial evaluation evaluate(f::MPolyRingElem, variables::Vector{Int}, values::Vector) is optimized (a rough comparison is sketched after this list). It might be just a matter of fixing this.
  2. I agree that for the best performance, RingFlattening should roll its own evaluate.
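
For reference, a rough way to compare the two code paths from point 1, assuming an Oscar session; the rings are made up, and it is an assumption that the partial-evaluation method accepts values from a different polynomial ring:

```julia
using Oscar

R, (x, y) = polynomial_ring(QQ, [:x, :y])
U, (X, Y) = polynomial_ring(QQ, [:X, :Y])
f = (x + y + 1)^20

@time evaluate(f, [X, Y])           # generic full evaluation, as used by MPolyAnyMap
@time evaluate(f, [1, 2], [X, Y])   # partial evaluation with explicit variable indices
```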

Can you provide us with a simple example? I wonder how much one gains by fixing 1.

fingolfin (Member, Author) commented

This cropped up on Wednesday morning when analyzing some of the big examples in our test suite. I will try to reproduce it and post details.

It may have been during one of the testsets of

test/AlgebraicGeometry/Schemes/elliptic_surface.jl  (runs 480 seconds, allocates 122 GB)

By the way, a related issue affects the literature models (@HereAround is already looking into this, but is hampered by a broken laptop):

experimental/FTheoryTools/test/literature_models.jl  (runs 400 seconds, allocates 80 GB)
experimental/FTheoryTools/test/tate_models.jl  (runs 215 seconds, allocates 65 GB)

Here polynomials are loaded by parsing them via Meta.parse and then "evaluating" the resulting syntax tree. I think evaluate also shows up there, so this code would likely also benefit from an improved evaluate. But I think the real fix will be to not call evaluate at all; perhaps this code can switch to the serialization by @antonydellavecchia?
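
For context, a hedged sketch of that load pattern (not the actual FTheoryTools code): the string is parsed with Meta.parse and the resulting Expr is walked recursively, rebuilding the ring element with +, * and ^, which is exactly the allocation-heavy pattern discussed above.

```julia
using Oscar

R, (x, y) = polynomial_ring(QQ, [:x, :y])
vars = Dict(:x => x, :y => y)

# Recursively turn a parsed expression into a polynomial in R.
function expr_to_poly(e, vars, R)
    e isa Symbol  && return vars[e]
    e isa Integer && return R(e)
    op, args = e.args[1], e.args[2:end]
    op == :+ && return sum(expr_to_poly(a, vars, R) for a in args)
    op == :* && return prod(expr_to_poly(a, vars, R) for a in args)
    op == :^ && return expr_to_poly(args[1], vars, R)^args[2]
    error("unsupported expression: $e")
end

expr_to_poly(Meta.parse("x^2*y + 5*y^3"), vars, R)
```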
