
Allow empty partitions in parallel runs #162

Open · wants to merge 9 commits into main
Conversation

@jrper (Contributor) commented Aug 7, 2017

I'm raising this pull request slightly early in the hope of building a consensus on the preferred behaviour and testing method for continuing with empty partitions after Zoltan-driven adaptivity in a Fluidity run.

This changeset allows the code to continue with an empty partition, with the previous exit messages becoming warnings visible in the log file. If a later adapt reactivates the empty partition, this should happen transparently.

@tmbgreaves Could I get a buildbot branch for this, since it's liable to interact with several other features.

@tmbgreaves (Contributor)

@jrper (Contributor, Author) commented Aug 22, 2017

I'm finally confident that this will go green on buildbot, while actually testing the new functionality, so I'm tagging @stephankramer for a formal review.

@stephankramer (Contributor)

Coming back to this, as this is indeed a very useful addition:

I think we may have discussed this previously - the code all looks very good, but I would like some clarification as to what the approach is here, as I can't easily work it out from the changes. It seems to introduce a new communicator, MPI_COMM_NONEMPTY, but it is unclear to me when this should be used, since MPI_COMM_FEMTOOLS is also still around in many places. So from a developer's perspective it's unclear which one to use.

Taking a step back to the original problem: we go into Zoltan and it comes back with a partition in which some processes receive 0 nodes/elements. Naively there should be nothing wrong with that: the empty process just goes through the normal code path with all its loops reduced to zero iterations, allocating lots of zero-length arrays, and in MPI communication just contributing a zero-length array, or providing 0. when we're summing up. I'm guessing, but please correct me if I'm wrong, that the main issue is where the code tries to short-cut subroutines that should be collective?
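(For illustration only, a minimal sketch in C/MPI of the "naive" path described above, not the actual Fluidity/femtools Fortran; the names gather_all_ids, local_ids and local_n are invented for this example.)

```c
/* Sketch: a rank that owns zero entries still enters the collective, it just
 * contributes a zero-length array. The danger is an early return upstream
 * that means the empty rank never reaches the call. */
#include <mpi.h>
#include <stdlib.h>

void gather_all_ids(const int *local_ids, int local_n, MPI_Comm comm,
                    int **all_ids_out, int *total_n_out)
{
    int nprocs;
    MPI_Comm_size(comm, &nprocs);

    /* Every rank reports its count; an empty rank simply reports 0. */
    int *counts = malloc(nprocs * sizeof(int));
    MPI_Allgather(&local_n, 1, MPI_INT, counts, 1, MPI_INT, comm);

    int *displs = malloc(nprocs * sizeof(int));
    int total = 0;
    for (int p = 0; p < nprocs; ++p) { displs[p] = total; total += counts[p]; }

    int *all_ids = malloc((total > 0 ? total : 1) * sizeof(int));
    /* A zero-length contribution is legal MPI; skipping this collective on an
     * empty rank is what deadlocks everyone else. */
    MPI_Allgatherv(local_ids, local_n, MPI_INT,
                   all_ids, counts, displs, MPI_INT, comm);

    free(counts);
    free(displs);
    *all_ids_out = all_ids;
    *total_n_out = total;
}
```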

Now in your approach, if I understand correctly, you create an MPI_COMM_NONEMPTY in which presumably only the non-empty processes take part. You then have to be careful that this gives a different process numbering, which I have indeed seen being addressed in the code. My question, though, is: what do the empty process(es) do at this point? Presumably they have to sit back and do nothing, as they can't take part in MPI_COMM_NONEMPTY communication, but how does that gel with the many places where MPI_COMM_FEMTOOLS is still being used, which presumably they do need to be around for?

As you'll notice there are quite a few "presumably"s in my comment - and I quite likely have completely the wrong end of the stick - so it would be great if you could clarify.

@jrper (Contributor, Author) commented Feb 13, 2020

I'm addressing @stephankramer's comments here, so we can sort out what comments need to be added to the code:

This was done a long time ago now, but in general all your suppositions are correct, or correctish.

If one simply turns the "you have empty partitions" error into a warning, the normal symptom is a block at the first collective communication that needs to happen after an adapt. At first I just tried patching the scalar calls to do sensible things (e.g. have allmax return a large-magnitude negative number on empty partitions), but I eventually realised that 1) many of the difficulties were in generating the logic for whether a collective call should happen at all, 2) in several places the actual FE code implicitly assumes that the node count isn't zero, leading to division-by-zero errors and bad allocations, deallocations and initialisations, and 3) this was going to involve a trawl through the entire code base to check that the first two problems were fixed everywhere.
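(As an illustration of the "patch the scalar calls" idea in point 1, a hedged sketch in C/MPI rather than the real femtools Fortran interface; allmax_with_empty is an invented name.)

```c
/* Sketch: an allmax-style reduction where an empty partition contributes a
 * large-magnitude negative value, so it can never win the MPI_MAX but still
 * takes part in the collective. */
#include <mpi.h>
#include <float.h>

double allmax_with_empty(const double *vals, int n, MPI_Comm comm)
{
    double local = -DBL_MAX;            /* harmless contribution from an empty rank */
    for (int i = 0; i < n; ++i)
        if (vals[i] > local) local = vals[i];

    double result;
    MPI_Allreduce(&local, &result, 1, MPI_DOUBLE, MPI_MAX, comm);
    return result;
}
```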

As such, in the interests of expediency I took the mathematician's way out and reduced things to a problem we'd already solved, patching the Zoltan routines to create a new MPI communicator over the non-empty processes. This is MPI_COMM_NONEMPTY, which as far as I remember should be used for all the finite element/static-mesh calls.
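(A minimal sketch of the communicator-splitting idea, again in C/MPI for illustration and not the actual Zoltan-integration code; make_nonempty_comm and owned_nodes are invented names.)

```c
/* Sketch: ranks with zero owned nodes pass MPI_UNDEFINED and get back
 * MPI_COMM_NULL, so only non-empty ranks belong to the new communicator
 * (called MPI_COMM_NONEMPTY in this changeset). */
#include <mpi.h>

MPI_Comm make_nonempty_comm(MPI_Comm femtools_comm, int owned_nodes)
{
    int world_rank;
    MPI_Comm_rank(femtools_comm, &world_rank);

    int colour = (owned_nodes > 0) ? 0 : MPI_UNDEFINED;
    MPI_Comm nonempty;
    /* Using world_rank as the key preserves the relative ordering of the
     * surviving ranks, but their rank numbers still change (rank 3 of
     * femtools_comm may become rank 2 of nonempty), which is the renumbering
     * the changeset has to account for. */
    MPI_Comm_split(femtools_comm, colour, world_rank, &nonempty);

    return nonempty;   /* MPI_COMM_NULL on empty ranks */
}
```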

The pre-existing MPI_COMM_FEMTOOLS (which in practice is just an alias for MPI_COMM_WORLD, although I acknowledge there are reasonable reasons not to bake in that assumption) remains for the next Zoltan balancing cycle, in the hope that as the mesh adapts Zoltan will start returning a decomposition that uses the full computational power available to it.

I freely admit that after 18 months it's probable that I'm forgetting subtleties in the implementation, and it's possible that there are hidden bugs in the code sections I didn't fully explore. @gnikit has indicated he's willing to devote effort to getting this merged, so there is man-power for cleaning things up if this summary is enough for you to formulate changes.
