Fix for cut timestep size nonlinear solver strategy #6235

jdannberg · 2025-02-14T17:40:29Z

If the nonlinear solver fails in a timestep with mesh refinement and "cut timestep size" is selected as the Nonlinear solver failure strategy, ASPECT now correctly repeats the timestep first before refining the mesh. Before, ASPECT would execute refinement/coarsening of the mesh first, which could lead to changes of the mesh in each failure cycle.

I also added a test case: If you run it on main, it refines the mesh in timestep 1 before the cutback, so the solve fails again (because the mesh is now different) triggering a second cutback. With the current version, the solve succeeds after 1 cutback, refining the mesh and moving to timestep 2 afterwards.

Some background on this:
I tried out the "cut timestep size" nonlinear solver strategy and it made a model crash that otherwise simply had the nonlinear solver not converge in some steps. I looked for the reason, and it turned out the temperature in my model had increased by a constant everywhere (see image below). This was a way bigger change than between individual time steps. I don't know why changing the mesh first before repeating the time step would lead to the temperature suddenly having really weird values (the change in temperature had a linear relationship to the "Cut back factor"). But in any case, I think refining the mesh first is not what we want.

Screenshot showing temperature before (green) and after (red) cutback. This PR fixes the problem.

For new features/models or changes of existing features:

I have tested my new feature locally to ensure it is correct.
I have created a testcase for the new feature/benchmark in the tests/ directory.
I have added a changelog entry in the doc/modules/changes directory that will inform other users of my change.

danieldouglas92 · 2025-02-14T21:47:59Z

@jdannberg Yes this is great!! I ran into the same thing and made an issue #6049 many months ago but I completely forgot about it until I saw this. The temperature in my models increased by several hundred degrees. I'll try running that model with your changes in this branch to confirm that it fixes my case as well.

danieldouglas92 · 2025-02-14T22:15:15Z

It did fix my model as well!!! So happy you found the solution to this 🥳

jdannberg · 2025-02-17T13:03:45Z

@danieldouglas92 I totally missed your issue (I should pay attention to these more carefully), but yes, it looks like it's exactly the same problem I saw. Glad this fixed it!

gassmoeller

Thanks for figuring out the bug! This might be serious enough that we need a point release (a bug that is easy enough to trigger and that changes the solution without crashing is dangerous!)

However, I have one wrinkle to this solution: We have one reaction that is called refine_and_repeat_step, I think this should then refine the mesh and then repeat the time step on solver failure. With the change you did here would that reaction still work? It seems to me we would then skip the refinement altogether and just repeat the time step. Maybe we need to split the fix as follows:

        if (time_stepping_manager.should_refine_mesh())
          {
            pcout << "Refining the mesh based on the time stepping manager ...\n" << std::endl;
            refine_mesh(max_refinement_level);
          }

        if (time_stepping_manager.should_repeat_time_step())
          {
...
 }
        if (time_stepping_manager.should_refine_mesh() == false)
          maybe_refine_mesh(new_time_step_size, max_refinement_level);

So that we split the regular refinement from the refinement for time-stepping reasons.
Did I understand this correctly?

jdannberg · 2025-02-17T21:53:02Z

This does make sense to me. But I did not see a test or a place where either the refine_and_repeat_step or the refine_and_advance reactions are being used.

tjhei

Thank you. I think the logic makes sense. 👍🏻

jdannberg requested a review from tjhei February 17, 2025 13:09

gassmoeller reviewed Feb 17, 2025

View reviewed changes

tjhei approved these changes Feb 17, 2025

View reviewed changes

jdannberg added 2 commits February 19, 2025 14:12

do not refine the mesh if we repeat the timestep

3e8e430

add test and changelog

acfd813

jdannberg force-pushed the failure_cut_timestep_fix branch from 280170e to acfd813 Compare February 19, 2025 13:13

gassmoeller mentioned this pull request Feb 20, 2025

Fix Sphinx warnings #6239

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix for cut timestep size nonlinear solver strategy #6235

Fix for cut timestep size nonlinear solver strategy #6235

jdannberg commented Feb 14, 2025

danieldouglas92 commented Feb 14, 2025

danieldouglas92 commented Feb 14, 2025

jdannberg commented Feb 17, 2025

gassmoeller left a comment

jdannberg commented Feb 17, 2025

tjhei left a comment

Fix for cut timestep size nonlinear solver strategy #6235

Are you sure you want to change the base?

Fix for cut timestep size nonlinear solver strategy #6235

Conversation

jdannberg commented Feb 14, 2025

For new features/models or changes of existing features:

danieldouglas92 commented Feb 14, 2025

danieldouglas92 commented Feb 14, 2025

jdannberg commented Feb 17, 2025

gassmoeller left a comment

Choose a reason for hiding this comment

jdannberg commented Feb 17, 2025

tjhei left a comment

Choose a reason for hiding this comment