Add re-tries to restore #4065

Michal-Leszczynski · 2024-10-09T07:56:13Z

Right now, when a given node fails to restore a batch, the restore ends with an error.
We should make it more robust so that:

failed batch can be re-tried by other nodes in the context of a single restore task execution
failed node can still work on other batches, but originating from different backup datacenters

The first one is self-explanatory.
The second one comes from the fact that the failure might be dc-related. An example of such failure could be seen in #3871. This approach also reduces the amount of failed batches when the source of the problem is node related.

Now, if batch restoration failed on one node, it can still be retried by other nodes. Failed node is no longer used for the restore. Fixes #4065

Michal-Leszczynski added the restore label Oct 9, 2024

Michal-Leszczynski self-assigned this Oct 9, 2024

Michal-Leszczynski added this to the 3.4 milestone Oct 9, 2024

Michal-Leszczynski added a commit that referenced this issue Oct 16, 2024

feat(restore): make batches retryable

0feb1bc

Now, if batch restoration failed on one node, it can still be retried by other nodes. Failed node is no longer used for the restore. Fixes #4065

Michal-Leszczynski added a commit that referenced this issue Oct 16, 2024

feat(restore): make batches retryable

53a05e9

Now, if batch restoration failed on one node, it can still be retried by other nodes. Failed node is no longer used for the restore. Fixes #4065

Michal-Leszczynski mentioned this issue Oct 16, 2024

Restore improvement: batch retry #4071

Merged

Michal-Leszczynski closed this as completed in #4071 Oct 22, 2024

Michal-Leszczynski closed this as completed in 7ab9235 Oct 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add re-tries to restore #4065

Add re-tries to restore #4065

Michal-Leszczynski commented Oct 9, 2024

Add re-tries to restore #4065

Add re-tries to restore #4065

Comments

Michal-Leszczynski commented Oct 9, 2024