You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Right now, when a given node fails to restore a batch, the restore ends with an error.
We should make it more robust so that:
failed batch can be re-tried by other nodes in the context of a single restore task execution
failed node can still work on other batches, but originating from different backup datacenters
The first one is self-explanatory.
The second one comes from the fact that the failure might be dc-related. An example of such failure could be seen in #3871. This approach also reduces the amount of failed batches when the source of the problem is node related.
The text was updated successfully, but these errors were encountered:
Right now, when a given node fails to restore a batch, the restore ends with an error.
We should make it more robust so that:
The first one is self-explanatory.
The second one comes from the fact that the failure might be dc-related. An example of such failure could be seen in #3871. This approach also reduces the amount of failed batches when the source of the problem is node related.
The text was updated successfully, but these errors were encountered: