Create RepairTask Abstract Class to Handle Repair Operations #717
Labels
enhancement
New feature or request
PoC/Agent
Tasks related to new generation of ecchronos as an agent
Story Description:
We need to create an abstract class RepairTask that will handle the logic of executing repair operations within a distributed system. This class will manage the repair process through a combination of JMX proxy interactions, progress notifications, and failure handling.
Acceptance Criteria:
[Acceptance Criterion 1]: The execute() method should start the repair task, connect to the Cassandra node via JMX, and handle both successful and failed repairs. On failure, a ScheduledJobException should be thrown and the appropriate failure metrics should be recorded.
[Acceptance Criterion 2]: The task should handle notification updates via JMX, including progress updates, lost notifications, and task termination upon completion or error.
[Acceptance Criterion 3]: The repair task should prevent hangs by periodically checking the status of the Cassandra node. If the node becomes unresponsive or the repair task hangs for too long, it should attempt to terminate the session, abort the task, and update the nodes_sync table if the JMX connection is unavailable.
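The three criteria above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the intended ecChronos design: `RepairProxy`, `RepairListener`, `FakeProxy`, `LoggingRepairTask`, the hook names, and the idle-check thresholds are all hypothetical, and `ScheduledJobException` is redefined locally as a stand-in for the project's own class. A real implementation would translate JMX notifications into the listener callbacks and would probably verify the repair status when notifications are lost, rather than leaving that to the hang check as done here.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical stand-in for the exception type named in the issue.
class ScheduledJobException extends Exception {
    ScheduledJobException(String message) { super(message); }
}

// Hypothetical callbacks that a JMX notification listener would be mapped to.
interface RepairListener {
    void onProgress(int percent);
    void onNotificationsLost();
    void onComplete();
    void onError(String message);
}

// Hypothetical abstraction over the JMX proxy; not the real ecChronos interface.
interface RepairProxy {
    void startRepair(RepairListener listener);
    boolean isNodeResponsive();
    void forceTerminateRepair();
}

abstract class RepairTask implements RepairListener {
    private final RepairProxy proxy;
    private final long checkIntervalMs;
    private final int maxIdleChecks;
    private final CountDownLatch done = new CountDownLatch(1);
    private final AtomicBoolean progressed = new AtomicBoolean();
    private volatile String failure;

    protected RepairTask(RepairProxy proxy, long checkIntervalMs, int maxIdleChecks) {
        this.proxy = proxy;
        this.checkIntervalMs = checkIntervalMs;
        this.maxIdleChecks = maxIdleChecks;
    }

    // Hooks for subclasses, e.g. metrics recording or a nodes_sync update.
    protected abstract void onFinish();
    protected abstract void onFailed(String reason);
    protected abstract void onNodeUnavailable();

    public final void execute() throws ScheduledJobException {
        proxy.startRepair(this);
        try {
            int idleChecks = 0;
            // Wake up periodically so a silent or dead node cannot hang the task.
            while (!done.await(checkIntervalMs, TimeUnit.MILLISECONDS)) {
                boolean responsive = proxy.isNodeResponsive();
                idleChecks = progressed.getAndSet(false) ? 0 : idleChecks + 1;
                if (!responsive || idleChecks >= maxIdleChecks) {
                    proxy.forceTerminateRepair(); // best-effort termination
                    if (!responsive) { onNodeUnavailable(); }
                    failure = responsive ? "repair hung" : "node unresponsive";
                    break;
                }
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            failure = "interrupted";
        }
        if (failure != null) {
            onFailed(failure);
            throw new ScheduledJobException(failure);
        }
        onFinish();
    }

    @Override public void onProgress(int percent) { progressed.set(true); }
    // Assumption: a lost notification is not counted as progress, so the
    // hang check in execute() decides whether the task is aborted.
    @Override public void onNotificationsLost() { }
    @Override public void onComplete() { done.countDown(); }
    @Override public void onError(String message) { failure = message; done.countDown(); }
}

// Demo-only fake proxy: completes at once, or stays silent to simulate a hang.
class FakeProxy implements RepairProxy {
    final boolean silent;
    final AtomicInteger terminations = new AtomicInteger();
    FakeProxy(boolean silent) { this.silent = silent; }
    @Override public void startRepair(RepairListener l) {
        if (!silent) { l.onProgress(100); l.onComplete(); }
    }
    @Override public boolean isNodeResponsive() { return true; }
    @Override public void forceTerminateRepair() { terminations.incrementAndGet(); }
}

// Demo-only subclass: checks every 10 ms and gives up after 3 idle checks.
class LoggingRepairTask extends RepairTask {
    LoggingRepairTask(RepairProxy proxy) { super(proxy, 10, 3); }
    @Override protected void onFinish() { System.out.println("repair finished"); }
    @Override protected void onFailed(String reason) { System.out.println("repair failed: " + reason); }
    @Override protected void onNodeUnavailable() { System.out.println("would update nodes_sync"); }
}
```

Calling `new LoggingRepairTask(new FakeProxy(false)).execute()` runs the success path, while passing `new FakeProxy(true)` exercises the hang check and ends in a `ScheduledJobException`. Keeping `execute()` final and exposing only hooks is one possible way to keep the failure handling in a single place; the actual class may split responsibilities differently.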
Definition of Done:
Notes:
Depends on #716 and #715
Related to #652