Could not discover and join another node. Proceeding as a standalone node - wait a minute![Suggestion] #13257
-
RabbitMQ series4.0.x Operating system (distribution) used*nix How is RabbitMQ deployed?Community Docker image What would you like to suggest for a future version of RabbitMQ?Hey, Lets say we have a situation where we have a cluster of 3 nodes, A, B and C, that is up and running, and clustered with peer discovery. I wonder if there has been any thoughts about having configuration to not allowing a node to start if it fails to join a configured cluster using peer discovery, and after the 30 (default) retries, stop the node from starting at all, to avoid a situation where we have '2' clusters started, instead of 1? |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 10 replies
-
@SimonUnge it was discussed. There wasn't much opposition to it but then the focus has shifted to #13050 and this particular change went nowhere. |
Beta Was this translation helpful? Give feedback.
-
@michaelklishin Oh ok, interesting. So, basically the default would now be unlimited retires? This might actually be 'good enough', as one should have mechanisms that notices that a node fails to come up and the logs would give us the heads up on why. If my understanding is correct, I think we can close this one as answered :) |
Beta Was this translation helpful? Give feedback.
We discussed this today and agree that:
Moving this behaviour form Mnesia->Khepri migration to RabbitMQ directly in a way which would cover both Mnesia and Khepri-enabled clusters would be the best approach. Right now it's the khepri migration code (which also executes on Khepri startup) that reconfigures the remote nodes, effectively making them forget the node and allowing it to join as a new node.