Replies: 1 comment
-
Hi. Sorry, seems like we missed your question. Good news is, your problem looks exactly like what we have been debugging over the last few days and have a PR in review: #3631. I'm pretty sure this fix will address your problem. Best, |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
I have a 3 node cluster running with rabbitmq 3.8.19 and erlang 24.0.3 on ubuntu VMs.
cluster_partition_handling = pause_minority
is enabled on all nodes.There are 10k quues, but only 3 active connections.
If I reset the first node, I can see that all nodes are going down: Partial partition detected

In the past (Version 3.7?) only one node stopped if a partial partition was detected.
Now the two remaining nodes are stopping and everything is down for around two minutes.
The cluster recovers which is fine BUT additionally I can see that some queues are not synced anymore.
This can be solved by restarting every node once.
Example
Related error messages
Is there anything I can do to prevent the cluster downtime if one nodes goes down?
Config
used plugins:
rabbitmq_management,rabbitmq_shovel,rabbitmq_shovel_management,rabbitmq_auth_backend_http,rabbitmq_auth_backend_cache,rabbitmq_mqtt,rabbitmq_prometheus
rabbitmq.conf
ha policy (applied to all queues)
Beta Was this translation helpful? Give feedback.
All reactions