Skip to content

Conversation

dirkmueller
Copy link
Contributor

The failure timeout of 30s is far too low. Essentially it means that a
failed node is considered ready after 30s. Given that any start or
stop operation takes considerably more than 30s. We should only expire
failures after around 30 minutes to prevent flapping services.

The failure timeout of 30s is far too low. Essentially it means that a
failed node is considered ready after 30s. Given that any start or
stop operation takes considerably more than 30s. We should only expire
failures after around 30 minutes to prevent flapping services.
@JanZerebecki
Copy link
Contributor

Changed commit message to match regex checked by travis.
Sounds good to me, but I don't know the finer details of these timings, really needs someone with rabitmq clustering knowledge.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Development

Successfully merging this pull request may close these issues.

2 participants