On 30 Dec 2022, at 15:14, Nathan Angelacos nangel@tetrasec.net wrote:
On Fri, 2022-12-30 at 14:26 +0100, Olle E. Johansson wrote:
Dear Kamailians!
I’m trying to figure out DMQ and see how it reacts to server outages and downtime - the base protocol.
<snip>
Same here. The best I could figure is "it works when it works, until it doesn't" In our tests, the "edge cases" got weird.
Anyone who wants to explain how DMQ works in failure scenarios - there are two of us who are interested!
There’s a core protocol that updates each node about node status - active, pending, disabled, timeout - according to the source code. I haven’t seen timeout in my tests.
On top of that there’s other implementations using this - htable and usrloc are two examples - and each app have implemented their own protocol on top of the DMQ bus.
I’m trying to focus on the core protocol at this point.
/O