3主3从正常情况,一个机器上2个mastr同时挂掉后,集群故障

3主3从正常情况:

192.168.137.2:7000> CLUSTER nodes
9809b72ec290d73d99a3e1b0d12c4c7bf8583c45 192.168.137.3:7003@17003 slave 1b83e27acd5235726aea44702526a8ca0ede9a48 0 1583515333281 8 connected
1b83e27acd5235726aea44702526a8ca0ede9a48 192.168.137.2:7000@17000 myself,master - 0 1583515329000 8 connected 0-5460
bf0edaba80c4f31e9b56101572d2a5ccc8aa145c 192.168.137.4:7005@17005 slave 191d7306b81ffa85b5837898562eb6bf1479122c 0 1583515331249 6 connected
a7287834bc7db37249614d23e06ed8f9a6c7b3d3 192.168.137.4:7004@17004 master - 0 1583515331754 14 connected 10923-16383
3c6510bd29af80703ae7c0be5a5884caaa60cd4e 192.168.137.2:7001@17001 slave a7287834bc7db37249614d23e06ed8f9a6c7b3d3 0 1583515331751 14 connected
191d7306b81ffa85b5837898562eb6bf1479122c 192.168.137.3:7002@17002 master - 0 1583515332254 3 connected 5461-10922


192.168.137.2:7000> CLUSTER info
cluster_state:ok

干掉这个master:192.168.137.2:7000@17000 myself,master -
ps node1:/root/cluster/7001#ps -ef | grep 7000
root      2130     1  1 01:20 ?        00:00:05 redis-server 192.168.137.2:7000 [cluster]
root      2179  2147  0 01:21 pts/1    00:00:00 redis-cli -h 192.168.137.2 -p 7000 -c
root      2192  2099  0 01:27 pts/0    00:00:00 grep 7000
node1:/root/cluster/7001#kill -9 2130
node1:/root/cluster/7001#ps -ef | grep 7000
root      2179  2147  0 01:21 pts/1    00:00:00 redis-cli -h 192.168.137.2 -p 7000 -c
root      2194  2099  0 01:27 pts/0    00:00:00 grep 7000



192.168.137.2:7001> CLUSTER nodes
1b83e27acd5235726aea44702526a8ca0ede9a48 192.168.137.2:7000@17000 master,fail - 1583515657890 1583515656073 8 disconnected
191d7306b81ffa85b5837898562eb6bf1479122c 192.168.137.3:7002@17002 master - 0 1583515745001 3 connected 5461-10922
a7287834bc7db37249614d23e06ed8f9a6c7b3d3 192.168.137.4:7004@17004 master - 0 1583515743764 14 connected 10923-16383
bf0edaba80c4f31e9b56101572d2a5ccc8aa145c 192.168.137.4:7005@17005 slave 191d7306b81ffa85b5837898562eb6bf1479122c 0 1583515744884 6 connected
3c6510bd29af80703ae7c0be5a5884caaa60cd4e 192.168.137.2:7001@17001 myself,slave a7287834bc7db37249614d23e06ed8f9a6c7b3d3 0 1583515656000 13 connected
9809b72ec290d73d99a3e1b0d12c4c7bf8583c45 192.168.137.3:7003@17003 master - 0 1583515744452 15 connected 0-5460
192.168.137.2:7001> CLUSTER info
cluster_state:ok

此时集群正常切换:


启动192.168.137.2:7000 端口

192.168.137.2:7001> CLUSTER nodes
1b83e27acd5235726aea44702526a8ca0ede9a48 192.168.137.2:7000@17000 slave 9809b72ec290d73d99a3e1b0d12c4c7bf8583c45 0 1583515819258 15 connected
191d7306b81ffa85b5837898562eb6bf1479122c 192.168.137.3:7002@17002 master - 0 1583515817239 3 connected 5461-10922
a7287834bc7db37249614d23e06ed8f9a6c7b3d3 192.168.137.4:7004@17004 master - 0 1583515819258 14 connected 10923-16383
bf0edaba80c4f31e9b56101572d2a5ccc8aa145c 192.168.137.4:7005@17005 slave 191d7306b81ffa85b5837898562eb6bf1479122c 0 1583515818245 6 connected
3c6510bd29af80703ae7c0be5a5884caaa60cd4e 192.168.137.2:7001@17001 myself,slave a7287834bc7db37249614d23e06ed8f9a6c7b3d3 0 1583515818000 13 connected
9809b72ec290d73d99a3e1b0d12c4c7bf8583c45 192.168.137.3:7003@17003 master - 0 1583515817747 15 connected 0-5460
192.168.137.2:7001> CLUSTER info
cluster_state:ok

此时192.168.137.3 上面有两个master ,重启主机


192.168.137.4:7004> CLUSTER nodes
9809b72ec290d73d99a3e1b0d12c4c7bf8583c45 192.168.137.3:7003@17003 master,fail? - 1583519405087 1583519402552 15 disconnected 0-5460
3c6510bd29af80703ae7c0be5a5884caaa60cd4e 192.168.137.2:7001@17001 slave a7287834bc7db37249614d23e06ed8f9a6c7b3d3 0 1583519441000 14 connected
bf0edaba80c4f31e9b56101572d2a5ccc8aa145c 192.168.137.4:7005@17005 slave 191d7306b81ffa85b5837898562eb6bf1479122c 0 1583519442529 6 connected
1b83e27acd5235726aea44702526a8ca0ede9a48 192.168.137.2:7000@17000 slave 9809b72ec290d73d99a3e1b0d12c4c7bf8583c45 0 1583519443039 15 connected
a7287834bc7db37249614d23e06ed8f9a6c7b3d3 192.168.137.4:7004@17004 myself,master - 0 1583519440000 14 connected 10923-16383
191d7306b81ffa85b5837898562eb6bf1479122c 192.168.137.3:7002@17002 master,fail? - 1583519405190 1583519402653 3 disconnected 5461-10922
192.168.137.4:7004> CLUSTER info
cluster_state:fail
原文地址:https://www.cnblogs.com/hzcya1995/p/13348522.html