亚洲香蕉成人av网站在线观看_欧美精品成人91久久久久久久_久久久久久久久久久亚洲_热久久视久久精品18亚洲精品_国产精自产拍久久久久久_亚洲色图国产精品_91精品国产网站_中文字幕欧美日韩精品_国产精品久久久久久亚洲调教_国产精品久久一区_性夜试看影院91社区_97在线观看视频国产_68精品久久久久久欧美_欧美精品在线观看_国产精品一区二区久久精品_欧美老女人bb

首頁 > 數據庫 > Redis > 正文

詳細分析Redis集群故障

2020-10-28 21:35:21
字體:
來源:轉載
供稿:網友

故障表象:

業務層面顯示提示查詢redis失敗

集群組成:

3主3從,每個節點的數據有8GB

機器分布:

在同一個機架中,

xx.x.xxx.199
xx.x.xxx.200
xx.x.xxx.201

redis-server進程狀態:

通過命令ps -eo pid,lstart | grep $pid,

發現進程已經持續運行了3個月

發生故障前集群的節點狀態:

xx.x.xxx.200:8371(bedab2c537fe94f8c0363ac4ae97d56832316e65) master
xx.x.xxx.199:8373(792020fe66c00ae56e27cd7a048ba6bb2b67adb6) slave
xx.x.xxx.201:8375(5ab4f85306da6d633e4834b4d3327f45af02171b) master
xx.x.xxx.201:8372(826607654f5ec81c3756a4a21f357e644efe605a) slave
xx.x.xxx.199:8370(462cadcb41e635d460425430d318f2fe464665c5) master
xx.x.xxx.200:8374(1238085b578390f3c8efa30824fd9a4baba10ddf) slave

---------------------------------下面是日志分析--------------------------------------

步1:
主節點8371失去和從節點8373的連接:
46590:M 09 Sep 18:57:51.379 # Connection with slave xx.x.xxx.199:8373 lost.

步2:
主節點8370/8375判定8371失聯:
42645:M 09 Sep 18:57:50.117 * Marking node bedab2c537fe94f8c0363ac4ae97d56832316e65 as failing (quorum reached).

步3:
從節點8372/8373/8374收到主節點8375說8371失聯:
46986:S 09 Sep 18:57:50.120 * FAIL message received from 5ab4f85306da6d633e4834b4d3327f45af02171b about bedab2c537fe94f8c0363ac4ae97d56832316e65

步4:
主節點8370/8375授權8373升級為主節點轉移:
42645:M 09 Sep 18:57:51.055 # Failover auth granted to 792020fe66c00ae56e27cd7a048ba6bb2b67adb6 for epoch 16

步5:
原主節點8371修改自己的配置,成為8373的從節點:
46590:M 09 Sep 18:57:51.488 # Configuration change detected. Reconfiguring myself as a replica of 792020fe66c00ae56e27cd7a048ba6bb2b67adb6

步6:
主節點8370/8375/8373明確8371失敗狀態:
42645:M 09 Sep 18:57:51.522 * Clear FAIL state for node bedab2c537fe94f8c0363ac4ae97d56832316e65: master without slots is reachable again.

步7:
新從節點8371開始從新主節點8373,第一次全量同步數據:
8373日志::
4255:M 09 Sep 18:57:51.906 * Full resync requested by slave xx.x.xxx.200:8371
4255:M 09 Sep 18:57:51.906 * Starting BGSAVE for SYNC with target: disk
4255:M 09 Sep 18:57:51.941 * Background saving started by pid 5230
8371日志::
46590:S 09 Sep 18:57:51.948 * Full resync from master: d7751c4ebf1e63d3baebea1ed409e0e7243a4423:440721826993

步8:
主節點8370/8375判定8373(新主)失聯:
42645:M 09 Sep 18:58:00.320 * Marking node 792020fe66c00ae56e27cd7a048ba6bb2b67adb6 as failing (quorum reached).

步9:
主節點8370/8375判定8373(新主)恢復:
60295:M 09 Sep 18:58:18.181 * Clear FAIL state for node 792020fe66c00ae56e27cd7a048ba6bb2b67adb6: is reachable again and nobody is serving its slots after some time.

步10:
主節點8373完成全量同步所需要的BGSAVE操作:
5230:C 09 Sep 18:59:01.474 * DB saved on disk
5230:C 09 Sep 18:59:01.491 * RDB: 7112 MB of memory used by copy-on-write
4255:M 09 Sep 18:59:01.877 * Background saving terminated with success

步11:
從節點8371開始從主節點8373接收到數據:
46590:S 09 Sep 18:59:02.263 * MASTER <-> SLAVE sync: receiving 2657606930 bytes from master

步12:
主節點8373發現從節點8371對output buffer作了限制:
4255:M 09 Sep 19:00:19.014 # Client id=14259015 addr=xx.x.xxx.200:21772 fd=844 name= age=148 idle=148 flags=S db=0 sub=0 psub=0 multi=-1 qbuf=0 qbuf-free=0 obl=16349 oll=4103 omem=95944066 events=rw cmd=psync scheduled to be closed ASAP for overcoming of output buffer limits.
4255:M 09 Sep 19:00:19.015 # Connection with slave xx.x.xxx.200:8371 lost.

步13:
從節點8371從主節點8373同步數據失敗,連接斷了,第一次全量同步失?。?br />46590:S 09 Sep 19:00:19.018 # I/O error trying to sync with MASTER: connection lost
46590:S 09 Sep 19:00:20.102 * Connecting to MASTER xx.x.xxx.199:8373
46590:S 09 Sep 19:00:20.102 * MASTER <-> SLAVE sync started

步14:
從節點8371重新開始同步,連接失敗,主節點8373的連接數滿了:
46590:S 09 Sep 19:00:21.103 * Connecting to MASTER xx.x.xxx.199:8373
46590:S 09 Sep 19:00:21.103 * MASTER <-> SLAVE sync started
46590:S 09 Sep 19:00:21.104 * Non blocking connect for SYNC fired the event.
46590:S 09 Sep 19:00:21.104 # Error reply to PING from master: '-ERR max number of clients reached'

步15:
從節點8371重新連上主節點8373,第二次開始全量同步:
8371日志:
46590:S 09 Sep 19:00:49.175 * Connecting to MASTER xx.x.xxx.199:8373
46590:S 09 Sep 19:00:49.175 * MASTER <-> SLAVE sync started
46590:S 09 Sep 19:00:49.175 * Non blocking connect for SYNC fired the event.
46590:S 09 Sep 19:00:49.176 * Master replied to PING, replication can continue...
46590:S 09 Sep 19:00:49.179 * Partial resynchronization not possible (no cached master)
46590:S 09 Sep 19:00:49.501 * Full resync from master: d7751c4ebf1e63d3baebea1ed409e0e7243a4423:440780763454
8373日志:
4255:M 09 Sep 19:00:49.176 * Slave xx.x.xxx.200:8371 asks for synchronization
4255:M 09 Sep 19:00:49.176 * Full resync requested by slave xx.x.xxx.200:8371
4255:M 09 Sep 19:00:49.176 * Starting BGSAVE for SYNC with target: disk
4255:M 09 Sep 19:00:49.498 * Background saving started by pid 18413
18413:C 09 Sep 19:01:52.466 * DB saved on disk
18413:C 09 Sep 19:01:52.620 * RDB: 2124 MB of memory used by copy-on-write
4255:M 09 Sep 19:01:53.186 * Background saving terminated with success

步16:
從節點8371同步數據成功,開始加載經內存:
46590:S 09 Sep 19:01:53.190 * MASTER <-> SLAVE sync: receiving 2637183250 bytes from master
46590:S 09 Sep 19:04:51.485 * MASTER <-> SLAVE sync: Flushing old data
46590:S 09 Sep 19:05:58.695 * MASTER <-> SLAVE sync: Loading DB in memory

步17:
集群恢復正常:
42645:M 09 Sep 19:05:58.786 * Clear FAIL state for node bedab2c537fe94f8c0363ac4ae97d56832316e65: slave is reachable again.

步18:
從節點8371同步數據成功,耗時7分鐘:
46590:S 09 Sep 19:08:19.303 * MASTER <-> SLAVE sync: Finished with success

8371失聯原因分析:

由于幾臺機器在同一個機架,不太可能發生網絡中斷的情況,于是通過SLOWLOG GET命令查看了慢查詢日志,發現有一個KEYS命令被執行了,耗時8.3秒,再查看集群節點超時設置,發現是5s(cluster-node-timeout 5000)

出現節點失聯的原因:

客戶端執行了耗時1條8.3s的命令,

2016/9/9 18:57:43 開始執行KEYS命令
2016/9/9 18:57:50 8371被判斷失聯(redis日志)
2016/9/9 18:57:51 執行完KEYS命令

總結來說,有以下幾個問題:

1.由于cluster-node-timeout設置比較短,慢查詢KEYS導致了集群判斷節點8371失聯

2.由于8371失聯,導致8373升級為主,開始主從同步

3.由于配置client-output-buffer-limit的限制,導致第一次全量同步失敗了

4.又由于PHP客戶端的連接池有問題,瘋狂連接服務器,產生了類似SYN攻擊的效果

5.第一次全量同步失敗后,從節點重連主節點花了30秒(超過了最大連接數1w)

關于client-output-buffer-limit參數:

# The syntax of every client-output-buffer-limit directive is the following: # # client-output-buffer-limit <class> <hard limit> <soft limit> <soft seconds> # # A client is immediately disconnected once the hard limit is reached, or if # the soft limit is reached and remains reached for the specified number of # seconds (continuously). # So for instance if the hard limit is 32 megabytes and the soft limit is # 16 megabytes / 10 seconds, the client will get disconnected immediately # if the size of the output buffers reach 32 megabytes, but will also get # disconnected if the client reaches 16 megabytes and continuously overcomes # the limit for 10 seconds. # # By default normal clients are not limited because they don't receive data # without asking (in a push way), but just after a request, so only # asynchronous clients may create a scenario where data is requested faster # than it can read. # # Instead there is a default limit for pubsub and slave clients, since # subscribers and slaves receive data in a push fashion. # # Both the hard or the soft limit can be disabled by setting them to zero. client-output-buffer-limit normal 0 0 0 client-output-buffer-limit slave 256mb 64mb 60 client-output-buffer-limit pubsub 32mb 8mb 60 

采取措施:

1.單實例的切割到4G以下,否則發生主從切換會耗時很長

2.調整client-output-buffer-limit參數,防止同步進行到一半失敗

3.調整cluster-node-timeout,不能少于15s

4.禁止任何耗時超過cluster-node-timeout的慢查詢,因為會導致主從切換

5.修復客戶端類似SYN攻擊的瘋狂連接方式

總結

以上就是本文關于詳細分析Redis集群故障的全部內容,希望對大家有所幫助。感興趣的朋友可以參閱:Spring AOP實現Redis緩存數據庫查詢源碼、簡述Redis和MySQL的區別、oracle 數據庫啟動階段分析等,如有不足之處,請留言之處。小編會及時更正。感謝朋友們對武林網網站的支持!

發表評論 共有條評論
用戶名: 密碼:
驗證碼: 匿名發表
亚洲香蕉成人av网站在线观看_欧美精品成人91久久久久久久_久久久久久久久久久亚洲_热久久视久久精品18亚洲精品_国产精自产拍久久久久久_亚洲色图国产精品_91精品国产网站_中文字幕欧美日韩精品_国产精品久久久久久亚洲调教_国产精品久久一区_性夜试看影院91社区_97在线观看视频国产_68精品久久久久久欧美_欧美精品在线观看_国产精品一区二区久久精品_欧美老女人bb
日韩专区在线观看| 亚洲成人动漫在线播放| 亚洲精品久久久久久久久久久| 亚洲jizzjizz日本少妇| 亚洲精品一区二三区不卡| 538国产精品一区二区在线| 国产日韩欧美夫妻视频在线观看| 国产亚洲精品综合一区91| 91久久久国产精品| 日韩精品免费在线| 青青草国产精品一区二区| 欧美日韩福利在线观看| 国产精品av在线| 日韩欧美主播在线| 亚洲欧美国产高清va在线播| 国产成人精品网站| 国产精品久久久999| 综合网中文字幕| 亚洲精品成a人在线观看| 国产精彩精品视频| 九九视频这里只有精品| 日韩综合视频在线观看| 中文字幕久久久| 国产精品99久久久久久人| 欧美性少妇18aaaa视频| 国产亚洲欧美aaaa| 7777kkkk成人观看| 一区二区三区美女xx视频| 欧美又大粗又爽又黄大片视频| 欧美成人免费在线观看| 国产精品91久久久| 久久久久五月天| 国产成人涩涩涩视频在线观看| 亚洲国产成人一区| 亚洲久久久久久久久久久| 精品视频一区在线视频| 日韩欧美主播在线| 欧美专区国产专区| 国产精品久久久久久久久久ktv| 亚洲欧美日韩图片| 欧美性xxxx极品hd满灌| 亚洲国产精品专区久久| 欧美精品福利在线| 在线观看欧美日韩国产| 亚洲天堂av图片| 大伊人狠狠躁夜夜躁av一区| 亚洲第一中文字幕| 国产91精品不卡视频| 中文字幕精品国产| 欧美激情精品久久久久久黑人| 亚洲肉体裸体xxxx137| 亚洲国产成人久久综合| 国产精品日韩久久久久| 久久免费成人精品视频| 国产精品免费观看在线| 国产日韩欧美中文在线播放| 97在线视频免费观看| 日韩三级影视基地| 欧美老肥婆性猛交视频| 国产不卡av在线免费观看| 久久精品视频在线观看| 国产成人精品亚洲精品| 日韩欧美一区视频| 亚洲精品中文字幕女同| 九九热精品视频国产| 亚洲欧洲日韩国产| 91精品成人久久| 欧美大肥婆大肥bbbbb| 精品久久久在线观看| 久久久久久国产三级电影| 91午夜理伦私人影院| 中文字幕日韩高清| 日韩中文字幕在线视频播放| 欧美国产日韩一区| 欧美日韩一区二区三区在线免费观看| 欧美日韩中文字幕综合视频| 3344国产精品免费看| 亚洲高清福利视频| 91热精品视频| 日韩视频亚洲视频| 久久亚洲精品国产亚洲老地址| 国精产品一区一区三区有限在线| 亚洲成av人乱码色午夜| 亚洲福利小视频| 最近2019中文字幕第三页视频| 国产精品v片在线观看不卡| 久久精品国产v日韩v亚洲| 亚洲影视九九影院在线观看| 欧美日韩中文字幕| www.99久久热国产日韩欧美.com| 国产精品视频区1| 中文字幕无线精品亚洲乱码一区| 久久99精品久久久久久琪琪| 欧美激情18p| 欧美在线播放视频| 九九九久久国产免费| 亚洲高清不卡av| 中文在线资源观看视频网站免费不卡| 日本精品一区二区三区在线播放视频| 欧美最猛性xxxxx(亚洲精品)| 国产精品一区二区在线| 久久久久久久久久久久av| 国产成人精品久久亚洲高清不卡| 国产91精品视频在线观看| 亚洲日本中文字幕免费在线不卡| 中文字幕亚洲激情| 亚洲精品一二区| 色999日韩欧美国产| 亚洲欧美一区二区激情| 亚洲欧美国内爽妇网| 国产有码一区二区| 中文字幕亚洲欧美| 中文字幕视频在线免费欧美日韩综合在线看| 热久久免费国产视频| 欧美色视频日本高清在线观看| 亚洲激情国产精品| 欧美日韩亚洲成人| 欧美国产日韩一区二区| 91精品综合久久久久久五月天| 亚洲综合视频1区| 九九久久国产精品| 亚洲精品福利资源站| 日韩av在线网页| 亚洲激情在线视频| 日韩电影免费观看中文字幕| 国产亚洲视频在线观看| 日本欧美国产在线| 欧美激情视频在线观看| 久久精品国产综合| 亚洲男子天堂网| 亚洲国产精品久久91精品| 亚洲网站在线看| 久久久久国产精品www| 久久久久国产一区二区三区| 视频直播国产精品| 97av在线影院| 97精品在线视频| 中文字幕亚洲一区二区三区五十路| 日韩视频在线免费| 国产精品久久77777| 日韩av在线网址| 久久久久一本一区二区青青蜜月| 韩国日本不卡在线| 亚洲春色另类小说| 久久久久久久久久久亚洲| 亚洲午夜国产成人av电影男同| 欧美黑人国产人伦爽爽爽| 在线观看精品国产视频| 欧美高跟鞋交xxxxxhd| 久久久这里只有精品视频| 97在线视频免费| 国产日本欧美一区二区三区| 欧美精品少妇videofree| 久久久影视精品| 97超级碰碰碰| 91久久久久久久久久久久久| 色综合色综合网色综合| 日韩资源在线观看| 日韩在线观看视频免费| 欧美中在线观看| 久久成年人视频| 日韩av在线高清| 隔壁老王国产在线精品| 91av视频在线免费观看|