Skip to content

checkMaster失败导致主节点不恢复问题 #387

@htner

Description

@htner

node.go checkMaster() 函数

  1. 96行 判断如果n.master为空将不再检查,直接返回
  2. 106行 n.UpMaster(db.addr)可能会将n.master设置为空

那是否连接主节点的网络抖动后,可能会出现再也不能恢复的情况?

补充相关日志:
2017/10/18 20:42:08 - ERROR - node.go:[102] - [Node] "checkMaster" "Ping" "db.Addr=starmaker-live.cczpbf5hc2kb.us-west-2.rds.amazonaws.com:3306|error=connection was bad" conn_id=0
2017/10/18 20:44:32 - ERROR - node.go:[254] - [Node] "UpMaster" "dial tcp 34.210.172.35:3306: getsockopt: connection timed out" "" conn_id=0
2017/10/18 20:44:48 - ERROR - node.go:[97] - [Node] "checkMaster" "Master is no alive" "" conn_id=0

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions