TiDB Ansible deploy error #3863

Closed
wolaiye1010 opened this issue Jul 24, 2017 · 5 comments

Comments

@wolaiye1010

wolaiye1010 commented Jul 24, 2017

The deployment fails at the start step, when running ansible-playbook start.yml.
The error output is as follows:
TASK [wait up] **********************************************************************************************************************************************************************
fatal: [10.9.193.135]: FAILED! => {"changed": false, "elapsed": 300, "failed": true, "msg": "Timeout when waiting for PID:/home/tidb/deploy/status/tikv.pid to stop."}
fatal: [10.9.193.136]: FAILED! => {"changed": false, "elapsed": 300, "failed": true, "msg": "Timeout when waiting for PID:/home/tidb/deploy/status/tikv.pid to stop."}
fatal: [10.9.193.121]: FAILED! => {"changed": false, "elapsed": 300, "failed": true, "msg": "Timeout when waiting for PID:/home/tidb/deploy/status/tikv.pid to stop."}
to retry, use: --limit @retry_files/start.retry

PLAY RECAP **************************************************************************************************************************************************************************
10.9.193.119 : ok=23 changed=3 unreachable=0 failed=0
10.9.193.121 : ok=21 changed=3 unreachable=0 failed=1
10.9.193.135 : ok=19 changed=2 unreachable=0 failed=1
10.9.193.136 : ok=19 changed=2 unreachable=0 failed=1
localhost : ok=1 changed=0 unreachable=0 failed=0

My configuration file:

TiDB Cluster Part

[tidb_servers]
10.9.193.119
10.9.193.121

[pd_servers]
10.9.193.119
10.9.193.121

[tikv_servers]
10.9.193.135
#10.9.193.137
10.9.193.121
10.9.193.136

Monitoring Part

[monitoring_servers]
10.9.193.119

[grafana_servers]
10.9.193.119

[monitored_servers:children]
tidb_servers
tikv_servers
pd_servers

Binlog Part

[pump_servers:children]
tidb_servers

[cistern_servers]

[drainer_servers]

[pd_servers:vars]

location_labels = ["zone","rack","host"]

Global variables

[all:vars]
deploy_dir = /home/tidb/deploy

Connection

ssh via root:

ansible_user = root
ansible_become = true
ansible_become_user = work

ssh via normal user

#ansible_user = tidb

cluster_name = test-cluster

misc

enable_elk = False
enable_firewalld = False
enable_ntpd = False
machine_benchmark = True
set_hostname = False

binlog trigger

enable_binlog = False

@buggithubs
Contributor

@wolaiye1010

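  1. Is the host OS Ubuntu?
    1. On Ubuntu, comment out become and become_user, and set ansible_user = work.
  2. Check whether the file /home/tidb/deploy/status/tikv.pid is owned by work.
  3. Try logging in to the node and running it manually, then check for any error messages.
  4. If all of the above are fine, try running stop.yml & start.yml several times.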
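
Items 2 and 3 above can be checked on a TiKV node with a short script. This is only a minimal sketch and not part of tidb-ansible: the helper name inspect_pid_file is hypothetical, and the path you pass in would be the one from the error message. It reports which user owns the pid file and whether the process recorded in it is still alive.

```python
import os
import pwd


def inspect_pid_file(path):
    """Return (owner, pid, alive); pid and alive are None if the file has no usable pid."""
    st = os.stat(path)
    try:
        owner = pwd.getpwuid(st.st_uid).pw_name
    except KeyError:
        owner = str(st.st_uid)  # uid has no passwd entry, fall back to the number

    try:
        with open(path) as f:
            pid = int(f.read().strip())
    except (OSError, ValueError):
        return owner, None, None
    if pid <= 0:
        return owner, None, None

    try:
        os.kill(pid, 0)  # signal 0 only checks that the process exists
        alive = True
    except ProcessLookupError:
        alive = False
    except PermissionError:
        alive = True  # the process exists but belongs to another user
    return owner, pid, alive
```

Calling inspect_pid_file("/home/tidb/deploy/status/tikv.pid") on each failing node shows whether the owner matches ansible_become_user and whether a stale or still-running TiKV process is behind the timeout.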

@wolaiye1010
Author

wolaiye1010 commented Jul 24, 2017

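@buggithubs,
The host OS is CentOS 6.6.
/home/tidb/deploy/status/tikv.pid is owned by work.
Running stop.yml & start.yml several times does not help; the same error is reported every time.
The test hosts cannot reach the external network, but my localhost can, and I ran ansible-playbook start.yml from localhost.
I am not sure whether that is related.

"Try logging in to the node and running it manually, then check for any error messages."
Could you please explain how to run it manually?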

@ngaut changed the title from "TiDB Ansible 部署 报错" to "TiDB Ansible deploy error" on Jul 24, 2017
@zimulala
Contributor

@LinuxGit

@LinuxGit

@wolaiye1010 Please refer to https://github.com/pingcap/docs-cn/blob/master/op-guide/ansible-deployment.md. Please use CentOS 7 or later; the default TiDB binaries are built for CentOS 7. Also, there should be either one or three PD instances. You can open a new issue at https://github.com/pingcap/tidb-ansible if you encounter any other deployment problems.
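
The PD count can be checked against the inventory before deploying. The following is a rough sketch, assuming a plain INI-style inventory like the one posted above; hosts_in_group is a hypothetical helper, not part of Ansible or tidb-ansible, and the sample text is trimmed from the reported configuration.

```python
def hosts_in_group(inventory_text, group):
    """List the host entries under [group] in an INI-style Ansible inventory."""
    hosts, current = [], None
    for raw in inventory_text.splitlines():
        line = raw.strip()
        if not line or line.startswith(("#", ";")):
            continue  # skip blanks and commented-out hosts
        if line.startswith("[") and line.endswith("]"):
            current = line[1:-1]  # e.g. "pd_servers" or "pd_servers:vars"
            continue
        if current == group:
            hosts.append(line.split()[0])  # drop any per-host variables
    return hosts


INVENTORY = """
[pd_servers]
10.9.193.119
10.9.193.121

[tikv_servers]
10.9.193.135
#10.9.193.137
10.9.193.121
10.9.193.136
"""

pd_hosts = hosts_in_group(INVENTORY, "pd_servers")
if len(pd_hosts) not in (1, 3):
    print(f"pd_servers lists {len(pd_hosts)} hosts; one or three are expected")
```

With the inventory from this issue, pd_servers lists two hosts, so the check reports a mismatch.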

@wolaiye1010
Copy link
Author

@LinuxGit ,
Thank you, CentOS 7 is OK!
