Brief Description
-----------------
ansible-playbook failed at check for controller-0 online status
Severity
--------
Critical
Steps to Reproduce
------------------
Install controller-0
Run ansible-playbook to configure controller-0
Expected Behavior
------------------
ansible-playbook ran successfully
Actual Behavior
----------------
ansible-playbook failed. See Timestamp/Logs for details.
Reproducibility
---------------
Reproducible
System Configuration
--------------------
Any system
Branch/Pull Time/Commit
-----------------------
master 20190614T013000Z
Last Pass
---------
master 20190613T013000Z
Timestamp/Logs
--------------
TASK [bringup-essential-services : Update resolv.conf file for unlock] *********
changed: [localhost]
TASK [bringup-essential-services : Check for controller-0 online status] *******
FAILED - RETRYING: Check for controller-0 online status (10 retries left).
changed: [localhost]
TASK [bringup-essential-services : Wait for 60 seconds to ensure kube-system pods are all started] ***
TASK [bringup-essential-services : Start parallel tasks to wait for Kubernetes component, Networking and Tiller pods to reach ready state] ***
changed: [localhost] => (item=k8s-app=calico-node)
changed: [localhost] => (item=k8s-app=calico-kube-controllers)
changed: [localhost] => (item=k8s-app=kube-proxy)
changed: [localhost] => (item=app=multus)
changed: [localhost] => (item=app=sriov-cni)
changed: [localhost] => (item=app=helm)
changed: [localhost] => (item=component=kube-apiserver)
changed: [localhost] => (item=component=kube-controller-manager)
changed: [localhost] => (item=component=kube-scheduler)
TASK [bringup-essential-services : Get wait tasks results] *********************
FAILED - RETRYING: Get wait tasks results (10 retries left).
changed: [localhost] => (item={'_ansible_parsed': True, '_ansible_item_result': True, '_ansible_item_label': u'k8s-app=calico-node', u'ansible_job_id': u'915908010294.112925', 'failed': False, u'started': 1, 'changed': True, 'item': u'k8s-app=calico-node', u'finished': 0, u'results_file': u'/root/.ansible_async/915908010294.112925', '_ansible_ignore_errors': None, '_ansible_no_log': False})
FAILED - RETRYING: Get wait tasks results (10 retries left).
FAILED - RETRYING: Get wait tasks results (9 retries left).
changed: [localhost] => (item={'_ansible_parsed': True, '_ansible_item_result': True, '_ansible_item_label': u'k8s-app=calico-kube-controllers', u'ansible_job_id': u'61243884225.113078', 'failed': False, u'started': 1, 'changed': True, 'item': u'k8s-app=calico-kube-controllers', u'finished': 0, u'results_file': u'/root/.ansible_async/61243884225.113078', '_ansible_ignore_errors': None, '_ansible_no_log': False})
changed: [localhost] => (item={'_ansible_parsed': True, '_ansible_item_result': True, '_ansible_item_label': u'k8s-app=kube-proxy', u'ansible_job_id': u'583262417391.114092', 'failed': False, u'started': 1, 'changed': True, 'item': u'k8s-app=kube-proxy', u'finished': 0, u'results_file': u'/root/.ansible_async/583262417391.114092', '_ansible_ignore_errors': None, '_ansible_no_log': False})
changed: [localhost] => (item={'_ansible_parsed': True, '_ansible_item_result': True, '_ansible_item_label': u'app=multus', u'ansible_job_id': u'230395068872.114362', 'failed': False, u'started': 1, 'changed': True, 'item': u'app=multus', u'finished': 0, u'results_file': u'/root/.ansible_async/230395068872.114362', '_ansible_ignore_errors': None, '_ansible_no_log': False})
changed: [localhost] => (item={'_ansible_parsed': True, '_ansible_item_result': True, '_ansible_item_label': u'app=sriov-cni', u'ansible_job_id': u'189036162740.114451', 'failed': False, u'started': 1, 'changed': True, 'item': u'app=sriov-cni', u'finished': 0, u'results_file': u'/root/.ansible_async/189036162740.114451', '_ansible_ignore_errors': None, '_ansible_no_log': False})
changed: [localhost] => (item={'_ansible_parsed': True, '_ansible_item_result': True, '_ansible_item_label': u'app=helm', u'ansible_job_id': u'892848644877.114531', 'failed': False, u'started': 1, 'changed': True, 'item': u'app=helm', u'finished': 0, u'results_file': u'/root/.ansible_async/892848644877.114531', '_ansible_ignore_errors': None, '_ansible_no_log': False})
failed: [localhost] (item={'_ansible_parsed': True, '_ansible_item_result': True, '_ansible_item_label': u'component=kube-apiserver', u'ansible_job_id': u'302170236966.114604', 'failed': False, u'started': 1, 'changed': True, 'item': u'component=kube-apiserver', u'finished': 0, u'results_file': u'/root/.ansible_async/302170236966.114604', '_ansible_ignore_errors': None, '_ansible_no_log': False}) => {"ansible_job_id": "302170236966.114604", "attempts": 1, "changed": true, "cmd": ["kubectl", "--kubeconfig=/etc/kubernetes/admin.conf", "wait", "--namespace=kube-system", "--for=condition=Ready", "pods", "--selector", "component=kube-apiserver", "--timeout=30s"], "delta": "0:00:00.118897", "end": "2019-06-14 06:38:48.812429", "finished": 1, "item": {"ansible_job_id": "302170236966.114604", "changed": true, "failed": false, "finished": 0, "item": "component=kube-apiserver", "results_file": "/root/.ansible_async/302170236966.114604", "started": 1}, "msg": "non-zero return code", "rc": 1, "start": "2019-06-14 06:38:48.693532", "stderr": "error: no matching resources found", "stderr_lines": ["error: no matching resources found"], "stdout": "", "stdout_lines": []}
failed: [localhost] (item={'_ansible_parsed': True, '_ansible_item_result': True, '_ansible_item_label': u'component=kube-controller-manager', u'ansible_job_id': u'247619792932.115001', 'failed': False, u'started': 1, 'changed': True, 'item': u'component=kube-controller-manager', u'finished': 0, u'results_file': u'/root/.ansible_async/247619792932.115001', '_ansible_ignore_errors': None, '_ansible_no_log': False}) => {"ansible_job_id": "247619792932.115001", "attempts": 1, "changed": true, "cmd": ["kubectl", "--kubeconfig=/etc/kubernetes/admin.conf", "wait", "--namespace=kube-system", "--for=condition=Ready", "pods", "--selector", "component=kube-controller-manager", "--timeout=30s"], "delta": "0:00:00.120772", "end": "2019-06-14 06:38:49.916527", "finished": 1, "item": {"ansible_job_id": "247619792932.115001", "changed": true, "failed": false, "finished": 0, "item": "component=kube-controller-manager", "results_file": "/root/.ansible_async/247619792932.115001", "started": 1}, "msg": "non-zero return code", "rc": 1, "start": "2019-06-14 06:38:49.795755", "stderr": "error: no matching resources found", "stderr_lines": ["error: no matching resources found"], "stdout": "", "stdout_lines": []}
changed: [localhost] => (item={'_ansible_parsed': True, '_ansible_item_result': True, '_ansible_item_label': u'component=kube-scheduler', u'ansible_job_id': u'843007711619.115157', 'failed': False, u'started': 1, 'changed': True, 'item': u'component=kube-scheduler', u'finished': 0, u'results_file': u'/root/.ansible_async/843007711619.115157', '_ansible_ignore_errors': None, '_ansible_no_log': False})
PLAY RECAP *********************************************************************
localhost : ok=198 changed=120 unreachable=0 failed=1
Test Activity
-------------
Sanity
Please double check that the issue is missing online status.
Host came online at the 12 minute uptime mark and the online timeout for controllers is 20 minutes.
| d6f3c762- 772a-48f6- ba22-4cc4283cfd fe | platform | maintenance | controller_ boot_timeout | 1200 |
Auto install logs suggest the failure was a failure to start essential services.