SSH enablement workflow timeout during deploy of a large overcloud using config-download
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
tripleo |
Fix Released
|
Medium
|
James Slagle |
Bug Description
Description of problem:
Trying to scale up a 103 node overcloud to 207 nodes, on a multiple attempts we see the heat stack update finish successfully, but the deploy fail due to timeout of ssh enablement workflow.
ssh admin enabssh admin enablement workflow - TIMED OUT.
We saw the same result even on bumping ENABLE_
James Slagle Looked at the ansible log file, andit appears ansible itself succeeded but in the logs we see
2019-08-29 16:40:59.867 409069 ERROR mistral.db.utils [req-704d2539-
James feels this could be related to the stdout geenrated by the command.
Version-Release number of selected component (if applicable):
13
How reproducible:
100% on an overcloud of this size
Steps to Reproduce:
1. deploy a large overcloud using config-donwload
2.
3.
Actual results:
Deploy fails after successful heat stack create/update but fails during ssh enablement workflow.
Expected results:
SSh enablement should succeed as well as overcloud deployment.
Additional info:
Changed in tripleo: | |
status: | New → In Progress |
importance: | Undecided → Medium |
assignee: | nobody → James Slagle (james-slagle) |
milestone: | none → train-3 |
tags: | added: stein-backport-potential |
tags: | added: queens-backport-potential rocky-backport-potential |
Fix proposed to branch: master /review. opendev. org/679475
Review: https:/