Deployment fails in step 1 with "not a directory" when mounting "/etc/ssh/ssh_known_hosts"

Bug #1772071 reported by Sai Sindhur Malleni
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
tripleo
Fix Released
Critical
Alex Schultz

Bug Description

Description of problem:
Deploying OSP13 in the Red Hat Scale LAb with 3 controllers and 150 compute nodes, deployment fails in Step1 with the following error message.

overcloud.AllNodesDeploySteps.ControllerDeployment_Step1.1:
  resource_type: OS::Heat::StructuredDeployment
  physical_resource_id: 4de9d804-9db7-4904-b3f5-823b7117369f
  status: CREATE_FAILED
  status_reason: |
    Error: resources[1]: Deployment to server failed: deploy_status_code : Deployment exited with non-zero status
 code: 2
  deploy_stdout: |
    ...
            "Error running ['docker', 'run', '--name', 'memcached', '--label', 'config_id=tripleo_step1', '--labe
l', 'container_name=memcached', '--label', 'managed_by=paunch', '--label', 'config_data={\"start_order\": 1, \"co
mmand\": [\"/bin/bash\", \"-c\", \"source /etc/sysconfig/memcached; /usr/bin/memcached -p ${PORT} -u ${USER} -m $
{CACHESIZE} -c ${MAXCONN} $OPTIONS >> /var/log/memcached.log 2>&1\"], \"volumes\": [\"/etc/hosts:/etc/hosts:ro\",
 \"/etc/localtime:/etc/localtime:ro\", \"/etc/pki/ca-trust/extracted:/etc/pki/ca-trust/extracted:ro\", \"/etc/pki
/tls/certs/ca-bundle.crt:/etc/pki/tls/certs/ca-bundle.crt:ro\", \"/etc/pki/tls/certs/ca-bundle.trust.crt:/etc/pki
/tls/certs/ca-bundle.trust.crt:ro\", \"/etc/pki/tls/cert.pem:/etc/pki/tls/cert.pem:ro\", \"/dev/log:/dev/log\", \
"/etc/ssh/ssh_known_hosts:/etc/ssh/ssh_known_hosts:ro\", \"/etc/puppet:/etc/puppet:ro\", \"/var/lib/config-data/m
emcached/etc/sysconfig/memcached:/etc/sysconfig/memcached:ro\", \"/var/log/containers/memcached:/var/log/\"], \"i
mage\": \"192.168.0.1:8787/rhosp13/openstack-memcached:2018-05-07.2\", \"net\": \"host\", \"restart\": \"always\"
, \"privileged\": false}', '--detach=true', '--net=host', '--privileged=false', '--restart=always', '--volume=/et
c/hosts:/etc/hosts:ro', '--volume=/etc/localtime:/etc/localtime:ro', '--volume=/etc/pki/ca-trust/extracted:/etc/p
ki/ca-trust/extracted:ro', '--volume=/etc/pki/tls/certs/ca-bundle.crt:/etc/pki/tls/certs/ca-bundle.crt:ro', '--vo
lume=/etc/pki/tls/certs/ca-bundle.trust.crt:/etc/pki/tls/certs/ca-bundle.trust.crt:ro', '--volume=/etc/pki/tls/ce
rt.pem:/etc/pki/tls/cert.pem:ro', '--volume=/dev/log:/dev/log', '--volume=/etc/ssh/ssh_known_hosts:/etc/ssh/ssh_k
nown_hosts:ro', '--volume=/etc/puppet:/etc/puppet:ro', '--volume=/var/lib/config-data/memcached/etc/sysconfig/mem
cached:/etc/sysconfig/memcached:ro', '--volume=/var/log/containers/memcached:/var/log/', '192.168.0.1:8787/rhosp1
3/openstack-memcached:2018-05-07.2', '/bin/bash', '-c', 'source /etc/sysconfig/memcached; /usr/bin/memcached -p $
{PORT} -u ${USER} -m ${CACHESIZE} -c ${MAXCONN} $OPTIONS >> /var/log/memcached.log 2>&1']. [125]",
            "stdout: 6645bfdc5b90ce52921f1f8f875ac83a5b878b3d30e3293e5dbbbb39e4a5ae67",
            "stderr: /usr/bin/docker-current: Error response from daemon: oci runtime error: container_linux.go:2
47: starting container process caused \"process_linux.go:364: container init caused \\\"rootfs_linux.go:54: mount
ing \\\\\\\"/etc/ssh/ssh_known_hosts\\\\\\\" to rootfs \\\\\\\"/var/lib/docker/overlay2/2e613821885a32bebdb894820
d7c2e38a86c7b2024e029a12819dd5aed722751/merged\\\\\\\" at \\\\\\\"/var/lib/docker/overlay2/2e613821885a32bebdb894
820d7c2e38a86c7b2024e029a12819dd5aed722751/merged/etc/ssh/ssh_known_hosts\\\\\\\" caused \\\\\\\"not a directory\
\\\\\\"\\\"\""

Logging into the controllers we see all containers are exited for a similar reason
CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES
b66e67777b4b 192.168.0.1:8787/rhosp13/openstack-mariadb:2018-05-07.2 "/bin/bash -c '/us..." 7 hours ago Exited (0) 7 hours ago mysql_image_tag
6645bfdc5b90 192.168.0.1:8787/rhosp13/openstack-memcached:2018-05-07.2 "/bin/bash -c 'sou..." 7 hours ago Created memcached
25cdfcf14239 192.168.0.1:8787/rhosp13/openstack-haproxy:2018-05-07.2 "/bin/bash -c '/us..." 7 hours ago Exited (0) 7 hours ago haproxy_image_tag
10e7c2b0878a 192.168.0.1:8787/rhosp13/openstack-mariadb:2018-05-07.2 "bash -ec 'if [ -e..." 7 hours ago Created mysql_bootstrap
822abff373f4 192.168.0.1:8787/rhosp13/openstack-redis:2018-05-07.2 "/bin/bash -c '/us..." 7 hours ago Exited (0) 7 hours ago redis_image_tag
81619c0ad428 192.168.0.1:8787/rhosp13/openstack-rabbitmq:2018-05-07.2 "/bin/bash -c '/us..." 7 hours ago Exited (0) 7 hours ago rabbitmq_image_tag
eccdf31c874a 192.168.0.1:8787/rhosp13/openstack-cinder-volume:2018-05-07.2 "/bin/bash -c '/us..." 7 hours ago Exited (0) 7 hours ago cinder_volume_image_tag
000560604e74 192.168.0.1:8787/rhosp13/openstack-rabbitmq:2018-05-07.2 "kolla_start" 7 hours ago Exited (0) 7 hours ago rabbitmq_bootstrap
573e30c4c8c2 192.168.0.1:8787/rhosp13/openstack-memcached:2018-05-07.2 "/bin/bash -c 'sou..." 7 hours ago Exited (0) 7 hours ago memcached_init_logs
8a5d446f88d2 192.168.0.1:8787/rhosp13/openstack-mariadb:2018-05-07.2 "chown -R mysql: /..." 7 hours ago Exited (0) 7 hours ago mysql_data_ownership

Version-Release number of selected component (if applicable):
Queens

How reproducible:
100% on this env

Steps to Reproduce:
1. Deploy OSP13
2.
3.

Actual results:
Deploy fails in step1 when creating controllers

Expected results:
Deployment should succeed

Additional info:

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-heat-templates (master)

Fix proposed to branch: master
Review: https://review.openstack.org/569550

Changed in tripleo:
assignee: nobody → Alex Schultz (alex-schultz)
status: New → In Progress
Changed in tripleo:
importance: Undecided → Critical
milestone: none → rocky-2
Changed in tripleo:
assignee: Alex Schultz (alex-schultz) → Emilien Macchi (emilienm)
Changed in tripleo:
assignee: Emilien Macchi (emilienm) → Alex Schultz (alex-schultz)
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-heat-templates (master)

Reviewed: https://review.openstack.org/569550
Committed: https://git.openstack.org/cgit/openstack/tripleo-heat-templates/commit/?id=7c973203345dbcf5ef94a5bcad2b089f5298c9b0
Submitter: Zuul
Branch: master

commit 7c973203345dbcf5ef94a5bcad2b089f5298c9b0
Author: Alex Schultz <email address hidden>
Date: Fri May 18 21:07:27 2018 -0600

    Use str_replace for known_hosts

    There is a limit to how long input data can be for the heat script hook.
    It turns out that data longer than 131072, will return an Argument list
    to long error. To get around this, we need to pass this data in a
    different way so that the heat script hook will work.

    Change-Id: Ie3bd17ca9863e7687721e8c2628e485ea1849321
    Closes-Bug: #1772071

Changed in tripleo:
status: In Progress → Fix Released
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-heat-templates (stable/queens)

Fix proposed to branch: stable/queens
Review: https://review.openstack.org/569909

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-heat-templates (stable/pike)

Fix proposed to branch: stable/pike
Review: https://review.openstack.org/569910

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-heat-templates (stable/ocata)

Fix proposed to branch: stable/ocata
Review: https://review.openstack.org/569911

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-heat-templates (stable/newton)

Fix proposed to branch: stable/newton
Review: https://review.openstack.org/569912

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-heat-templates (stable/ocata)

Reviewed: https://review.openstack.org/569911
Committed: https://git.openstack.org/cgit/openstack/tripleo-heat-templates/commit/?id=6ee93a3a5a35f460585e8b5260bb2ecaaa0c696b
Submitter: Zuul
Branch: stable/ocata

commit 6ee93a3a5a35f460585e8b5260bb2ecaaa0c696b
Author: Alex Schultz <email address hidden>
Date: Fri May 18 21:07:27 2018 -0600

    Use str_replace for known_hosts

    There is a limit to how long input data can be for the heat script hook.
    It turns out that data longer than 131072, will return an Argument list
    to long error. To get around this, we need to pass this data in a
    different way so that the heat script hook will work.

    Change-Id: Ie3bd17ca9863e7687721e8c2628e485ea1849321
    Closes-Bug: #1772071
    (cherry picked from commit 7c973203345dbcf5ef94a5bcad2b089f5298c9b0)

tags: added: in-stable-ocata
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-heat-templates (master)

Fix proposed to branch: master
Review: https://review.openstack.org/570029

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Change abandoned on tripleo-heat-templates (stable/newton)

Change abandoned by Alex Schultz (<email address hidden>) on branch: stable/newton
Review: https://review.openstack.org/569912

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-heat-templates (stable/ocata)

Fix proposed to branch: stable/ocata
Review: https://review.openstack.org/570030

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-heat-templates (stable/newton)

Reviewed: https://review.openstack.org/569912
Committed: https://git.openstack.org/cgit/openstack/tripleo-heat-templates/commit/?id=c77278093e789515204c5dba374493b61b104ce6
Submitter: Zuul
Branch: stable/newton

commit c77278093e789515204c5dba374493b61b104ce6
Author: Alex Schultz <email address hidden>
Date: Fri May 18 21:07:27 2018 -0600

    Use str_replace for known_hosts

    There is a limit to how long input data can be for the heat script hook.
    It turns out that data longer than 131072, will return an Argument list
    to long error. To get around this, we need to pass this data in a
    different way so that the heat script hook will work.

    NOTE: Includes fix from If7a4c6f3581bed270f478b98b8da8fe010acbbc1

    Change-Id: Ie3bd17ca9863e7687721e8c2628e485ea1849321
    Closes-Bug: #1772071
    (cherry picked from commit 7c973203345dbcf5ef94a5bcad2b089f5298c9b0)

tags: added: in-stable-newton
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-heat-templates (stable/ocata)

Reviewed: https://review.openstack.org/570030
Committed: https://git.openstack.org/cgit/openstack/tripleo-heat-templates/commit/?id=57b81654f7cae989d8dcacf1e87901e041744b19
Submitter: Zuul
Branch: stable/ocata

commit 57b81654f7cae989d8dcacf1e87901e041744b19
Author: Alex Schultz <email address hidden>
Date: Tue May 22 13:22:17 2018 -0600

    Fix known hosts config script value

    There was an extra | that was missed. It's not tested in Master but is
    in <= Queens.

    Change-Id: If7a4c6f3581bed270f478b98b8da8fe010acbbc1
    Closes-Bug: #1772071

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-heat-templates (stable/pike)

Reviewed: https://review.openstack.org/569910
Committed: https://git.openstack.org/cgit/openstack/tripleo-heat-templates/commit/?id=57789d7c9d6fac191e60a2ba57c7dd86e945dee5
Submitter: Zuul
Branch: stable/pike

commit 57789d7c9d6fac191e60a2ba57c7dd86e945dee5
Author: Alex Schultz <email address hidden>
Date: Fri May 18 21:07:27 2018 -0600

    Use str_replace for known_hosts

    There is a limit to how long input data can be for the heat script hook.
    It turns out that data longer than 131072, will return an Argument list
    to long error. To get around this, we need to pass this data in a
    different way so that the heat script hook will work.

    NOTE: Includes fix from If7a4c6f3581bed270f478b98b8da8fe010acbbc1

    Change-Id: Ie3bd17ca9863e7687721e8c2628e485ea1849321
    Closes-Bug: #1772071
    (cherry picked from commit 7c973203345dbcf5ef94a5bcad2b089f5298c9b0)

tags: added: in-stable-pike
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-heat-templates (master)

Reviewed: https://review.openstack.org/570029
Committed: https://git.openstack.org/cgit/openstack/tripleo-heat-templates/commit/?id=75d860500510a60226fee549f73090fc8d1550f3
Submitter: Zuul
Branch: master

commit 75d860500510a60226fee549f73090fc8d1550f3
Author: Alex Schultz <email address hidden>
Date: Tue May 22 13:22:17 2018 -0600

    Fix known hosts config script value

    There was an extra | that was missed. It's not tested in Master but is
    in <= Queens.

    Change-Id: If7a4c6f3581bed270f478b98b8da8fe010acbbc1
    Closes-Bug: #1772071

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-heat-templates (stable/queens)

Reviewed: https://review.openstack.org/569909
Committed: https://git.openstack.org/cgit/openstack/tripleo-heat-templates/commit/?id=482f2eb2f736e8d5d01a775dae160f819240d708
Submitter: Zuul
Branch: stable/queens

commit 482f2eb2f736e8d5d01a775dae160f819240d708
Author: Alex Schultz <email address hidden>
Date: Fri May 18 21:07:27 2018 -0600

    Use str_replace for known_hosts

    There is a limit to how long input data can be for the heat script hook.
    It turns out that data longer than 131072, will return an Argument list
    to long error. To get around this, we need to pass this data in a
    different way so that the heat script hook will work.

    NOTE: Includes fix from If7a4c6f3581bed270f478b98b8da8fe010acbbc1

    Change-Id: Ie3bd17ca9863e7687721e8c2628e485ea1849321
    Closes-Bug: #1772071
    (cherry picked from commit 7c973203345dbcf5ef94a5bcad2b089f5298c9b0)

tags: added: in-stable-queens
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-heat-templates 7.0.13

This issue was fixed in the openstack/tripleo-heat-templates 7.0.13 release.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-heat-templates 6.2.14

This issue was fixed in the openstack/tripleo-heat-templates 6.2.14 release.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-heat-templates 8.0.3

This issue was fixed in the openstack/tripleo-heat-templates 8.0.3 release.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-heat-templates 9.0.0.0b3

This issue was fixed in the openstack/tripleo-heat-templates 9.0.0.0b3 development milestone.

Revision history for this message
Doug Hellmann (doug-hellmann) wrote : Fix included in openstack/tripleo-heat-templates 5.3.13

This issue was fixed in the openstack/tripleo-heat-templates 5.3.13 release.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.