Grenade job fails due to timeout waiting for SSH
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Cinder |
In Progress
|
Undecided
|
Unassigned |
Bug Description
Seen in a nova-grenade-
In grenade/
After a failure, the server console log is dumped and in it the last message is "Starting dropbear sshd: OK" which makes it seem likely that sshd was not up and running yet and that's why SSH failed.
We could increase the timeout here to give the server a bit more time to get sshd running before giving up.
Excerpt from controller/
2024-05-17 21:51:37.799 | + /opt/stack/
2024-05-17 21:51:37.801 | + /opt/stack/
2024-05-17 21:51:37.804 | ++ /opt/stack/
2024-05-17 21:51:37.809 | + /opt/stack/
2024-05-17 21:51:37.811 | + /opt/stack/
2024-05-17 21:51:37.825 | OpenSSH_8.9p1 Ubuntu-3ubuntu0.7, OpenSSL 3.0.2 15 Mar 2022
2024-05-17 21:51:37.825 | debug1: Reading configuration data /etc/ssh/ssh_config
2024-05-17 21:51:37.826 | debug1: /etc/ssh/ssh_config line 19: include /etc/ssh/
2024-05-17 21:51:37.826 | debug1: /etc/ssh/ssh_config line 21: Applying options for *
2024-05-17 21:51:37.827 | debug1: Connecting to 172.24.5.150 [172.24.5.150] port 22.
2024-05-17 21:51:37.836 | debug1: connect to address 172.24.5.150 port 22: Connection refused
2024-05-17 21:51:37.836 | ssh: connect to host 172.24.5.150 port 22: Connection refused
2024-05-17 21:51:37.841 | + /opt/stack/
2024-05-17 21:51:37.844 | + /opt/stack/
2024-05-17 21:51:37.846 | + /opt/stack/
2024-05-17 21:51:37.846 | SSH not responding yet, trying again...
2024-05-17 21:51:37.848 | + /opt/stack/
[...]
2024-05-17 21:52:06.181 | ++ /opt/stack/
2024-05-17 21:52:06.186 | + /opt/stack/
2024-05-17 21:52:06.188 | + /opt/stack/
2024-05-17 21:52:06.190 | + /opt/stack/
2024-05-17 21:52:06.192 | + /opt/stack/
2024-05-17 21:52:06.193 | + /opt/stack/
2024-05-17 21:52:06.196 | ++ /opt/stack/
2024-05-17 21:52:06.200 | + /opt/stack/
2024-05-17 21:52:06.203 | + /opt/stack/
2024-05-17 21:52:06.210 | OpenSSH_8.9p1 Ubuntu-3ubuntu0.7, OpenSSL 3.0.2 15 Mar 2022
2024-05-17 21:52:06.210 | debug1: Reading configuration data /etc/ssh/ssh_config
2024-05-17 21:52:06.210 | debug1: /etc/ssh/ssh_config line 19: include /etc/ssh/
2024-05-17 21:52:06.210 | debug1: /etc/ssh/ssh_config line 21: Applying options for *
2024-05-17 21:52:06.211 | debug1: Connecting to 172.24.5.150 [172.24.5.150] port 22.
2024-05-17 21:52:06.214 | debug1: connect to address 172.24.5.150 port 22: Connection refused
2024-05-17 21:52:06.214 | ssh: connect to host 172.24.5.150 port 22: Connection refused
2024-05-17 21:52:06.217 | + /opt/stack/
2024-05-17 21:52:06.220 | + /opt/stack/
2024-05-17 21:52:06.223 | + /opt/stack/
2024-05-17 21:52:06.223 | SSH not responding yet, trying again...
2024-05-17 21:52:06.225 | + /opt/stack/
2024-05-17 21:52:07.231 | ++ /opt/stack/
2024-05-17 21:52:07.235 | + /opt/stack/
2024-05-17 21:52:07.238 | + /opt/stack/
2024-05-17 21:52:07.241 | + /opt/stack/
2024-05-17 21:52:07.243 | + /opt/stack/
2024-05-17 21:52:07.245 | + /opt/stack/
2024-05-17 21:52:09.257 | [ 0.000000] Linux version 5.15.0-71-generic (buildd@
2024-05-17 21:52:09.258 | [ 0.000000] Command line: LABEL=cirros-rootfs ro console=tty1 console=ttyS0
[...]
2024-05-17 21:52:09.268 | Top of dropbear init script
2024-05-17 21:52:09.268 | Starting dropbear sshd: OK
2024-05-17 21:52:09.472 | + /opt/stack/
2024-05-17 21:52:09.474 | + /opt/stack/
2024-05-17 21:52:09.476 | [Call Trace]
2024-05-17 21:52:09.476 | /opt/stack/
2024-05-17 21:52:09.476 | /opt/stack/
2024-05-17 21:52:09.480 | [ERROR] /opt/stack/
Proposed: https:/ /review. opendev. org/c/openstack /grenade/ +/919988