LXD containers fail to download on a slow-ish internet connection
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Canonical Juju |
Triaged
|
Wishlist
|
Unassigned |
Bug Description
Hi,
When deploying bundles on MAAS 2.7.1 or 2.8, lxd containers will begin to download as normal, but as more containers are started and start their respective installs more stress is applied to the network, and the later containers fail to download.
MAAS/Juju will retry this download like on line 21: https:/
It will then fail again, and will not try again, and all remaining containers on that machine will subsequently fail (as there is only one container being downloaded for that machine).
Resulting in this result: https:/
I am seeing this happen on internet connections around 40MBit/s to 100Mbit/s on multiple different MAAS deployments (Orangeboxes).
Many thanks,
Peter
no longer affects: | maas |
I think the request here is to add a --limit or --sequential or --slow flag to juju deploy, to reduce the number of machines it will try to spin up simultaneously.
Either that, or we'd want to be able to control the timeout or number of retries when allocating a machine. (Overall, though, I think it would be preferred to limit the number of retries we need in the first place, rather than soaking the connection and then trying to handle the consequences after the fact.)