Sensible cap on worker-multiplier is needed
| Affects | Status | Importance | Assigned to | Milestone |
|---|---|---|---|---|
| Charm Helpers | Fix Released | Undecided | Nobuto Murata | |
| OpenStack API Layer | Fix Released | Undecided | Nobuto Murata | |
| OpenStack Charm Guide | Fix Released | High | Nobuto Murata | |
| OpenStack Nova Compute Charm | Fix Released | Wishlist | Nobuto Murata | |
Bug Description
worker-multiplier is a common option across multiple OpenStack charms. Most of the control-plane charms are deployed into LXD containers for higher density and better separation. However, nova-compute by its nature cannot be deployed into LXD, so no cap is applied to worker-multiplier when no value is set.
https:/
> worker-multiplier
> (float) The CPU core multiplier to use when configuring worker processes for this service, e.g. metadata-api. By default, the number of workers for each daemon is set to twice the number of CPU cores a service unit has. When deployed in a LXD container, this default value will be capped to 4 workers unless this configuration option is set.
One example: a customer ended up with 150+ workers, which ate almost all of the system's memory and triggered the OOM killer. While users can set an explicit value through the charm option, a sensible default cap would be nice to have even on bare metal.
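For illustration, here is a minimal sketch of the worker-count behaviour described above. This is a simplified model, not the charm-helpers implementation; the function name and the bare-metal cap value are hypothetical and only stand in for the cap this bug asks for:

```python
def default_workers(cores, multiplier=None, in_lxd=False, bare_metal_cap=None):
    """Illustrative model only; not the actual charm-helpers code.

    cores:          CPU cores visible to the unit
    multiplier:     worker-multiplier charm option (None = unset)
    in_lxd:         True when the unit runs inside a LXD container
    bare_metal_cap: hypothetical cap requested by this bug; none exists today
    """
    if multiplier is not None:
        return max(1, int(cores * multiplier))   # explicit option always wins
    workers = cores * 2                          # documented default: 2 x cores
    if in_lxd:
        return min(workers, 4)                   # existing cap inside LXD
    if bare_metal_cap is not None:
        return min(workers, bare_metal_cap)      # the proposed sensible cap
    return workers                               # bare metal today: uncapped

print(default_workers(cores=76))                    # 152 workers per daemon
print(default_workers(cores=76, in_lxd=True))       # 4
print(default_workers(cores=76, bare_metal_cap=8))  # 8 (hypothetical cap)
print(default_workers(cores=76, multiplier=0.25))   # 19 (explicit option)
```

As a workaround today, an operator can pin the value explicitly, e.g. `juju config nova-compute worker-multiplier=0.25` (0.25 is only an example value, not a recommendation).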
Changed in charm-nova-compute:
status: New → Triaged
importance: Undecided → Wishlist
tags: added: sts

Changed in charm-helpers:
assignee: nobody → Nobuto Murata (nobuto)
status: New → Fix Committed

Changed in charm-guide:
assignee: nobody → Nobuto Murata (nobuto)

Changed in charm-guide:
status: Triaged → In Progress

Changed in layer-openstack-api:
milestone: none → 21.04

Changed in charm-nova-compute:
status: Fix Committed → Fix Released

Changed in layer-openstack-api:
status: Fix Committed → Fix Released
We have encountered this in a deployment using reserved hugepages.
350 × 1G pages were reserved, leaving 22G for hypervisor processes.
Around 14G of that was consumed by metadata-api processes, which, in combination with the other system processes, left little free memory and resulted in memory fragmentation.
The outcome was that qemu was unable to allocate order-6 pages when launching VMs (which were to be backed by hugepages).
I expect that in setups using reserved hugepages the effects of this bug would be more pronounced and occur more often.
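To make the memory pressure concrete, a back-of-the-envelope sketch is below. The core count and per-worker footprint are assumptions chosen to roughly match the reported figures, not measurements from this deployment:

```python
# Rough arithmetic only; cores and per-worker RSS are assumed values.
hugepages_gib = 350          # 350 x 1G pages reserved, as reported
left_for_host_gib = 22       # memory left for hypervisor processes, as reported
cores = 76                   # assumed core count on the compute host
workers = cores * 2          # uncapped default of 2 x cores -> 152 workers
rss_per_worker_gib = 0.09    # ~90 MiB per metadata-api worker (assumption)

used_by_workers_gib = workers * rss_per_worker_gib   # ~13.7 GiB
print(f"{used_by_workers_gib:.1f} GiB of the {left_for_host_gib} GiB "
      f"not reserved for hugepages consumed by {workers} workers")
```

Under these assumptions the uncapped default alone accounts for roughly 14G of the 22G available to the host, which matches the fragmentation and allocation failures observed.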