Bug #1902264 “os-*-hostname change renders cloud unusable” : Bugs : OpenStack Keystone Charm

Dan Ackerson (dan.ackerson) on 2020-11-02

Dorina Timbur (dorina-t) wrote on 2020-11-04:

#1

To clarify the impact previously observed: after the FQDN was changed, the customer could no longer access the cloud from outside. Logging in via Horizon failed with "An error occurred authenticating. Please try again later.", and they could not authenticate with either a local or an LDAP account. Swift was still pointing to the previous URL.
It would be great if someone from product could replicate an FQDN change in a lab and provide documentation on how best to do it in a production environment.

Billy Olsen (billy-olsen) on 2020-11-17

Changed in charm-keystone:
assignee:	nobody → Chris MacNaughton (chris.macnaughton)

Chris MacNaughton (chris.macnaughton) wrote on 2020-11-18:

#2

Looking at the nova-cloud-controller charm, it looks like endpoint evaluation is only ever done in the relation-joined hook with keystone (https://github.com/openstack/charm-nova-cloud-controller/blob/25da3180b53abc9843cba37b12e08258de8644bf/hooks/nova_cc_hooks.py#L443). I haven't yet evaluated if that's the case across the rest of the charms but I suspect it is.

What that means is that whatever hostname(s) are configured when the initial relation to keystone is made is the hostname that will be put into the keystone catalog, and the config option will then never be re-evaluated in the context of the keystone relation. The charms consuming the keystone identity service relation need to be updated to ensure that they propagate URL updates to Keystone.

Chris MacNaughton (chris.macnaughton) on 2020-11-19

no longer affects:

charm-nova-cloud-controller

Chris MacNaughton (chris.macnaughton) wrote on 2020-11-19:

#3

Looking more thoroughly, nova-cloud-controller does update the hostnames and related endpoint data in the config-changed hook and in many of the relation hooks.

Chris MacNaughton (chris.macnaughton) wrote on 2020-11-20:

#4

I have validated, with the latest charms in ~openstack-charmers-next, that the hostnames in the keystone catalog change correctly when the config is updated, although it does take a few minutes. Additionally, I've confirmed that replacing a wildcard certificate (*.first-domain) with a new one (*.second.domain) at the same time as the hostname change also works as expected, and that radosgw and the dashboard continue to work. I am able to access the (updated) Swift endpoint in the catalog as well as log in to the dashboard.

I'm going to close this bug as incomplete. If it can be reproduced, please include more information about how to reproduce it and update it to New!

Changed in charm-keystone:
status:	New → Incomplete

James Troup (elmo) wrote on 2020-11-20:

#5

Hi, it'd be *really* helpful if your validation could be done with released stable charms, because that's what we run on customer clouds. Proving it's broken with *next* charms is... not particularly helpful to us.

James Troup (elmo) wrote on 2020-11-20:

#6

s/it's broken/it's NOT broken/

Chris MacNaughton (chris.macnaughton) wrote on 2020-11-20:

#7

Sure. The longest part of my test was getting everything set up so that I was using hostnames and could update everything; I will re-run the same check on the stable charms.

Chris MacNaughton (chris.macnaughton) wrote on 2020-11-20:

#8

As with the -next charms, the stable charms have successfully updated all endpoints in the keystone catalog:

and all of the services seem to be responsive. I can log in through the dashboard and I can query all of the backend services successfully. As in the original deployment scenario, I am using different SSL certificate and key material matching a wildcard for each of the domains in question (*.os.test.alph.ac and *.new-os.test.alph.ac, respectively) rather than using Vault.
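
A check like the one described above could be automated along these lines. This is only a sketch: the `stale_endpoints` helper, the dictionary keys, and the sample URLs are hypothetical, and the endpoint list stands in for whatever the catalog listing returns in a real deployment. Only the two domains come from this comment.

```python
from urllib.parse import urlparse


def stale_endpoints(endpoints, new_domain):
    """Return the endpoints whose hostname is not under new_domain.

    `endpoints` is a list of dicts with the (hypothetical) keys
    "service", "interface" and "url".
    """
    suffix = "." + new_domain.lstrip(".")
    stale = []
    for endpoint in endpoints:
        host = urlparse(endpoint["url"]).hostname or ""
        if not host.endswith(suffix):
            stale.append(endpoint)
    return stale


# Illustrative data only: one endpoint already moved, one left behind.
catalog = [
    {"service": "keystone", "interface": "public",
     "url": "https://keystone.new-os.test.alph.ac:5000/v3"},
    {"service": "swift", "interface": "public",
     "url": "https://swift.os.test.alph.ac:443/swift/v1"},
]

leftover = stale_endpoints(catalog, "new-os.test.alph.ac")
for endpoint in leftover:
    print(endpoint["service"], endpoint["interface"], endpoint["url"])
```

An empty result would suggest that every endpoint in the supplied list has picked up the new domain; it says nothing about whether the services behind those URLs respond.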

Facundo Ciccioli (fandanbango) wrote on 2020-11-23:

#9

Hi there. I'm attaching the extract of the juju status with the charms' versions where the issue was observed.

Facundo Ciccioli (fandanbango) wrote on 2020-11-23:

#10

juju-versions-where-bug-was-observed.txt (9.4 KiB, text/plain)

Xav Paice (xavpaice) wrote on 2021-01-13:

#11

Hi,

Using charm cs:keystone-319, when we changed the os-admin-hostname, os-internal-hostname, and os-public-hostname options, we found that on the leader unit the relation data provided over the identity-service relation was updated correctly, but on the two non-leader keystone units the same data was not updated (i.e. it still held the old names). This caused some services to fail to update their config, causing issues.

By manually updating the relation data with the following commands, we were able to work around the problem:

relation-set -r identity-service:$i auth_host=$newhost service_host=$newhost
relation-set -r identity-credentials:$i credentials_host=$newhost auth_host=$newhost

Note that this was seen on both the identity-service and identity-credentials relations.
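
For anyone repeating the workaround across several relations, the command lines can be generated rather than typed by hand. The sketch below is an assumption-laden illustration: the `workaround_commands` helper, the relation ids, and the hostname are hypothetical. In a real deployment the ids would come from the `relation-ids` hook tool, and the resulting commands would have to run in a hook context on each affected keystone unit. The keys set for each relation are the ones shown above.

```python
import shlex

# Keys taken from the manual workaround above, per relation name.
KEYS_BY_RELATION = {
    "identity-service": ("auth_host", "service_host"),
    "identity-credentials": ("credentials_host", "auth_host"),
}


def workaround_commands(relation_ids, new_host):
    """Build one relation-set command line per relation id.

    `relation_ids` maps a relation name to a list of numeric ids,
    e.g. {"identity-service": [12, 15]} (hypothetical values).
    """
    commands = []
    for relation, ids in relation_ids.items():
        for rid in ids:
            argv = ["relation-set", "-r", f"{relation}:{rid}"]
            argv += [f"{key}={new_host}" for key in KEYS_BY_RELATION[relation]]
            commands.append(shlex.join(argv))
    return commands


# Hypothetical ids and hostname, for illustration only.
example_ids = {"identity-service": [12], "identity-credentials": [20]}
for line in workaround_commands(example_ids, "keystone.example.com"):
    print(line)
```

The helper only prints the commands; it deliberately does not execute anything, so the output can be reviewed before it is run on a unit.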

Changed in charm-keystone:
status:	Incomplete → New

Billy Olsen (billy-olsen) on 2021-03-11

Changed in charm-keystone:
status:	New → Triaged
importance:	Undecided → Low

OpenStack Infra (hudson-openstack) wrote on 2021-08-16: Fix proposed to charm-keystone (master)

#12

Fix proposed to branch: master
Review: https://review.opendev.org/c/openstack/charm-keystone/+/804802

Changed in charm-keystone:
status:	Triaged → In Progress

OpenStack Infra (hudson-openstack) wrote on 2021-10-08: Fix merged to charm-keystone (master)

#13

Reviewed: https://review.opendev.org/c/openstack/charm-keystone/+/804802
Committed: https://opendev.org/openstack/charm-keystone/commit/9b8b81a0bc8406f03b2de884eeec91b8e8f2d442
Submitter: "Zuul (22348)"
Branch: master

commit 9b8b81a0bc8406f03b2de884eeec91b8e8f2d442
Author: Chris MacNaughton <email address hidden>
Date: Mon Aug 16 16:55:14 2021 -0500

Use the application data bag to set id and id_service notifications

    When purely using relation-set from a leader, updates after
    the leader has changed can lead to old data being persisted
    on a relation in addition to newer data being set by the new
    leader. When this happens, there can be issues with services
    using old data to talk to other related services.

    This change introduces the use of the application data bag
    to ensure that all units related to keystone get the same
    data from the leader, regardless of leadership changes.
    While this change enables the application data bag for these
    relations, it still sends the per-unit relation data as well
    to maintain backwards compatibility. Charms that consume the
    identity-service and identity-notification relations will
    need an update to use the application data bag to complete
    this change.

Partial-Bug: #1902264
Change-Id: Iadd795fec605e7704e5a6673906452279bbecb34
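
The failure mode described in the commit message can be illustrated with a toy model. This is not charm code: the `Relation` class, the unit names, and the hostnames are hypothetical, and the model only captures the idea that each unit writes its own per-unit data bag while the application data bag is a single bag written by whichever unit is currently leader.

```python
class Relation:
    """Toy model of one relation as seen by a consuming charm."""

    def __init__(self, units):
        self.unit_bags = {unit: {} for unit in units}  # one bag per unit
        self.app_bag = {}                              # one bag per application

    def set_unit_data(self, unit, **settings):
        # A unit can only write to its own bag.
        self.unit_bags[unit].update(settings)

    def set_app_data(self, **settings):
        # Written by the current leader, whoever that is.
        self.app_bag.update(settings)

    def unit_hosts(self):
        """Distinct service_host values visible across the unit bags."""
        return {bag["service_host"]
                for bag in self.unit_bags.values()
                if "service_host" in bag}


rel = Relation(["keystone/0", "keystone/1"])

# keystone/0 is leader and publishes the original hostname.
rel.set_unit_data("keystone/0", service_host="old.example.com")
rel.set_app_data(service_host="old.example.com")

# Leadership moves to keystone/1, then the hostname changes.
rel.set_unit_data("keystone/1", service_host="new.example.com")
rel.set_app_data(service_host="new.example.com")

print(sorted(rel.unit_hosts()))     # old and new values coexist
print(rel.app_bag["service_host"])  # a single, current value
```

In the model, the former leader's per-unit bag keeps the old hostname because nothing rewrites it, which is the stale-data situation the commit message describes. The application bag holds one value regardless of which unit was leader when it was written.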

Billy Olsen (billy-olsen) on 2023-10-02

Changed in charm-keystone:
status:	In Progress → Fix Released
no longer affects:	charm-keystone/ussuri
