Keystone register.yml task errors during deploy on existing environment

Bug #1797814 reported by Eric Miller
This bug affects 8 people
Affects        Status        Importance  Assigned to    Milestone
kolla-ansible  Fix Released  Medium      Mark Goddard
Queens         Fix Released  Medium      Mark Goddard
Rocky          Fix Released  Medium      Mark Goddard
Stein          Fix Released  Medium      Mark Goddard
Train          Fix Released  Medium      Mark Goddard

Bug Description

While running a deploy on an existing 7.0.0.0rc3devXX environment (which includes a "reconfigure" since that just calls deploy.yml), this task errors:

TASK [keystone : Creating admin project, user, role, service, and endpoint] *******************************************************************************
task path: /usr/share/kolla-ansible/ansible/roles/keystone/tasks/register.yml:2
fatal: [controller001]: FAILED! => {"msg": "The conditional check '(keystone_bootstrap.stdout | from_json).changed' failed. The error was: Invalid control character at: line 1 column 332 (char 331)"}

It appears that the include statement here:
https://github.com/openstack/kolla-ansible/blob/3e7fc62c51f61b6afeb922edc3796e6f9c16e847/ansible/roles/keystone/tasks/bootstrap.yml#L35

does not get run, so the keystone_bootstrap stdout is never generated.

At least that is my first guess. :)

Eric

Revision history for this message
Eric Miller (erickmiller) wrote :

I missed that the "failed_when" line was referring to the "register" variable in this task - so it is NOT related to the bootstrap.yml file I mentioned.

So, something in the docker exec statement failed. I'll re-run with -vvv to see if I can get the output from the command (I only had -vv this last run).
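For reference, the failing task boils down to roughly this pattern (a simplified sketch pieced together from the error above and the command quoted later in this thread, not the verbatim kolla-ansible role code; the region loop variable name is made up here):

```
# Simplified sketch of the register.yml task (not the verbatim role code):
# the bootstrap command's stdout is registered and then parsed as JSON in the
# changed_when/failed_when conditionals, so any non-JSON stdout makes the
# conditional itself blow up with the error shown above.
- name: Creating admin project, user, role, service, and endpoint
  command: >
    docker exec keystone kolla_keystone_bootstrap
    {{ openstack_auth.username }} {{ openstack_auth.password }}
    {{ openstack_auth.project_name }} admin
    {{ keystone_admin_url }} {{ keystone_internal_url }}
    {{ keystone_public_url }} {{ item }}
  register: keystone_bootstrap
  changed_when: (keystone_bootstrap.stdout | from_json).changed
  failed_when: (keystone_bootstrap.stdout | from_json).failed
  with_items: "{{ keystone_regions | default(['RegionOne']) }}"  # hypothetical variable name
```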

Eric

Revision history for this message
Eric Miller (erickmiller) wrote :

It looks like the failure is caused by this SQL statement error:

... in raise_mysql_exception
2018-10-14 19:04:07.664 192 ERROR keystone     raise errorclass(errno, errval)
2018-10-14 19:04:07.664 192 ERROR keystone DBError: (pymysql.err.InternalError) (1054, u"Unknown column 'description' in 'field list'") [SQL: u'INSERT INTO `role` (id, name, description, extra) VALUES (%(id)s, %(name)s, %(description)s, %(extra)s)'] [parameters: {'extra': '{}', 'description': None, 'name': 'reader', 'id': '<redacted>'}] (Background on this error at: http://sqlalche.me/e/2j85)
2018-10-14 19:04:07.664 192 ERROR keystone \u001b[00m", "changed": true}",

Revision history for this message
Eric Miller (erickmiller) wrote :

After reviewing a bit more - the SQL error above is "expected", indicating that the role already exists. And thus the "failed" return value, in the JSON response, should be "true".

This is the first chunk of the JSON response:

<controller001> (1, '\r\n{"changed": true, "end": "2018-10-14 19:04:07.801951", "stdout": "{\\"failed\\": true, \\"msg\\": \\"2018-10-14 19:04:07.013 192 DEBUG keystone.notifications [-] Callback: `keystone.application_credential.core.Manager._delete_app_creds_on_user_delete_callback` subscribed to event `identity.user.deleted`. register_event_callback /var/lib/kolla/venv/lib/python2.7/site-packages/keystone/notifications.py:286\\u001b[00m 2018-10-14 19:04:07.014 192 DEBUG keystone.notifications [-] Callback: `keystone.application_credential.core.Manager._delete_app_creds_on_user_delete_callback` subscribed to event `identity.user.disabled`. register_event_callback /var/lib/kolla/venv/lib/python2.7/site-packages/keystone/notifications.py:286\\u001b[00m

If I only take the "stdout" value:

{\\"failed\\": true, \\"msg\\": \\"2018-10-14 19:04:07.013 192 DEBUG keystone.notifications [-] Callback: `keystone.application_credential.core.Manager._delete_app_creds_on_user_delete_callback` subscribed to event `identity.user.deleted`. register_event_callback /var/lib/kolla/venv/lib/python2.7/site-packages/keystone/notifications.py:286\\u001b[00m 2018-10-14 19:04:07.014 192 DEBUG keystone.notifications [-] Callback: `keystone.application_credential.core.Manager._delete_app_creds_on_user_delete_callback` subscribed to event `identity.user.disabled`. register_event_callback /var/lib/kolla/venv/lib/python2.7/site-packages/keystone/notifications.py:286\\u001b[00m

I have no idea what it is complaining about at Character 331 (specified in the original error as the issue). But obviously the JSON parser in Ansible is unhappy.

Eric

Revision history for this message
Eric Miller (erickmiller) wrote :

I ran the docker command directly on the host so I could get clean output and it looks like this (the first few lines out of hundreds):

{"failed": true, "msg": "2018-10-14 19:55:01.731 210 DEBUG keystone.notifications [-] Callback: `keystone.application_credential.core.Manager._delete_app_creds_on_user_delete_callback` subscribed to event `identity.user.deleted`. register_event_callback /var/lib/kolla/venv/lib/python2.7/site-packages/keystone/notifications.py:286 2018-10-14 19:55:01.732 210 DEBUG keystone.notifications [-] Callback: `keystone.application_credential.core.Manager._delete_app_creds_on_user_delete_callback` subscribed to event `identity.user.disabled`. register_event_callback /var/lib/kolla/venv/lib/python2.7/site-packages/keystone/notifications.py:286 2018-10-14 19:55:01.733 210 DEBUG keystone.notifications [-] Callback:

It seems to be pointing to the space between "286" and "2018" here: notifications.py:286 2018-10-14

Any ideas?

Eric

Revision history for this message
Ingo Ebel (savar) wrote :

Any news on that?
I'm seeing the same when trying to deploy rocky.

fatal: [toscomp02]: FAILED! => {"msg": "The conditional check '(keystone_bootstrap.stdout | from_json).changed' failed. The error was: Invalid control character at: line 1 column 101 (char 100)"}

Doug Szumski (dszumski)
Changed in kolla-ansible:
status: New → Confirmed
Revision history for this message
Doug Szumski (dszumski) wrote :

I see something very similar to this on a three node deploy, in stein and in master.

The following task fails:

```
TASK [keystone : Creating admin project, user, role, service, and endpoint] ******************************************************************************************************************************************************************
fatal: [control01]: FAILED! => {"msg": "The conditional check '(keystone_bootstrap.stdout | from_json).changed' failed. The error was: Invalid control character at: line 1 column 284 (char 283)"}
```

If I run the command manually I get the following output:

```
$ docker exec keystone kolla_keystone_bootstrap admin top_secret admin admin http://172.28.128.254:35357 http://172.28.128.254:5000 http://172.28.128.254:5000 RegionOne
{"failed": true, "msg": "2019-07-01 12:47:09.993 92 WARNING keystone.access_rules_config.backends.json [-] No config file found for access rules, application credential access rules will be unavailable.: IOError: [Errno 2] No such file or directory: '/etc/keystone/access_rules.json' 2019-07-01 12:47:10.256 92 CRITICAL keystone [-] Unhandled error: ProgrammingError: (pymysql.err.ProgrammingError) (1146, u"Table 'keystone.project' doesn't exist") [SQL: u'INSERT INTO project (id, name, domain_id, description, enabled, extra, parent_id, is_domain) VALUES (%(id)s, %(name)s, %(domain_id)s, %(description)s, %(enabled)s, %(extra)s, %(parent_id)s, %(is_domain)s)'] [parameters: {'is_domain': 1, 'description': 'The default domain', 'extra': '{}', 'enabled': 1, 'domain_id': '<<keystone.domain.root>>', 'parent_id': None, 'id': 'default', 'name': 'Default'}] (Background on this error at: http://sqlalche.me/e/f405) 2019-07-01 12:47:10.256 92 ERROR keystone Traceback (most recent call last): 2019-07-01 12:47:10.256 92 ERROR keystone File "/usr/bin/keystone-manage", line 10, in <module> 2019-07-01 12:47:10.256 92 ERROR keystone sys.exit(main()) 2019-07-01 12:47:10.256 92 ERROR keystone File "/usr/lib/python2.7/site-packages/keystone/cmd/manage.py", line 40, in main 2019-07-01 12:47:10.256 92 ERROR keystone cli.main(argv=sys.argv, developer_config_file=developer_config) 2019-07-01 12:47:10.256 92 ERROR keystone File "/usr/lib/python2.7/site-packages/keystone/cmd/cli.py", line 1358, in main 2019-07-01 12:47:10.256 92 ERROR keystone CONF.command.cmd_class.main() 2019-07-01 12:47:10.256 92 ERROR keystone File "/usr/lib/python2.7/site-packages/keystone/cmd/cli.py", line 179, in main 2019-07-01 12:47:10.256 92 ERROR keystone klass.do_bootstrap() 2019-07-01 12:47:10.256 92 ERROR keystone File "/usr/lib/python2.7/site-packages/keystone/cmd/cli.py", line 170, in do_bootstrap 2019-07-01 12:47:10.256 92 ERROR keystone self.bootstrapper.bootstrap() 2019-07-01 12:47:10.256 92 ERROR keystone File "/usr/lib/python2.7/site-packages/keystone/cmd/bootstrap.py", line 61, in bootstrap 2019-07-01 12:47:10.256 92 ERROR keystone self._bootstrap_default_domain() 2019-07-01 12:47:10.256 92 ERROR keystone File "/usr/lib/python2.7/site-packages/keystone/cmd/bootstrap.py", line 82, in _bootstrap_default_domain 2019-07-01 12:47:10.256 92 ERROR keystone domain=default_domain) 2019-07-01 12:47:10.256 92 ERROR keysto...
```

Revision history for this message
Mark Goddard (mgoddard) wrote :

The problem is this:

Table 'keystone.project' doesn't exist

For some reason the DB bootstrap has not happened. We see this kind of error fairly often, because ansible/roles/keystone/tasks/bootstrap.yml only runs the bootstrap if the DB was created during the same run.
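
Roughly, the pre-fix logic looks like this (a hedged sketch with approximate module, variable and file names, not the verbatim role code):

```
# Hedged sketch of the pre-fix logic in bootstrap.yml: the service bootstrap,
# which runs keystone-manage db_sync, is only included when the database was
# created in this run. A deploy that fails between these two steps leaves an
# empty database, and re-runs then skip the sync entirely.
- name: Creating Keystone database
  mysql_db:
    name: keystone
    state: present
  register: database

- name: Running Keystone bootstrap container
  include_tasks: bootstrap_service.yml
  when: database.changed
```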

Changed in kolla-ansible:
assignee: nobody → Mark Goddard (mgoddard)
status: Confirmed → In Progress
Mark Goddard (mgoddard)
Changed in kolla-ansible:
importance: Undecided → Medium
Revision history for this message
Doug Szumski (dszumski) wrote :

A manual workaround (truncated in the post above) is to run the db_sync for the failed service, e.g.:

$ docker exec keystone keystone-manage db_sync

Changed in kolla-ansible:
assignee: Mark Goddard (mgoddard) → Radosław Piliszek (yoctozepto)
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to kolla-ansible (master)

Reviewed: https://review.opendev.org/650962
Committed: https://git.openstack.org/cgit/openstack/kolla-ansible/commit/?id=d5e5e885d11e338806425839a361d868c1f4ff10
Submitter: Zuul
Branch: master

commit d5e5e885d11e338806425839a361d868c1f4ff10
Author: Mark Goddard <email address hidden>
Date: Mon Apr 8 17:51:07 2019 +0100

    During deploy, always sync DB

    A common class of problems goes like this:

    * kolla-ansible deploy
    * Hit a problem, often in ansible/roles/*/tasks/bootstrap.yml
    * Re-run kolla-ansible deploy
    * Service fails to start

    This happens because the DB is created during the first run, but for some
    reason we fail before performing the DB sync. This means that on the second run
    we don't include ansible/roles/*/tasks/bootstrap_service.yml because the DB
    already exists, and therefore still don't perform the DB sync. However this
    time, the command may complete without apparent error.

    We should be less careful about when we perform the DB sync, and do it whenever
    it is necessary. There is an argument for not doing the sync during a
    'reconfigure' command, although we will not change that here.

    This change always performs the DB sync during 'deploy' and
    'reconfigure' commands.

    Change-Id: I82d30f3fcf325a3fdff3c59f19a1f88055b566cc
    Closes-Bug: #1823766
    Closes-Bug: #1797814
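
In role terms, the change amounts to something like the following (a hedged sketch, not the literal diff; it assumes the kolla_action variable carries the CLI command):

```
# Hedged sketch of the fix: include the service bootstrap (and hence the DB
# sync) whenever the action is deploy or reconfigure, rather than only when
# the database was created in this run.
- name: Running Keystone bootstrap container
  include_tasks: bootstrap_service.yml
  when: kolla_action in ['deploy', 'reconfigure']
```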

Changed in kolla-ansible:
status: In Progress → Fix Released
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to kolla-ansible (stable/stein)

Fix proposed to branch: stable/stein
Review: https://review.opendev.org/670546

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to kolla-ansible (stable/rocky)

Fix proposed to branch: stable/rocky
Review: https://review.opendev.org/670548

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to kolla-ansible (stable/queens)

Fix proposed to branch: stable/queens
Review: https://review.opendev.org/670549

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to kolla-ansible (stable/stein)

Reviewed: https://review.opendev.org/670546
Committed: https://git.openstack.org/cgit/openstack/kolla-ansible/commit/?id=d62e927dacfa42822296e7a0720cb85019c08a9b
Submitter: Zuul
Branch: stable/stein

commit d62e927dacfa42822296e7a0720cb85019c08a9b
Author: Mark Goddard <email address hidden>
Date: Mon Apr 8 17:51:07 2019 +0100

    During deploy, always sync DB

    A common class of problems goes like this:

    * kolla-ansible deploy
    * Hit a problem, often in ansible/roles/*/tasks/bootstrap.yml
    * Re-run kolla-ansible deploy
    * Service fails to start

    This happens because the DB is created during the first run, but for some
    reason we fail before performing the DB sync. This means that on the second run
    we don't include ansible/roles/*/tasks/bootstrap_service.yml because the DB
    already exists, and therefore still don't perform the DB sync. However this
    time, the command may complete without apparent error.

    We should be less careful about when we perform the DB sync, and do it whenever
    it is necessary. There is an argument for not doing the sync during a
    'reconfigure' command, although we will not change that here.

    This change always performs the DB sync during 'deploy' and
    'reconfigure' commands.

    Change-Id: I82d30f3fcf325a3fdff3c59f19a1f88055b566cc
    Closes-Bug: #1823766
    Closes-Bug: #1797814
    (cherry picked from commit d5e5e885d11e338806425839a361d868c1f4ff10)

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/kolla-ansible 8.0.0.0rc2

This issue was fixed in the openstack/kolla-ansible 8.0.0.0rc2 release candidate.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to kolla-ansible (stable/queens)

Reviewed: https://review.opendev.org/670549
Committed: https://git.openstack.org/cgit/openstack/kolla-ansible/commit/?id=8e219a91738048ff16f2caed4c24f85832211aad
Submitter: Zuul
Branch: stable/queens

commit 8e219a91738048ff16f2caed4c24f85832211aad
Author: Mark Goddard <email address hidden>
Date: Mon Apr 8 17:51:07 2019 +0100

    During deploy, always sync DB

    A common class of problems goes like this:

    * kolla-ansible deploy
    * Hit a problem, often in ansible/roles/*/tasks/bootstrap.yml
    * Re-run kolla-ansible deploy
    * Service fails to start

    This happens because the DB is created during the first run, but for some
    reason we fail before performing the DB sync. This means that on the second run
    we don't include ansible/roles/*/tasks/bootstrap_service.yml because the DB
    already exists, and therefore still don't perform the DB sync. However this
    time, the command may complete without apparent error.

    We should be less careful about when we perform the DB sync, and do it whenever
    it is necessary. There is an argument for not doing the sync during a
    'reconfigure' command, although we will not change that here.

    This change always performs the DB sync during 'deploy' and
    'reconfigure' commands.

    Change-Id: I82d30f3fcf325a3fdff3c59f19a1f88055b566cc
    Closes-Bug: #1823766
    Closes-Bug: #1797814
    (cherry picked from commit d5e5e885d11e338806425839a361d868c1f4ff10)

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to kolla-ansible (stable/rocky)

Reviewed: https://review.opendev.org/670548
Committed: https://git.openstack.org/cgit/openstack/kolla-ansible/commit/?id=441c5a03f7e788336f3a7c1cb15dcec207073b07
Submitter: Zuul
Branch: stable/rocky

commit 441c5a03f7e788336f3a7c1cb15dcec207073b07
Author: Mark Goddard <email address hidden>
Date: Mon Apr 8 17:51:07 2019 +0100

    During deploy, always sync DB

    A common class of problems goes like this:

    * kolla-ansible deploy
    * Hit a problem, often in ansible/roles/*/tasks/bootstrap.yml
    * Re-run kolla-ansible deploy
    * Service fails to start

    This happens because the DB is created during the first run, but for some
    reason we fail before performing the DB sync. This means that on the second run
    we don't include ansible/roles/*/tasks/bootstrap_service.yml because the DB
    already exists, and therefore still don't perform the DB sync. However this
    time, the command may complete without apparent error.

    We should be less careful about when we perform the DB sync, and do it whenever
    it is necessary. There is an argument for not doing the sync during a
    'reconfigure' command, although we will not change that here.

    This change always performs the DB sync during 'deploy' and
    'reconfigure' commands.

    Change-Id: I82d30f3fcf325a3fdff3c59f19a1f88055b566cc
    Closes-Bug: #1823766
    Closes-Bug: #1797814
    (cherry picked from commit d5e5e885d11e338806425839a361d868c1f4ff10)

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/kolla-ansible 6.2.2

This issue was fixed in the openstack/kolla-ansible 6.2.2 release.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/kolla-ansible 7.1.2

This issue was fixed in the openstack/kolla-ansible 7.1.2 release.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/kolla-ansible 9.0.0.0rc1

This issue was fixed in the openstack/kolla-ansible 9.0.0.0rc1 release candidate.

Revision history for this message
Rowan Potgieter (rowan-potgieter) wrote :

Hi, we have hit this issue using kolla-ansible 8.0.1 (installed from pip) as well as the `stable/stein` branch when deploying using the git repository.

From what we are seeing the error does not appear to be database related at all. There is actually a "NULL" value in the output of this command:

docker exec keystone kolla_keystone_bootstrap {{ openstack_auth.username }} {{ openstack_auth.password }} {{ openstack_auth.project_name }} admin {{ keystone_admin_url }} {{ keystone_internal_url }} {{ keystone_public_url }} {{ item }}

I ran the command manually and saved the result in file.txt. I then cat the file showing non-printable characters:
   $ cat -vE file.txt
   {"failed": true, "msg": "2019-11-25 12:57:43.952 541 WARNING keystone.access_rules_config.backends.json [-] No config file found for access rules, application credential access rules will be unavailable.: FileNotFoundError: [Errno 2] No such file or directory: '/etc/keystone/access_rules.json'^[[00m /etc/keystone/fernet-keys/ does not exist", "changed": true}$

I believe the ^[[00m in the "msg" field is causing the issue. If I pipe the response to jq I receive similar errors:
   parse error: Invalid string: control characters from U+0000 through U+001F must be escaped at line 1, column 342

I think it is worth investigating why "docker exec keystone kolla_keystone_bootstrap" outputs a NULL character.

Revision history for this message
Rowan Potgieter (rowan-potgieter) wrote :

Sorry - minor correction to my previous statement, it's an escape character (^[) not a NULL character.
I tested piping the same file.txt through a tr command to remove non-printable characters:

# tr -cd "[:print:]\n" < file.txt
2019-11-25 13:53:52.264 619 WARNING keystone.access_rules_config.backends.json [-] No config file found for access rules, application credential access rules will be unavailable.: FileNotFoundError: [Errno 2] No such file or directory: '/etc/keystone/access_rules.json'[00m
/etc/keystone/fernet-keys/ does not exist
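
For what it's worth, the parse failure is easy to reproduce in isolation (a minimal sketch, not taken from kolla-ansible): from_json wraps Python's json.loads, which rejects unescaped control characters such as the ESC byte (0x1b) that the colour reset sequence starts with.

```
# Minimal reproduction sketch (not kolla-ansible code): from_json refuses
# unescaped control characters, so a raw ESC byte (0x1b) inside the JSON
# string triggers "Invalid control character at ...".
- hosts: localhost
  gather_facts: false
  tasks:
    - name: Clean JSON parses fine
      debug:
        msg: "{{ '{\"failed\": true, \"changed\": true}' | from_json }}"

    - name: JSON carrying a raw ESC byte fails to parse
      debug:
        msg: "{{ ('{\"msg\": \"reset%c[00m\", \"changed\": true}' | format(27)) | from_json }}"
      ignore_errors: true
```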

Revision history for this message
Mark Goddard (mgoddard) wrote :

That's good information, Rowan - we do see some odd failures during this task. I did a little googling and ^[[00m is an escape sequence for terminal colour control that should reset the attributes to normal. Perhaps we need to remove the control codes from the output?
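
Something along these lines might work (a hedged sketch, not what the role currently does), stripping ANSI colour sequences from the registered stdout before parsing it:

```
# Hedged sketch (not the current role code): strip ANSI colour sequences
# (ESC [ ... m) from the bootstrap stdout before handing it to from_json.
- name: Creating admin project, user, role, service, and endpoint
  command: >
    docker exec keystone kolla_keystone_bootstrap
    {{ openstack_auth.username }} {{ openstack_auth.password }}
    {{ openstack_auth.project_name }} admin
    {{ keystone_admin_url }} {{ keystone_internal_url }}
    {{ keystone_public_url }} RegionOne
  register: keystone_bootstrap
  changed_when: >-
    (keystone_bootstrap.stdout
     | regex_replace('\x1b\[[0-9;]*m', '')
     | from_json).changed
  failed_when: >-
    (keystone_bootstrap.stdout
     | regex_replace('\x1b\[[0-9;]*m', '')
     | from_json).failed
```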

There is also the underlying issue, which is that /etc/keystone/fernet-keys/ does not exist.

Revision history for this message
Rowan Potgieter (rowan-potgieter) wrote :

Hi Mark, I would recommend removing the control codes; otherwise parsing will break any time you pipe stdout or stderr to from_json to get the changed status.

I never managed to figure out the issue with the missing fernet-keys, in the end we decided to perform a fresh kolla-ansible deploy since I suspect we broke something.

Revision history for this message
Mark Goddard (mgoddard) wrote :

Raised a separate bug for the control code issue: https://bugs.launchpad.net/kolla-ansible/+bug/1855701

Mark Goddard (mgoddard)
Changed in kolla-ansible:
status: Fix Committed → Fix Released