Placement duplicate aggregate uuid handling during concurrent aggregate create insufficiently robust

Bug #1786703 reported by Chris Dent on 2018-08-12
This bug affects 1 person
Affects: OpenStack Compute (nova)
Importance: Medium
Assigned to: Jay Pipes

Bug Description

NOTE: This may be just a postgresql problem, not sure.

When doing some further experiments with load testing placement, my resource provider create script, which uses asyncio, was able to cause several 500 errors from the placement service of the following form:

```
cdent-a01:~/src/placeload(master) $ docker logs zen_murdock |grep 'req-d4dcbfed-b050-4a3b-ab0f-d2489a31c3f2'
2018-08-12 16:03:30.698 9 DEBUG nova.api.openstack.placement.requestlog [req-d4dcbfed-b050-4a3b-ab0f-d2489a31c3f2 admin admin - - -] Starting request: 172.17.0.1 "PUT /resource_providers/13b09bc9-164f-4d03-8a61-5e78c05a73ad/aggregates" __call__ /usr/lib/python3.6/site-packages/nova/api/openstack/placement/requestlog.py:38
2018-08-12 16:03:30.903 9 ERROR nova.api.openstack.placement.fault_wrap [req-d4dcbfed-b050-4a3b-ab0f-d2489a31c3f2 admin admin - - -] Placement API unexpected error: This Session's transaction has been rolled back due to a previous exception during flush. To begin a new transaction with this Session, first issue Session.rollback(). Original exception was: (psycopg2.IntegrityError) duplicate key value violates unique constraint "uniq_placement_aggregates0uuid"
2018-08-12 16:03:30.914 9 INFO nova.api.openstack.placement.requestlog [req-d4dcbfed-b050-4a3b-ab0f-d2489a31c3f2 admin admin - - -] 172.17.0.1 "PUT /resource_providers/13b09bc9-164f-4d03-8a61-5e78c05a73ad/aggregates" status: 500 len: 997 microversion: 1.29
```

"DETAIL: Key (uuid)=(14a5c8a3-5a99-4e8f-88be-00d85fcb1c17) already exists."

This is because the code at https://github.com/openstack/nova/blob/a29ace1d48b5473b9e7b5decdf3d5d19f3d262f3/nova/api/openstack/placement/objects/resource_provider.py#L519-L529 is not trapping the right error when the server thinks it needs to create a new aggregate that a concurrent request is already in the middle of creating.

It's not clear to me whether this is because oslo_db is not translating the postgresql error properly, or whether the generic error trapped there is the wrong one and we've never noticed before because we haven't hit the concurrency situation hard enough.

Matt Riedemann (mriedem) wrote:

Which version of postgresql are you using? I'm guessing 8.x?

This is the oslo.db code that is meant to translate the IntegrityError:

http://git.openstack.org/cgit/openstack/oslo.db/tree/oslo_db/sqlalchemy/exc_filters.py#n104
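Roughly, the filters there pair a regex for each backend's error message with a translation to a backend-agnostic exception. A simplified illustration (not the actual oslo.db code), covering only the PostgreSQL message from the log above:

```python
import re

class DBDuplicateEntry(Exception):
    """Simplified stand-in for oslo_db.exception.DBDuplicateEntry."""
    def __init__(self, columns):
        super().__init__(columns)
        self.columns = columns

# Matches PostgreSQL's duplicate-key message, e.g.:
#   duplicate key value violates unique constraint
#   "uniq_placement_aggregates0uuid"
_PG_DUP = re.compile(
    r'duplicate key value violates unique constraint "(?P<name>[^"]+)"')

def translate(message):
    # If a filter matches, re-raise the backend-agnostic exception so
    # callers can catch DBDuplicateEntry instead of a raw psycopg2
    # IntegrityError; otherwise fall through to a generic error.
    m = _PG_DUP.search(message)
    if m:
        raise DBDuplicateEntry(columns=[m.group('name')])
    raise RuntimeError(message)
```

If the raw `IntegrityError` never reaches these filters (or reaches them already wrapped), the caller's `except DBDuplicateEntry` clause will not fire, which matches the behavior in the report.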

Matt Riedemann (mriedem) wrote:

Sorry I meant 9.x.

Chris Dent (cdent) wrote:

10.4; I'm using bionic (in this case).

Jay Pipes (jaypipes) wrote:

Seems that this might be an issue where whatever is wrapping the transaction rollback is not re-raising the underlying exception properly, so oslo_db.exception.DBDuplicateEntry never gets raised.

Chris Dent (cdent) wrote:

When using MySQL, the same high concurrency results in a different problem (pasted from http://logs.openstack.org/67/591367/4/check/nova-next/f8892df/logs/screen-placement-api.txt.gz#_Aug_13_23_11_22_561724 ):

```
Aug 13 23:11:22.553969 ubuntu-xenial-inap-mtl01-0001304255 <email address hidden>[628]: DEBUG nova.api.openstack.placement.requestlog [None req-b524d09f-0bae-4855-9e3b-776abe7c6c2f None None] Starting request: 198.72.124.103 "PUT /placement/resource_providers/381ee627-4cc6-4dee-88a1-4f442de6e553/aggregates" {{(pid=634) __call__ /opt/stack/new/nova/nova/api/openstack/placement/requestlog.py:38}}
Aug 13 23:11:22.561724 ubuntu-xenial-inap-mtl01-0001304255 <email address hidden>[628]: ERROR nova.api.openstack.placement.fault_wrap [None req-216bb3ea-5d1e-40af-a7f6-d8f27dbfbc28 None None] Placement API unexpected error: (pymysql.err.OperationalError) (1213, u'Deadlock found when trying to get lock; try restarting transaction') [SQL: u'INSERT INTO resource_provider_aggregates (resource_provider_id, aggregate_id, created_at) SELECT 93, placement_aggregates.id, %(created_at)s AS anon_1 \nFROM placement_aggregates \nWHERE placement_aggregates.uuid IN (%(uuid_1)s, %(uuid_2)s)'] [parameters: {u'uuid_2': u'66d98e7c-3c25-485d-a0dc-1cea651884de', 'created_at': datetime.datetime(2018, 8, 13, 23, 11, 22, 548464), u'uuid_1': u'14a5c8a3-5a99-4e8f-88be-00d85fcb1c17'}] (Background on this error at: http://sqlalche.me/e/e3q8): DBDeadlock: (pymysql.err.OperationalError) (1213, u'Deadlock found when trying to get lock; try restarting transaction') [SQL: u'INSERT INTO resource_provider_aggregates (resource_provider_id, aggregate_id, created_at) SELECT 93, placement_aggregates.id, %(created_at)s AS anon_1 \nFROM placement_aggregates \nWHERE placement_aggregates.uuid IN (%(uuid_1)s, %(uuid_2)s)'] [parameters: {u'uuid_2': u'66d98e7c-3c25-485d-a0dc-1cea651884de', 'created_at': datetime.datetime(2018, 8, 13, 23, 11, 22, 548464), u'uuid_1': u'14a5c8a3-5a99-4e8f-88be-00d85fcb1c17'}] (Background on this error at: http://sqlalche.me/e/e3q8)
Aug 13 23:11:22.562429 ubuntu-xenial-inap-mtl01-0001304255 <email address hidden>[628]: ERROR nova.api.openstack.placement.fault_wrap Traceback (most recent call last):
Aug 13 23:11:22.562605 ubuntu-xenial-inap-mtl01-0001304255 <email address hidden>[628]: ERROR nova.api.openstack.placement.fault_wrap File "/opt/stack/new/nova/nova/api/openstack/placement/fault_wrap.py", line 40, in __call__
Aug 13 23:11:22.562773 ubuntu-xenial-inap-mtl01-0001304255 <email address hidden>[628]: ERROR nova.api.openstack.placement.fault_wrap return self.application(environ, start_response)
Aug 13 23:11:22.562959 ubuntu-xenial-inap-mtl01-0001304255 <email address hidden>[628]: ERROR nova.api.openstack.placement.fault_wrap File "/usr/local/lib/python2.7/dist-packages/webob/dec.py", line 129, in __call__
Aug 13 23:11:22.563123 ubuntu-xenial-inap-mtl01-0001304255 <email address hidden>[628]: ERROR nova.api.openstack.placement.fault_wrap resp = self.call_func(req, *args, **kw)
Aug 13 23:11:22.563297 ubuntu-xenial-inap-mtl01-0001304255 <email address hidden>[628]: ERROR nova.api....
```

Jay Pipes (jaypipes) wrote:

The second, MySQL problem is a different one. We might be able to fix it by converting the code to execute multiple single-row INSERT statements instead of a single INSERT ... SELECT.
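An illustrative sketch of that suggestion (using SQLite in place of MySQL, so this is not the actual nova patch): the table and column names mirror the log output, and the aggregate's internal id is assumed to be already known, so there is no join against placement_aggregates for InnoDB to lock.

```python
import sqlite3

# Stand-in for the real database; schema mirrors the statement in the
# deadlock log, minus created_at.
conn = sqlite3.connect(':memory:')
conn.execute(
    'CREATE TABLE resource_provider_aggregates ('
    ' resource_provider_id INTEGER,'
    ' aggregate_id INTEGER)')

def associate(conn, rp_id, agg_ids):
    # One short INSERT ... VALUES per aggregate instead of a single
    # INSERT ... SELECT that reads placement_aggregates inside the same
    # statement, which is where the deadlock was reported.
    for agg_id in agg_ids:
        conn.execute(
            'INSERT INTO resource_provider_aggregates'
            ' (resource_provider_id, aggregate_id) VALUES (?, ?)',
            (rp_id, agg_id))

# The provider id and aggregate count match the log above.
associate(conn, 93, [1, 2])
```

Each single-row INSERT touches only the row it writes, rather than also holding locks on the rows the SELECT half reads.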

Fix proposed to branch: master
Review: https://review.openstack.org/591609

Changed in nova:
assignee: nobody → Jay Pipes (jaypipes)
status: New → In Progress
Changed in nova:
assignee: Jay Pipes (jaypipes) → Chris Dent (cdent)

Fix proposed to branch: master
Review: https://review.openstack.org/592654

Changed in nova:
assignee: Chris Dent (cdent) → Jay Pipes (jaypipes)

Change abandoned by Jay Pipes (<email address hidden>) on branch: master
Review: https://review.openstack.org/591609
Reason: didn't work, clearly...

Reviewed: https://review.openstack.org/592654
Committed: https://git.openstack.org/cgit/openstack/nova/commit/?id=2d7ed309ec4e656ce9d6f21f03ea158278f2526d
Submitter: Zuul
Branch: master

commit 2d7ed309ec4e656ce9d6f21f03ea158278f2526d
Author: Jay Pipes <email address hidden>
Date: Thu Aug 16 14:56:47 2018 -0400

    placement: use single-shot INSERT/DELETE agg

    When replacing a provider's set of aggregate associations, we were
    issuing a call to:

     DELETE resource_provider_aggregates WHERE resource_provider_id = $rp

    and then a single call to:

     INSERT INTO resource_provider_aggregates
     SELECT $rp, aggs.id
     FROM provider_aggregates AS aggs
     WHERE aggs.uuid IN ($agg_uuids)

    This patch changes the _set_aggregates() function in a few ways.
    First, we grab the aggregate's internal ID value when creating new
    aggregate records (or grabbing a provider's existing aggregate
    associations). This eliminates the need for any join to
    provider_aggregates in an INSERT/DELETE statement.

    Second, instead of a multi-row INSERT .. SELECT statement, we do
    single-shot INSERT ... VALUES statements, one for each added aggregate.

    Third, we no longer DELETE all aggregate associations for the provider
    in question. Instead, we issue single-shot DELETE statements for only
    the aggregates that are being disassociated.

    Finally, I've added a number of log debug statements so that we can have
    a little more information if this particular patch does not fix the
    deadlock issue described in the associated bug.

    Change-Id: I87e765305017eae1424005f7d6f419f42a2f8370
    Closes-bug: #1786703
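The set arithmetic the commit describes can be sketched as follows; the function name is illustrative, not the actual `_set_aggregates` code:

```python
def plan_aggregate_changes(existing_ids, wanted_ids):
    # Compare the provider's current aggregate associations against the
    # requested set. Each id in to_add becomes one single-shot
    # INSERT ... VALUES; each id in to_remove becomes one targeted
    # DELETE, replacing the old blanket DELETE + INSERT ... SELECT.
    existing, wanted = set(existing_ids), set(wanted_ids)
    to_add = wanted - existing
    to_remove = existing - wanted
    return sorted(to_add), sorted(to_remove)
```

Aggregates present in both sets are left untouched, so an unchanged association generates no SQL at all.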

Changed in nova:
status: In Progress → Fix Released

Reviewed: https://review.openstack.org/597486
Committed: https://git.openstack.org/cgit/openstack/nova/commit/?id=757983a4cfe3107ea6ffd0b416790ae23d91ef2e
Submitter: Zuul
Branch: master

commit 757983a4cfe3107ea6ffd0b416790ae23d91ef2e
Author: Chris Dent <email address hidden>
Date: Wed Aug 29 13:36:16 2018 +0100

    [placement] Make _ensure_aggregate context not independent

    The use of the independent context on _ensure_aggregate appears to
    be unnecessary. It causes file-based uses of SQLite dbs to fail
    (with database locked errors, as reported in the associated bug,
    1789633) and thus may mask issues with other databases. Adding the
    independent context manager was the result of a series of "throw
    stuff at the wall and see what sticks" patches, but it looks now
    that it is not required, and in some situations causes problems.

    Runs through the gate show that the behavior it was fixing (as
    described in bug 1786703) is not happening.

    Change-Id: I1f325d55ec256db34a4c3bbd230dcd8a91bce542
    Related-Bug: #1786703
    Closes-Bug: #1789633

This issue was fixed in the openstack/nova 19.0.0.0rc1 release candidate.
