archive policy still in use (random failure)
Affects | Status | Importance | Assigned to | Milestone |
---|---|---|---|---|
Gnocchi | Fix Released | High | gordon chung | |
Bug Description
For some unknown reason, deleting the gabbilive archive policy sometimes fails with "still in use" (seen with both the file and ceph drivers). All resources with metrics using the policy should have been deleted by then, and pifpaf says it's expunging deleted metrics every 1s, so this shouldn't happen. It could mean:
1) gnocchi isn't expunging (can't verify because pifpaf doesn't log enough)
2) the metric is not marked for deletion
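A rough way to tell the two hypotheses apart is to compare the listing of active metrics with the listing of deleted metrics and see which one still references the policy. The sketch below only assumes that each metric is a dict with an embedded "archive_policy" object carrying a "name", as in the captured response further down; the function names and the returned strings are made up for illustration and are not part of gnocchi.

```python
def metrics_using_policy(metrics, policy_name):
    """Return the metrics whose embedded archive policy is `policy_name`."""
    matches = []
    for metric in metrics:
        policy = metric.get("archive_policy")
        # Only the embedded-object form seen in the captured log is handled.
        if isinstance(policy, dict) and policy.get("name") == policy_name:
            matches.append(metric)
    return matches


def diagnose(active_metrics, deleted_metrics, policy_name):
    """Map the two metric listings onto the hypotheses in the description."""
    if metrics_using_policy(active_metrics, policy_name):
        # Hypothesis 2: a live metric still points at the policy.
        return "metric not marked for deletion"
    if metrics_using_policy(deleted_metrics, policy_name):
        # Hypothesis 1: marked for deletion, but not expunged yet.
        return "metric marked for deletion but not expunged"
    return "no metric references the policy"


if __name__ == "__main__":
    deleted = [{"name": "vcpus", "archive_policy": {"name": "gabbilive"}}]
    print(diagnose([], deleted, "gabbilive"))
```

An empty active listing combined with a non-empty deleted listing would point at the expunge step rather than at the delete call.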
2017-04-11 18:17:12.866864 | gabbi.suitemake
2017-04-11 18:17:12.866912 | -------
2017-04-11 18:17:12.866927 |
2017-04-11 18:17:12.866945 | Captured traceback:
2017-04-11 18:17:12.866964 | ~~~~~~~~~~~~~~~~~~~
2017-04-11 18:17:12.866986 | Traceback (most recent call last):
2017-04-11 18:17:12.867043 | File "/home/
2017-04-11 18:17:12.867062 | func(self)
2017-04-11 18:17:12.867133 | File "/home/
2017-04-11 18:17:12.867157 | self._run_test()
2017-04-11 18:17:12.867215 | File "/home/
2017-04-11 18:17:12.867252 | raise failure
2017-04-11 18:17:12.867308 | AssertionError: '400' not found in ['204'], response:
2017-04-11 18:17:12.867337 | 400 Bad Request
2017-04-11 18:17:12.867352 |
2017-04-11 18:17:12.867392 | The server could not comply with the request since it is either malformed or otherwise incorrect.
2017-04-11 18:17:12.867407 |
2017-04-11 18:17:12.867432 | Archive policy gabbilive is still in use
2017-04-11 18:17:12.867446 |
2017-04-11 18:17:12.867458 |
2017-04-11 18:17:12.867490 | gabbi.suitemake
2017-04-11 18:17:12.867523 | -------
2017-04-11 18:17:12.867535 |
2017-04-11 18:17:12.867552 | Captured traceback:
2017-04-11 18:17:12.867570 | ~~~~~~~~~~~~~~~~~~~
2017-04-11 18:17:12.867592 | Traceback (most recent call last):
2017-04-11 18:17:12.867647 | File "/home/
2017-04-11 18:17:12.867665 | func(self)
2017-04-11 18:17:12.867722 | File "/home/
2017-04-11 18:17:12.867741 | self._run_test()
2017-04-11 18:17:12.867797 | File "/home/
2017-04-11 18:17:12.867818 | self._assert_
2017-04-11 18:17:12.867876 | File "/home/
2017-04-11 18:17:12.867909 | self._test_
2017-04-11 18:17:12.867966 | File "/home/
2017-04-11 18:17:12.867995 | self.assert_
2017-04-11 18:17:12.868055 | File "/home/
2017-04-11 18:17:12.868075 | self.fail(msg)
2017-04-11 18:17:12.868130 | File "/home/
2017-04-11 18:17:12.868153 | raise self.failureExc
2017-04-11 18:17:12.868181 | AssertionError: '200' not found in ['404'], response:
2017-04-11 18:17:12.868195 | {
2017-04-11 18:17:12.868214 | "name": "gabbilive",
2017-04-11 18:17:12.868234 | "aggregation_
2017-04-11 18:17:12.868251 | "mean",
2017-04-11 18:17:12.868268 | "max",
2017-04-11 18:17:12.868301 | "min"
2017-04-11 18:17:12.868329 | ],
2017-04-11 18:17:12.868351 | "definition": [
2017-04-11 18:17:12.868366 | {
2017-04-11 18:17:12.868387 | "timespan": "0:01:00",
2017-04-11 18:17:12.868409 | "granularity": "0:00:01",
2017-04-11 18:17:12.868427 | "points": 60
2017-04-11 18:17:12.868442 | },
2017-04-11 18:17:12.868457 | {
2017-04-11 18:17:12.868478 | "timespan": "0:01:00",
2017-04-11 18:17:12.868499 | "granularity": "0:00:02",
2017-04-11 18:17:12.868517 | "points": 30
2017-04-11 18:17:12.868532 | },
2017-04-11 18:17:12.868547 | {
2017-04-11 18:17:12.868567 | "timespan": "0:05:00",
2017-04-11 18:17:12.868589 | "granularity": "0:01:00",
2017-04-11 18:17:12.868618 | "points": 5
2017-04-11 18:17:12.868634 | }
2017-04-11 18:17:12.868649 | ],
2017-04-11 18:17:12.868667 | "back_window": 0
2017-04-11 18:17:12.868680 | }
Changed in gnocchi: | |
importance: | Undecided → High |
status: | New → Triaged |
status: | Triaged → New |
importance: | High → Medium |
importance: | Medium → Undecided |
Changed in gnocchi: | |
status: | New → Triaged |
summary: |
- archive policy still in use (random failture)
+ archive policy still in use (random failure) |
Changed in gnocchi: | |
importance: | Undecided → High |
Changed in gnocchi: | |
assignee: | nobody → gordon chung (chungg) |
status: | Triaged → In Progress |
Something is wrong with the cleanup process, but I can't tell what because of pifpaf. It seems the deleted metric is not cleaned up, so the archive policy can't be deleted. Patch here: https://review.openstack.org/#/c/456413/3
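If the failure is only a lag between marking a metric as deleted and expunging it, a test could tolerate it by retrying the policy deletion for a bounded time. This is a minimal sketch of that idea, not the change in the linked review: `delete_policy` is a hypothetical callable that performs the DELETE and returns the HTTP status code, and the attempt count and delay are arbitrary.

```python
import time


def delete_with_retry(delete_policy, attempts=10, delay=1.0, sleep=time.sleep):
    """Call `delete_policy()` until it stops returning 400 or attempts run out.

    `delete_policy` is a hypothetical zero-argument callable returning an
    HTTP status code (204 on success, 400 while the policy is still in use).
    """
    status = None
    for attempt in range(attempts):
        status = delete_policy()
        if status != 400:
            return status
        # Wait for the expunge cycle before trying again, except after the
        # final attempt.
        if attempt < attempts - 1:
            sleep(delay)
    return status


if __name__ == "__main__":
    # Simulated server: still in use twice, then the delete goes through.
    responses = iter([400, 400, 204])
    print(delete_with_retry(lambda: next(responses), delay=0.0))
```

A retry like this would only hide the symptom; if the metric is never expunged, the call still ends with 400 after the last attempt.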
State after the 'delete archive policy' attempt:
2017-04-21 17:14:08.958924 | #### get all metrics to verify ####
2017-04-21 17:14:08.964021 | > GET http://
2017-04-21 17:14:08.966243 | > x-auth-token:
2017-04-21 17:14:08.989412 | > authorization: basic YWRtaW46
2017-04-21 17:14:08.995254 | > user-agent: gabbi/1.33.0 (Python urllib3)
2017-04-21 17:14:08.995377 |
2017-04-21 17:14:08.996602 | < 200 OK
2017-04-21 17:14:09.001726 | < Date: Fri, 21 Apr 2017 17:14:08 GMT
2017-04-21 17:14:09.007103 | < Server: WSGIServer/0.2 CPython/3.5.2
2017-04-21 17:14:09.009870 | < Content-Length: 2
2017-04-21 17:14:09.028967 | < Content-Type: application/json
2017-04-21 17:14:09.029094 |
2017-04-21 17:14:09.029363 | []
2017-04-21 17:14:09.029479 |
2017-04-21 17:14:09.054252 | #### get all deleted metrics to verify ####
2017-04-21 17:14:09.064531 | > GET http://
2017-04-21 17:14:09.066197 | > x-auth-token:
2017-04-21 17:14:09.069386 | > authorization: basic YWRtaW46
2017-04-21 17:14:09.073763 | > user-agent: gabbi/1.33.0 (Python urllib3)
2017-04-21 17:14:09.073883 |
2017-04-21 17:14:09.074737 | < 200 OK
2017-04-21 17:14:09.078506 | < Date: Fri, 21 Apr 2017 17:14:08 GMT
2017-04-21 17:14:09.082348 | < Server: WSGIServer/0.2 CPython/3.5.2
2017-04-21 17:14:09.084569 | < Content-Length: 496
2017-04-21 17:14:09.087811 | < Content-Type: application/json
2017-04-21 17:14:09.087931 |
2017-04-21 17:14:09.088123 | [
2017-04-21 17:14:09.088533 | {
2017-04-21 17:14:09.091966 | "created_
2017-04-21 17:14:09.094340 | "creator": "admin",
2017-04-21 17:14:09.096846 | "resource_id": null,
2017-04-21 17:14:09.098885 | "name": "vcpus",
2017-04-21 17:14:09.102184 | "created_
2017-04-21 17:14:09.104037 | "unit": null,
2017-04-21 17:14:09.115264 | "id": "e9f190a7-
2017-04-21 17:14:09.121779 | "archive_policy": {
2017-04-21 17:14:09.129021 | "name": "gabbilive",
2017-04-21 17:14:09.134894 | "definition": [
2017-04-21 17:14:09.137582 | {
2017-04-21 17:14:09.147791 | "timespan": "0:01:00",
2017-04-21 17:14:09.150210 | "points": 60,
2017-04-21 17:14:09.153689 | "granularity": "0:00:01"
2017-04-21 17:14:09.154780 | },
2017-04-21 17:14:09.155769 | {
2017-04-21 17:14:09.159013 | "timespan": "0:01:00",
2017-04-21 17:14:09.161415 | "points": 30,
2017-04-21 17:14:09.164898 | "granularity": "0:00:02"
2017-04-21 17:14:09.165949 | },
2017-04-21 17:14:09.166924 | {
2017-04-21 17:14:09.175654 | "timespan": "0:05:00",
2017-04-21 17:14:09.178018 | "points": 5,
2017-04-21 17:14:09.181525 | "granularity": "0:01:00"
2017-04-21 17:14:09.182506 | }
2017-04-21 17:14:09.183385 | ],
2017-04-21 17:14:09.186458 | "aggregation_
2017-04-21 17:14:09.188172 | "max",
2017-0...