failing metrics cause 500 error
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
MAAS |
Fix Released
|
High
|
Alexsander de Souza | ||
3.3 |
Fix Released
|
High
|
Alexsander de Souza |
Bug Description
MAAS 3.2.6
Long running MAAS with tens of deployments daily, today every juju deploy and and also cleanup would result in error containing `Exception: 'i' format requires -2147483648 <= number <= 2147483647`
Here is the first occurrence
2022-11-10 06:21:27 maasserver: [error] #######
2022-11-10 06:21:27 maasserver: [error] Traceback (most recent call last):
File "/snap/
raise exc from exc.__cause__
File "/snap/
response = django_
File "/snap/
response = self._middlewar
File "/snap/
self.
File "/snap/
self.
File "/snap/
metric = metric.
File "/snap/
self.
File "/snap/
super(
File "/snap/
self.
File "/snap/
self.
File "/snap/
self.__reset()
File "/snap/
self._value = self._file.
File "/snap/
self.
File "/snap/
_pack_
File "/snap/
data[pos:pos + 4] = _pack_integer_
struct.error: 'i' format requires -2147483648 <= number <= 2147483647
2022-11-10 06:21:27 regiond: [info] 127.0.0.1 POST /MAAS/api/
After discussion with MAAS team it was determined that metrics DB is probably corrupted and service restart should help.
I will report if restart helped but this failure should be handled more gracefully and not break MAAS.
Related branches
- Alexsander de Souza: Approve
- MAAS Lander: Approve
-
Diff: 37 lines (+12/-3)1 file modifiedsrc/maasserver/prometheus/middleware.py (+12/-3)
- Christian Grabowski: Approve
- MAAS Lander: Approve
-
Diff: 37 lines (+12/-3)1 file modifiedsrc/maasserver/prometheus/middleware.py (+12/-3)
Changed in maas: | |
importance: | Undecided → High |
status: | New → Triaged |
milestone: | none → 3.4.0 |
Changed in maas: | |
status: | Triaged → In Progress |
Changed in maas: | |
status: | In Progress → New |
Changed in maas: | |
status: | New → In Progress |
Changed in maas: | |
status: | In Progress → Fix Committed |
Changed in maas: | |
milestone: | 3.4.0 → 3.4.0-beta3 |
Changed in maas: | |
status: | Fix Committed → Fix Released |
Just an update - problem is gone after a restart