Cluster disconnecting periodically - [ClusterClient,client] Failed to refresh power state
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
MAAS | Fix Released | Critical | Gavin Panella |
Bug Description
We're seeing the cluster disconnect periodically.
From regiond.log:
ubuntu@
2015-07-16 22:03:46 [maasserver] ERROR: Unable to get RPC connection for cluster 'OIL Cluster' (037c960b-
2015-07-16 22:07:14 [maasserver] ERROR: Unable to get RPC connection for cluster 'OIL Cluster' (037c960b-
2015-07-16 22:07:16 [maasserver] ERROR: Unable to get RPC connection for cluster 'OIL Cluster' (037c960b-
2015-07-16 22:07:18 [maasserver] ERROR: Unable to get RPC connection for cluster 'OIL Cluster' (037c960b-
2015-07-16 22:07:20 [maasserver] ERROR: Unable to get RPC connection for cluster 'OIL Cluster' (037c960b-
... (repeats multiple times)
2015-07-16 22:19:20 [maasserver] ERROR: Unable to get RPC connection for cluster 'OIL Cluster' (037c960b-
2015-07-16 22:19:20 [maasserver] ERROR: Unable to get RPC connection for cluster 'OIL Cluster' (037c960b-
2015-07-16 22:19:20 [maasserver] ERROR: Unable to get RPC connection for cluster 'OIL Cluster' (037c960b-
2015-07-16 22:19:20 [maasserver] ERROR: Unable to get RPC connection for cluster 'OIL Cluster' (037c960b-
2015-07-16 22:19:20 [maasserver] ERROR: Unable to get RPC connection for cluster 'OIL Cluster' (037c960b-
2015-07-16 22:19:20 [maasserver] ERROR: Unable to get RPC connection for cluster 'OIL Cluster' (037c960b-
2015-07-16 22:19:20 [maasserver] ERROR: Unable to get RPC connection for cluster 'OIL Cluster' (037c960b-
2015-07-16 22:19:20 [maasserver] ERROR: Unable to get RPC connection for cluster 'OIL Cluster' (037c960b-
From clusterd.log:
2015-07-16 22:03:27+0000 [-] donphan: Power could not be turned off; timed out.
2015-07-16 22:03:45+0000 [ClusterClient,
2015-07-16 22:03:45+0000 [ClusterClient,
Traceback (most recent call last):
Failure: twisted.
2015-07-16 22:03:45+0000 [ClusterClient,
Traceback (most recent call last):
Failure: twisted.
2015-07-16 22:03:45+0000 [ClusterClient,
Traceback (most recent call last):
Failure: twisted.
2015-07-16 22:03:45+0000 [ClusterClient,
Traceback (most recent call last):
Failure: twisted.
2015-07-16 22:03:45+0000 [ClusterClient,
Traceback (most recent call last):
Failure: twisted.
2015-07-16 22:03:45+0000 [ClusterClient,
Traceback (most recent call last):
Failure: twisted.
2015-07-16 22:03:45+0000 [ClusterClient,
Traceback (most recent call last):
Failure: twisted.
2015-07-16 22:03:45+0000 [ClusterClient,
Traceback (most recent call last):
Failure: twisted.
2015-07-16 22:03:45+0000 [ClusterClient,
Traceback (most recent call last):
Failure: twisted.
2015-07-16 22:03:45+0000 [ClusterClient,
Traceback (most recent call last):
Failure: twisted.
2015-07-16 22:03:45+0000 [ClusterClient,
Traceback (most recent call last):
Failure: twisted.
Traceback (most recent call last):
Failure: twisted.
2015-07-16 22:03:53+0000 [TFTP (UDP)] Datagram received from ('10.245.0.152', 1024): <RRQDatagram(
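To quantify how long each disconnect lasts, the "Unable to get RPC connection" errors in the regiond.log excerpt above could be tallied per minute. The following is only a rough sketch: the function name `count_rpc_errors_per_minute` and the regular expression are assumptions based on the line format shown in this report, not part of MAAS.

```python
import re
from collections import Counter

# Assumed line format, taken from the regiond.log excerpt in this report:
#   2015-07-16 22:03:46 [maasserver] ERROR: Unable to get RPC connection for cluster ...
RPC_ERROR = re.compile(
    r"^(?P<minute>\d{4}-\d{2}-\d{2} \d{2}:\d{2}):\d{2} "
    r"\[maasserver\] ERROR: Unable to get RPC connection for cluster"
)


def count_rpc_errors_per_minute(lines):
    """Map 'YYYY-MM-DD HH:MM' to the number of RPC connection errors logged."""
    counts = Counter()
    for line in lines:
        match = RPC_ERROR.match(line)
        if match:
            counts[match.group("minute")] += 1
    return counts
```

Passing an open regiond.log file object (or any iterable of lines) to the function returns the per-minute counts. Sorting the result by key would show when each burst of errors starts and stops, which can then be lined up against the power-state failures in clusterd.log.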
Changed in maas:
assignee: nobody → Gavin Panella (allenap)
importance: Undecided → Critical
Changed in maas:
status: New → Triaged
milestone: none → 1.9.0
no longer affects: maas/1.9
Changed in maas:
status: New → Triaged
Changed in maas:
status: Fix Committed → Fix Released
We've recreated this issue this morning.