new controller machines stay in "adding-vote" forever
Bug #1720251 reported by
Jason Hobbs
This bug affects 2 people
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Canonical Juju |
Fix Released
|
High
|
Tim Penhey |
Bug Description
juju version: 2.2.4
maas version: 2.2.3-6114-
After running enable-ha, my two new controller machines have been in 'adding-vote' state for over an hour. Their machine logs have this:
2017-09-28 21:59:47 ERROR juju.worker.
Here is the status:
http://
I've attached a crashdump from the deployment.
This doesn't happen everytime - we've had quite a few runs where it didn't happen and this is the first where it did.
The longest I've let it stay in this state is 8 hours - it seems like it will be stuck forever.
tags: | added: intermittent-failure |
tags: | added: cpe-onsite |
Changed in juju: | |
milestone: | 2.3.1 → none |
summary: |
- new controller machines stay in "adding-vote" for over an hour + new controller machines stay in "adding-vote" forever |
description: | updated |
Changed in juju: | |
milestone: | none → 2.3.2 |
Changed in juju: | |
assignee: | nobody → John A Meinel (jameinel) |
Changed in juju: | |
status: | Triaged → In Progress |
Changed in juju: | |
assignee: | John A Meinel (jameinel) → Tim Penhey (thumper) |
Changed in juju: | |
status: | In Progress → Fix Committed |
Changed in juju: | |
status: | Fix Committed → Fix Released |
To post a comment you must log in.
Related to mongo on these additional controller machines, I am seeing this in 1/baremetal/ log/syslog (for example, machine 1):
Sep 28 20:45:56 juju-2 mongod. 37017[11659] : [initandlisten] MongoDB starting : pid=11659 port=37017 dbpath= /var/lib/ juju/db 64-bit host=juju-2 37017[11659] : [initandlisten] db version v3.2.12 37017[11659] : [initandlisten] git version: ef3e1bc78e997f0 d9f22f45aeb1d8e 3b6ac14a14 37017[11659] : [initandlisten] OpenSSL version: OpenSSL 1.0.2g 1 Mar 2016 37017[11659] : [initandlisten] allocator: tcmalloc 37017[11659] : [initandlisten] modules: none 37017[11659] : [initandlisten] build environment: 37017[11659] : [initandlisten] distarch: x86_64 37017[11659] : [initandlisten] target_arch: x86_64 37017[11659] : [initandlisten] options: { net: { ipv6: true, port: 37017, ssl: { PEMKeyFile: "/var/lib/ juju/server. pem", PEMKeyPassword: "<password>", mode: "requireSSL" } }, replication: { oplogSizeMB: 1024, replSet: "juju" }, security: { authorization: "enabled", keyFile: "/var/lib/ juju/shared- secret" }, storage: { dbPath: "/var/lib/juju/db", engine: "wiredTiger", journal: { enabled: true }, wiredTiger: { engineConfig: { cacheSizeGB: 1 } } }, systemLog: { destination: "syslog", quiet: true } } 37017[11659] : [initandlisten] wiredtiger_open config: create, cache_size= 1G,session_ max=20000, eviction= (threads_ max=4), config_ base=false, statistics= (fast), log=(enabled= true,archive= true,path= journal, compressor= snappy) ,file_manager= (close_ idle_time= 100000) ,checkpoint= (wait=60, log_size= 2GB),statistics _log=(wait= 0), 37017[11659] : [initandlisten] ** WARNING: You are running this process as the root user, which is not recommended. 37017[11659] : [initandlisten] 37017[11659] : [initandlisten] 37017[11659] : [initandlisten] ** WARNING: soft rlimits too low. rlimits set to 20000 processes, 65000 files. Number of processes should be at least 32500 : 0.5 times number of files. 37017[11659] : [initandlisten] Did not find local voted for document at startup. 37017[11659] : [initandlisten] Did not find local replica set configuration document at startup; NoMatchingDocument: Did not find replica set configuration document in local.system. replset 37017[11659] : [HostnameCanoni calizationWorke r] Starting hostname canonicalization worker 37017[11659] : [initandlisten] Initializing full-time diagnostic data capture with directory '/var/lib/ juju/db/ diagnostic. data' 37017[11659] : [initandlisten] waiting for connections on port 37017 ssl 37017[11659] : [conn1] note: no users configured in admin.system.users, allowing localhost access
Sep 28 20:45:56 juju-2 mongod.
Sep 28 20:45:56 juju-2 mongod.
Sep 28 20:45:56 juju-2 mongod.
Sep 28 20:45:56 juju-2 mongod.
Sep 28 20:45:56 juju-2 mongod.
Sep 28 20:45:56 juju-2 mongod.
Sep 28 20:45:56 juju-2 mongod.
Sep 28 20:45:56 juju-2 mongod.
Sep 28 20:45:56 juju-2 mongod.
Sep 28 20:45:56 juju-2 mongod.
Sep 28 20:46:10 juju-2 mongod.
Sep 28 20:46:10 juju-2 mongod.
Sep 28 20:46:10 juju-2 mongod.
Sep 28 20:46:10 juju-2 mongod.
Sep 28 20:46:10 juju-2 mongod.
Sep 28 20:46:10 juju-2 mongod.
Sep 28 20:46:10 juju-2 mongod.
Sep 28 20:46:10 juju-2 mongod.
Sep 28 20:46:10 juju-2 mongod.
Sep 28 20:46:11 juju-2 mongod.