Detailed bug description:
MongoDB can't be started due to old pid file.
Steps to reproduce:
Deploy latest MOS 9.2 env
Reboot all 3 controllers/mongoDB nodes
Expected results:
MongoDB is started, ceilometer is connected to Mongo
Actual result:
2017-04-12T09:32:35.606011+02:00 cic-2 mongod.27017[27919]: [initandlisten] User Assertion: 12596:old lock file
2017-04-12T09:32:35.606113+02:00 cic-2 mongod.27017[27919]: [initandlisten] exception in initAndListen: 12596 old lock file, terminating
Reproducibility:
Very often
Workaround:
remove old pid file
Description of the environment:
- Operation system: Ubuntu
- Versions of components: MOS 9.2
Additional information:
2017-04-12T09:32:35.605246+02:00 cic-2 mongod.27017[27919]: [DataFileSync] BackgroundJob starting: DataFileSync
2017-04-12T09:32:35.605301+02:00 cic-2 mongod.27017[27919]: shardObjTest passed
2017-04-12T09:32:35.605356+02:00 cic-2 mongod.27017[27919]: isInRangeTest passed
2017-04-12T09:32:35.605501+02:00 cic-2 mongod.27017[27919]: shardKeyTest passed
2017-04-12T09:32:35.605551+02:00 cic-2 mongod.27017[27919]: [initandlisten] MongoDB starting : pid=27919 port=27017 dbpath=/var/lib/mongo/mongodb 64-bit host=cic-2.domain.tld
2017-04-12T09:32:35.605629+02:00 cic-2 mongod.27017[27919]: [initandlisten] db version v2.6.10
2017-04-12T09:32:35.605657+02:00 cic-2 mongod.27017[27919]: [initandlisten] git version: nogitversion
2017-04-12T09:32:35.605682+02:00 cic-2 mongod.27017[27919]: [initandlisten] OpenSSL version: OpenSSL 1.0.1f 6 Jan 2014
2017-04-12T09:32:35.605706+02:00 cic-2 mongod.27017[27919]: [initandlisten] build info: Linux 1852ca316e35 3.13.0-55-generic #94-Ubuntu SMP Thu Jun 18 00:27:10 UTC 2015 x86_64 BOOST_LIB_VERSION=1_54
2017-04-12T09:32:35.605729+02:00 cic-2 mongod.27017[27919]: [initandlisten] allocator: tcmalloc
2017-04-12T09:32:35.605760+02:00 cic-2 mongod.27017[27919]: [initandlisten] options: { config: "/etc/mongodb.conf", net: { bindIp: "127.0.0.1,192.168.2.27", port: 27017 }, operationProfiling: { mode: "slowOp" }, replication: { replSetName: "ceilometer" }, security: { authorization: "enabled", keyFile: "/etc/mongodb.key" }, storage: { dbPath: "/var/lib/mongo/mongodb", directoryPerDB: true, journal: { enabled: true } }, systemLog: { destination: "syslog", quiet: true, verbosity: 1 } }
2017-04-12T09:32:35.606011+02:00 cic-2 mongod.27017[27919]: [initandlisten] User Assertion: 12596:old lock file
2017-04-12T09:32:35.606113+02:00 cic-2 mongod.27017[27919]: [initandlisten] exception in initAndListen: 12596 old lock file, terminating
2017-04-12T09:32:35.606141+02:00 cic-2 mongod.27017[27919]: [initandlisten] dbexit:
2017-04-12T09:32:35.606164+02:00 cic-2 mongod.27017[27919]: [initandlisten] shutdown: going to close listening sockets...
2017-04-12T09:32:35.606187+02:00 cic-2 mongod.27017[27919]: [initandlisten] shutdown: going to flush diaglog...
2017-04-12T09:32:35.606209+02:00 cic-2 mongod.27017[27919]: [initandlisten] shutdown: going to close sockets...
2017-04-12T09:32:35.606264+02:00 cic-2 mongod.27017[27919]: [initandlisten] shutdown: waiting for fs preallocator...
2017-04-12T09:32:35.606297+02:00 cic-2 mongod.27017[27919]: [initandlisten] shutdown: lock for final commit...
2017-04-12T09:32:35.606343+02:00 cic-2 mongod.27017[27919]: [initandlisten] shutdown: final commit...
2017-04-12T09:32:35.606384+02:00 cic-2 mongod.27017[27919]: [initandlisten] shutdown: closing all files...
2017-04-12T09:32:35.606414+02:00 cic-2 mongod.27017[27919]: [initandlisten] closeAllFiles() finished
2017-04-12T09:32:35.606447+02:00 cic-2 mongod.27017[27919]: [initandlisten] dbexit: really exiting now
Impact: 12T18:15: 11.478877+ 02:00 cic-1 mongod.27017[7475]: [rsHealthPoll] couldn't connect to 192.168.2.27:27017: couldn't connect to server 192.168.2.27:27017 (192.168.2.27) failed, connection attempt failed 12T18:15: 11.487226+ 02:00 cic-1 mongod.27017[7475]: [rsHealthPoll] Failed to connect to 192.168.2.27:27017, reason: errno:111 Connection refused 12T18:15: 11.487542+ 02:00 cic-1 mongod.27017[7475]: [rsHealthPoll] Failed to connect to 192.168.2.27:27017, reason: errno:111 Connection refused 12T18:15: 11.487563+ 02:00 cic-1 mongod.27017[7475]: [rsHealthPoll] couldn't connect to 192.168.2.27:27017: couldn't connect to server 192.168.2.27:27017 (192.168.2.27) failed, connection attempt failed
Two other nodes have the following errors:
2017-04-
2017-04-
2017-04-
2017-04-
Ceiometer-collector can't connect to primary node:
2017-04- 12T09:14: 26.096726+ 02:00 cic-1 ceilometer- collector[ 8529]: 2017-04-12 09:14:26.093 8529 ERROR oslo.messaging. _drivers. impl_rabbit [-] AMQP server on 192.168.2.26:5673 is unreachable: [Errno 111] Connection refused. Trying again in 1 seconds. 12T09:14: 27.116837+ 02:00 cic-1 ceilometer- collector[ 8529]: 2017-04-12 09:14:27.116 8529 INFO oslo.messaging. _drivers. impl_rabbit [-] Reconnected to AMQP server on 192.168.2.25:5673 via [amqp] client 12T09:14: 42.687743+ 02:00 cic-1 ceilometer- collector[ 8518]: 2017-04-12 09:14:42.687 8518 WARNING oslo_config.cfg [-] Option "metering_secret" from group "publisher" is deprecated. Use option "telemetry_secret" from group "publisher". 12T09:14: 42.690329+ 02:00 cic-1 ceilometer- collector[ 8517]: 2017-04-12 09:14:42.689 8517 WARNING oslo_config.cfg [-] Option "metering_secret" from group "publisher" is deprecated. Use option "telemetry_secret" from group "publisher".
2017-04-
2017-04-
2017-04-