juju-db fails to start -- WiredTiger reports Input/output error
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Canonical Juju |
Invalid
|
Critical
|
Alexis Bruemmer | ||
Ubuntu on IBM z Systems |
Invalid
|
Critical
|
Unassigned |
Bug Description
$ juju --version
2.0-rc3-
$ lxd --version
2.0.4
Controller bootstrapped into clean LXD/local environment 3 days ago. Multiple models were created and deleted on the first day, and then the system sat idle over the weekend.
Coming in today, most "juju X" commands simply hang with no output. Debug switch shows that wss is failing:
$ juju switch default --debug
13:13:00 INFO juju.cmd supercommand.go:63 running juju [2.0-rc3 gc go1.6.2]
13:13:00 DEBUG juju.cmd supercommand.go:64 args: []string{"juju", "switch", "default", "--debug"}
13:13:00 INFO juju.juju api.go:72 connecting to API addresses: [10.113.
13:13:00 INFO juju.api apiclient.go:507 dialing "wss://
13:13:02 INFO juju.api apiclient.go:507 dialing "wss://
^C
Attempting to restart jujud manually are successful, but I noticed mongodb connection errors in the log:
2016-10-10 17:18:03 WARNING juju.mongo open.go:134 mongodb connection failed, will retry: dial tcp 127.0.0.1:37017: getsockopt: connection refused
Checking the logs for juju-db, it looks bad:
https:/
Further issues found in dmesg:
https:/
I've attached the full mongodb.log.
Changed in juju: | |
status: | New → Triaged |
importance: | Undecided → High |
milestone: | none → 2.0.1 |
Changed in juju: | |
status: | Triaged → Incomplete |
milestone: | 2.0.1 → none |
importance: | High → Undecided |
Changed in ubuntu-z-systems: | |
status: | New → Incomplete |
Changed in juju: | |
status: | Incomplete → Triaged |
importance: | Undecided → Critical |
milestone: | none → 2.0.1 |
Changed in ubuntu-z-systems: | |
status: | Incomplete → Triaged |
Changed in juju: | |
assignee: | nobody → Alexis Bruemmer (alexis-bruemmer) |
Changed in ubuntu-z-systems: | |
importance: | Undecided → Critical |
Changed in juju: | |
milestone: | 2.0.1 → none |
Changed in juju: | |
milestone: | none → 2.1.0 |
Changed in juju: | |
milestone: | 2.1.0 → none |
Changed in ubuntu-z-systems: | |
status: | Triaged → Invalid |
Hi, I ported the s390x support from master back to the 3.2 branch for juju-mongodb so it's possible I've missed something, I'm not really a deep expert on mongodb or s390x. But my patches pass mongodb's own tests which are reasonably comprehensive. But I don't really understand what it going on here...
The first error is this:
Oct 08 14:42:57 juju-84a348-0 mongod. 37017[17241] : [thread1] WiredTiger (52) [1475937777: 172958] [17241: 0x3ff977ff910] , file:collection -27-37850583920 42379666. wt, WT_SESSION. checkpoint: /var/lib/ juju/db/ collection- 27-378505839204 2379666. wt: handle-write: pwrite: failed to write 8192 bytes at offset 1044480: Invalid exchange
"Invalid exchange" means EBADE and grepping the kernel suggests you are using dasd storage? Is it possible your disk has gone bad or something (although I guess that sort of thing is less likely on big iron). Is there anything in dmesg or so on?