disk replacement on srv137-bud.infra.mirantis.net

Bug #1620968 reported by Alexander Gubanov
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Fuel for OpenStack
Fix Released
Medium
Andrey Nikitin

Bug Description

Hi devops!

I'd like to replace both disks (details below) on server srv137-bud.infra.mirantis.net
because it fails.

Proof:

root@srv137-bud:~# smartctl -a -d sat /dev/sda | egrep '^(Serial|Device M| 5|196|197|User|# 1])|[1-9]0%'
Device Model: Samsung SSD 850 EVO 500GB
Serial Number: S2RBNXAH116140N
User Capacity: 500,107,862,016 bytes [500 GB]
  5 Reallocated_Sector_Ct 0x0033 099 099 010 Pre-fail Always - 3
# 1 Extended offline Completed: read failure 90% 4941 221437880

root@srv137-bud:~# dd if=/dev/sda of=/dev/null skip=221437880 count=1
dd: error reading ‘/dev/sda’: Input/output error
0+0 records in
0+0 records out
0 bytes (0 B) copied, 5.63734 s, 0.0 kB/s

root@srv137-bud:~# smartctl -a -d sat /dev/sdb | egrep '^(Serial|Device M| 5|196|197|User|# 1])|[1-9]0%'
Device Model: Samsung SSD 850 EVO 500GB
Serial Number: S2RBNXAH116158A
User Capacity: 500,107,862,016 bytes [500 GB]
  5 Reallocated_Sector_Ct 0x0033 099 099 010 Pre-fail Always - 2
# 1 Extended offline Completed: read failure 90% 4941 361953656

root@srv137-bud:~# dd if=/dev/sdb of=/dev/null skip=361953656 count=1
dd: error reading ‘/dev/sdb’: Input/output error
0+0 records in
0+0 records out
0 bytes (0 B) copied, 0.262065 s, 0.0 kB/s

As we have software raid0 for / partition after disk replacement we need to reinstall OS - Ubuntu 14.04.4 x86_64.
Currently server is removed from jenkins pool and ready for maintenance. I confirm data loss.

Tags: area-devops
Andrey Nikitin (heos)
tags: added: area-devops
removed: devops
Changed in fuel:
status: New → Triaged
Revision history for this message
Andrey Nikitin (heos) wrote :

I've created the following ticket to replace the disk: https://mirantis.jira.com/browse/IT-31063.

Changed in fuel:
assignee: Fuel DevOps (fuel-devops) → Andrey Nikitin (heos)
Revision history for this message
Andrey Nikitin (heos) wrote :

The both disks were replaced on the server. The server was redeployed too. Please check and confirm.

Changed in fuel:
status: Triaged → In Progress
status: In Progress → Fix Committed
Changed in fuel:
status: Fix Committed → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.