Ubuntu 16.04 testing with PMC 12GB Series 8 adapter with MaxView RAID Migration deletes the RAID array.

Bug #1566400 reported by bugproxy
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
The Ubuntu-power-systems project
Invalid
Critical
Unassigned
linux (Ubuntu)
Invalid
Critical
Canonical Kernel Team

Bug Description

==== State: Open by: nauahmed on 17 March 2016 16:09:48 ====

Hi,

While using MaxView Storage Manager on Ubuntu 16.04
from https://w3-connections.ibm.com/communities/service/html/communitystart?communityUuid=7bb2d98d-64aa-437d-863e-db67dc1f50db
https://w3-connections.ibm.com/wikis/home?lang=en-us#!/wiki/W6ee916ce65bf_4973_a6c3_a0f025d8acc6/page/PMC%20Adapter%20Series%207%20and%20Series%208%20Code%20Deliveries

PostGA_2.zip https://w3-connections.ibm.com/files/form/anonymous/api/library/c5f8d539-710e-4e26-9d2f-5073644c8c67/document/f9b22e38-4b5e-46ab-94ca-f5cb80b9e13d/media/PostGA_2.zip

with Expand/Migrate Logical Drive option in web GUI menu.
whole array is lost.

Steps:
1> INITIALIZE ALL disks # arcconf task start 1 device all INITIALIZE noprompt

Controllers found: 1
Initializing Channel 0, Device 9.
Initializing Channel 0, Device 12.
Initializing Channel 0, Device 13.
Initializing Channel 0, Device 14.
4 device(s) initialized.

Command completed successfully.

2> Login to Web gui of MSM..
clink on controller --> create logical device ---> custom mode -->RAID 1 Mirroring; 2 drives required; redundant ---> select minimum two drives ---> next ---> finish.

3> Click on new logical raid1 device, click expand/migrate logical device

RAID1
Select a RAID level and click 'Next' to continue
 Select raid Level
 RAID Members
 Set Attributes
 Summary
 RAID 1 Mirroring,2 drives required, redundant
 RAID 5 Striping with parity,3 or more drives required, redundant
 Simple Volume A single drive segment,not redundant

Select RAID5, add additional drive, next, Finish,

Existing RAID 1 logical device is lost.. there is no migration <-- ALL data lost...

root@p7pran04:~# uname -a
Linux p7pran04 4.4.0-12-generic #28-Ubuntu SMP Wed Mar 9 00:40:38 UTC 2016 ppc64le ppc64le ppc64le GNU/Linux
root@p7pran04:~# arcconf getversion
Controllers found: 1
Controller #1
==============
Firmware : 7.8-0 (32968)
Staged Firmware : 7.8-0 (32968)
BIOS : 7.8-0 (32968)
Driver : 1.2-1 (41010)
Boot Flash : 7.8-1 (32968)
CPLD (Load version/ Flash version) : 5/ 10
SEEPROM (Load version/ Flash version) : 1/ 1
root@p7pran04:~# modinfo aacraid
filename: /lib/modules/4.4.0-12-generic/kernel/drivers/scsi/aacraid/aacraid.ko
version: 1.2-1[41010]-ms
license: GPL
description: Dell PERC2, 2/Si, 3/Si, 3/Di, Adaptec Advanced Raid Products, HP NetRAID-4M, IBM ServeRAID & ICP SCSI driver
author: Red Hat Inc and Adaptec
srcversion: FE59F41215F9D0437EA29CE
alias: pci:v00009005d0000028Fsv*sd*bc*sc*i*
alias: pci:v00009005d0000028Dsv*sd*bc*sc*i*
alias: pci:v00009005d0000028Csv*sd*bc*sc*i*
alias: pci:v00009005d0000028Bsv*sd*bc*sc*i*
alias: pci:v00009005d00000288sv*sd*bc*sc*i*
alias: pci:v00009005d00000286sv*sd*bc*sc*i*
alias: pci:v00009005d00000285sv*sd*bc*sc*i*
alias: pci:v00009005d00000285sv000017AAsd*bc*sc*i*
alias: pci:v00009005d00000285sv00001028sd*bc*sc*i*
alias: pci:v00001011d00000046sv0000103Csd000010C2bc*sc*i*
alias: pci:v00001011d00000046sv00009005sd00001364bc*sc*i*
alias: pci:v00001011d00000046sv00009005sd00000364bc*sc*i*
alias: pci:v00001011d00000046sv00009005sd00000365bc*sc*i*
alias: pci:v00009005d00000285sv00001028sd00000287bc*sc*i*
alias: pci:v00009005d00000286sv00009005sd000002A2bc*sc*i*
alias: pci:v00009005d00000285sv00009005sd0000029Abc*sc*i*
alias: pci:v00009005d00000285sv00009005sd00000299bc*sc*i*
alias: pci:v00009005d00000285sv00009005sd00000298bc*sc*i*
alias: pci:v00009005d00000286sv00001014sd00009540bc*sc*i*
alias: pci:v00009005d00000286sv00001014sd00009580bc*sc*i*
alias: pci:v00009005d00000285sv00001014sd00000312bc*sc*i*
alias: pci:v00009005d00000285sv00001014sd000002F2bc*sc*i*
alias: pci:v00009005d00000285sv00009005sd00000297bc*sc*i*
alias: pci:v00009005d00000285sv00009005sd00000296bc*sc*i*
alias: pci:v00009005d00000285sv0000103Csd00003227bc*sc*i*
alias: pci:v00009005d00000285sv00009005sd00000294bc*sc*i*
alias: pci:v00009005d00000285sv00009005sd00000293bc*sc*i*
alias: pci:v00009005d00000285sv00009005sd00000292bc*sc*i*
alias: pci:v00009005d00000285sv00001028sd00000291bc*sc*i*
alias: pci:v00009005d00000285sv00009005sd00000290bc*sc*i*
alias: pci:v00009005d00000285sv00009005sd0000028Fbc*sc*i*
alias: pci:v00009005d00000285sv00009005sd0000028Ebc*sc*i*
alias: pci:v00009005d00000286sv00009005sd00000800bc*sc*i*
alias: pci:v00009005d00000200sv00009005sd00000200bc*sc*i*
alias: pci:v00009005d00000287sv00009005sd00000800bc*sc*i*
alias: pci:v00009005d00000286sv00009005sd000002A6bc*sc*i*
alias: pci:v00009005d00000285sv00009005sd000002A5bc*sc*i*
alias: pci:v00009005d00000285sv00009005sd000002A4bc*sc*i*
alias: pci:v00009005d00000286sv00009005sd000002A3bc*sc*i*
alias: pci:v00009005d00000286sv00009005sd000002A1bc*sc*i*
alias: pci:v00009005d00000286sv00009005sd000002A0bc*sc*i*
alias: pci:v00009005d00000286sv00009005sd0000029Fbc*sc*i*
alias: pci:v00009005d00000286sv00009005sd0000029Ebc*sc*i*
alias: pci:v00009005d00000286sv00009005sd0000029Dbc*sc*i*
alias: pci:v00009005d00000286sv00009005sd0000029Cbc*sc*i*
alias: pci:v00009005d00000286sv00009005sd0000029Bbc*sc*i*
alias: pci:v00009005d00000286sv00009005sd0000028Dbc*sc*i*
alias: pci:v00009005d00000286sv00009005sd0000028Cbc*sc*i*
alias: pci:v00009005d00000285sv00009005sd0000028Bbc*sc*i*
alias: pci:v00009005d00000285sv00009005sd0000028Abc*sc*i*
alias: pci:v00009005d00000285sv00009005sd00000289bc*sc*i*
alias: pci:v00009005d00000285sv00009005sd00000288bc*sc*i*
alias: pci:v00009005d00000285sv000017AAsd00000287bc*sc*i*
alias: pci:v00009005d00000285sv000017AAsd00000286bc*sc*i*
alias: pci:v00009005d00000285sv00009005sd00000287bc*sc*i*
alias: pci:v00009005d00000285sv00009005sd00000285bc*sc*i*
alias: pci:v00009005d00000285sv00009005sd00000286bc*sc*i*
alias: pci:v00009005d00000284sv00009005sd00000284bc*sc*i*
alias: pci:v00009005d00000283sv00009005sd00000283bc*sc*i*
alias: pci:v00001028d0000000Asv00001028sd00000121bc*sc*i*
alias: pci:v00001028d0000000Asv00001028sd0000011Bbc*sc*i*
alias: pci:v00001028d0000000Asv00001028sd00000106bc*sc*i*
alias: pci:v00001028d00000002sv00001028sd000000D9bc*sc*i*
alias: pci:v00001028d00000002sv00001028sd000000D1bc*sc*i*
alias: pci:v00001028d00000004sv00001028sd000000D0bc*sc*i*
alias: pci:v00001028d00000003sv00001028sd00000003bc*sc*i*
alias: pci:v00001028d00000002sv00001028sd00000002bc*sc*i*
alias: pci:v00001028d00000001sv00001028sd00000001bc*sc*i*
depends:
intree: Y
vermagic: 4.4.0-12-generic SMP mod_unload modversions
parm: aac_sync_mode:Force sync. transfer mode 0=off, 1=on (int)
parm: aac_convert_sgl:Convert non-conformable s/g list 0=off, 1=on (int)
parm: nondasd:Control scanning of hba for nondasd devices. 0=off, 1=on (int)
parm: cache:Disable Queue Flush commands:
 bit 0 - Disable FUA in WRITE SCSI commands
 bit 1 - Disable SYNCHRONIZE_CACHE SCSI command
 bit 2 - Disable only if Battery is protecting Cache (int)
parm: dacmode:Control whether dma addressing is using 64 bit DAC. 0=off, 1=on (int)
parm: commit:Control whether a COMMIT_CONFIG is issued to the adapter for foreign arrays.
This is typically needed in systems that do not have a BIOS. 0=off, 1=on (int)
parm: msi:IRQ handling. 0=PIC(default), 1=MSI, 2=MSI-X) (int)
parm: startup_timeout:The duration of time in seconds to wait for adapter to have it's kernel up and
running. This is typically adjusted for large systems that do not have a BIOS. (int)
parm: aif_timeout:The duration of time in seconds to wait for applications to pick up AIFs before
deregistering them. This is typically adjusted for heavily burdened systems. (int)
parm: numacb:Request a limit to the number of adapter control blocks (FIB) allocated. Valid values are 512 and down. Default is to use suggestion from Firmware. (int)
parm: acbsize:Request a specific adapter control block (FIB) size. Valid values are 512, 2048, 4096 and 8192. Default is to use suggestion from Firmware. (int)
parm: update_interval:Interval in seconds between time sync updates issued to adapter. (int)
parm: check_interval:Interval in seconds between adapter health checks. (int)
parm: check_reset:If adapter fails health check, reset the adapter. a value of -1 forces the reset to adapters programmed to ignore it. (int)
parm: expose_physicals:Expose physical components of the arrays. -1=protect 0=off, 1=on (int)
parm: reset_devices:Force an adapter reset at initialization. (int)
parm: wwn:Select a WWN type for the arrays:
 0 - Disable
 1 - Array Meta Data Signature (default)
 2 - Adapter Serial Number (int)

With regards,
Naveed

==== State: Open by: nauahmed on 18 March 2016 00:17:41 ====

Hi Noah,

There is no out box driver built for Ubuntu 16.04 yet..
Also we were asked to test inbox Ubuntu 16.04 driver.
Hence except the driver reset everything is outbox.

Yes, this issue is RAID 1 to RAID 5 Migration failure.. not only failure, the existing RAID1 gets deleted by this action, which should not have happened at any cause.

With regards,
Naveed

----- Forwarded by Naveed Us Ahmed/India/IBM on 03/30/2016 11:10 AM -----
> From: Douglas Miller1/Rochester/IBM
> To: Naveed Us Ahmed/India/IBM
> Date: 03/29/2016 07:46 AM
> Subject: Re: dkms driver
>
> It would appear that Microsemi has not updated their out-of-box
> driver for kernel 4.4.
>
> I think we've done our due diligence, and it's time to open a JIRA
> to Microsemi. Let them provide us with what is necessary to diagnose further.

== Comment: #8 - Brian J. King - 2016-04-05 09:40:18 ==
Once mirrored to Launchpad, please open to PMC-Sierra / Microsemi.

bugproxy (bugproxy)
tags: added: architecture-ppc64le bugnameltc-139532 severity-high targetmilestone-inin1604
Changed in ubuntu:
assignee: nobody → Taco Screen team (taco-screen-team)
Kevin W. Rudd (kevinr)
affects: ubuntu → linux (Ubuntu)
Revision history for this message
bugproxy (bugproxy) wrote : Comment bridged from LTC Bugzilla

------- Comment From <email address hidden> 2016-04-14 08:49 EDT-------
(In reply to comment #11)
> Hi Naveed,
>
> We need to know if this same test ever succeeded, and under what conditions.
> Was this test tried more than once on Ubuntu 16.04? In other words, is it a
> transient failure or does it fail every time? And if it was run on other
> platforms/OS version, where did it succeed?
>
> Thanks,

Hi Dough,

We tested this today on 14.04.3 and RAID migration fails here as well (RAID1 drive was deleted when we tried to migrate to RAID 5 by adding another disk)..
for 14.04.3 we tested with outbox driver
# modinfo aacraid
filename: /lib/modules/3.19.0-43-generic/weak-updates/aacraid/aacraid.ko
version: 1.2-1.41059

Hence this issue was present in 14.04.3 as well.

let me know if any logs with respect to 14.04.3 is needed..
Also any updates from JIRA re-open?

With regards,
Naveed

tags: added: severity-critical
removed: severity-high
Revision history for this message
bugproxy (bugproxy) wrote :

------- Comment From <email address hidden> 2016-04-14 10:21 EDT-------

Revision history for this message
bugproxy (bugproxy) wrote : Bug reproductions snaps and logs

------- Comment (attachment only) From <email address hidden> 2016-05-06 14:18 EDT-------

Revision history for this message
bugproxy (bugproxy) wrote : Comment bridged from LTC Bugzilla

------- Comment From <email address hidden> 2016-05-06 14:26 EDT-------
cde00 (<email address hidden>) added native attachment /tmp/AIXOS05657240/Bug139532_reproduction.tar on 2016-05-06 13:23:50

Revision history for this message
Michael Hohnbaum (hohnbaum) wrote :

Canonical does not have access to this hardware setup. Please provide proposed patches to address this issue.

Changed in linux (Ubuntu):
status: New → Incomplete
Revision history for this message
bugproxy (bugproxy) wrote : Bug reproductions snaps and logs
Revision history for this message
bugproxy (bugproxy) wrote : Comment bridged from LTC Bugzilla

------- Comment From <email address hidden> 2017-02-24 15:20 EDT-------

Revision history for this message
bugproxy (bugproxy) wrote : Bug reproductions snaps and logs
Manoj Iyer (manjo)
Changed in ubuntu-power-systems:
status: New → Incomplete
Manoj Iyer (manjo)
tags: added: ubuntu-16.04
Manoj Iyer (manjo)
Changed in linux (Ubuntu):
importance: Undecided → Critical
Changed in ubuntu-power-systems:
importance: Undecided → Critical
Changed in linux (Ubuntu):
assignee: Taco Screen team (taco-screen-team) → Canonical Kernel Team (canonical-kernel-team)
Revision history for this message
Joseph Salisbury (jsalisbury) wrote :

Are there patches available yet to address this issue?

tags: added: kernel-da-key
Changed in linux (Ubuntu):
status: Incomplete → Triaged
Revision history for this message
Douglas Miller (dougmill-ibm) wrote :

The corresponding bug for this issue has been closed as unreproducible. Unclear why that was not communicated here. This launchpad can also be closed.

Revision history for this message
bugproxy (bugproxy) wrote : Comment bridged from LTC Bugzilla

------- Comment From <email address hidden> 2017-07-20 10:20 EDT-------
Sorry for the confusion, something must not be mirroring status properly. This bug has been CLOSED UNREPRODUCIBLE.

Revision history for this message
Joseph Salisbury (jsalisbury) wrote :

This bug has been marked invalid per comment #11.

Changed in linux (Ubuntu):
status: Triaged → Invalid
bugproxy (bugproxy)
tags: removed: bugnameltc-139532 kernel-da-key severity-critical ubuntu-16.04
Frank Heimes (fheimes)
Changed in ubuntu-power-systems:
status: Incomplete → Invalid
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.