Activity log for bug #1869947

Date Who What changed Old value New value Message
2020-03-31 19:33:26 Jeff Lane  bug added bug
2020-03-31 19:33:38 Jeff Lane  linux (Ubuntu): status New In Progress
2020-03-31 19:33:42 Jeff Lane  linux (Ubuntu): importance Undecided High
2020-03-31 19:33:44 Jeff Lane  linux (Ubuntu): assignee Jeff Lane (bladernr)
2020-03-31 19:43:42 Jeff Lane  description [IMPACT] This upstream commit: 863fbae929c7a5b64e96b8a3ffb34a29eefb9f8f nvme_fc: add module to ops template to allow module references was pulled into Focal as part of this bug: https://bugs.launchpad.net/bugs/1860125 and this commit into Focal: a7441301b20dc8a7772972a092968bb353b7a642 This patch was added to prevent the unloading of lpfc driver when doing NVMe BFS booting. Broadcom has identified that this patch introduces a bug that prevents unloading of lpfc driver in the non-BFS case as well. Because of this, they have asked that we revert this patch and remove it from our kernel, and also said that they are working on a patch upstream to revert this as well. As it may not land upstream in time to meet kernel freeze for Focal, I'm providing a patch to revert this commit in Focal. Broadcom has asked that we revert this commit from the Focal tree and are working on a patch to revert this commit upstream. The reason is that this patch has been determined to be the root cause of a bug identified by the upstream provider [FIXES] To resolve, revert the following commit ID in the Focal Tree: a7441301b20dc8a7772972a092968bb353b7a642 nvme_fc: add module to ops template to allow module references BugLink: https://bugs.launchpad.net/bugs/1860125 [ Upstream commit 863fbae929c7a5b64e96b8a3ffb34a29eefb9f8f ] In nvme-fc: it's possible to have connected active controllers and as no references are taken on the LLDD, the LLDD can be unloaded. The controller would enter a reconnect state and as long as the LLDD resumed within the reconnect timeout, the controller would resume. But if a namespace on the controller is the root device, allowing the driver to unload can be problematic. To reload the driver, it may require new io to the boot device, and as it's no longer connected we get into a catch-22 that eventually fails, and the system locks up. Fix this issue by taking a module reference for every connected controller (which is what the core layer did to the transport module). Reference is cleared when the controller is removed. Acked-by: Himanshu Madhani <hmadhani@marvell.com> Reviewed-by: Christoph Hellwig <hch@lst.de> Signed-off-by: James Smart <jsmart2021@gmail.com> Signed-off-by: Keith Busch <kbusch@kernel.org> Signed-off-by: Sasha Levin <sashal@kernel.org> Signed-off-by: Paolo Pisati <paolo.pisati@canonical.com> [REGRESSION RISK] Regression risk is low as this is a new patch introduced upstream and synced into a development release. The patch only exists in our tree from Ubuntu-5.4-5.4.0-13.16 onwards and does not affect older kernels. [IMPACT] This upstream commit: 863fbae929c7a5b64e96b8a3ffb34a29eefb9f8f nvme_fc: add module to ops template to allow module references was pulled into Focal as part of this bug: https://bugs.launchpad.net/bugs/1860125 and this commit into Focal: a7441301b20dc8a7772972a092968bb353b7a642 This patch was added to prevent the unloading of lpfc driver when doing NVMe BFS booting. Broadcom has identified that this patch introduces a bug that prevents unloading of lpfc driver in the non-BFS case as well. Because of this, they have asked that we revert this patch and remove it from our kernel, and also said that they are working on a patch upstream to revert this as well. As it may not land upstream in time to meet kernel freeze for Focal, I'm providing a patch to revert this commit in Focal. [FIXES] To resolve, revert the following commit ID in the Focal Tree: a7441301b20dc8a7772972a092968bb353b7a642 nvme_fc: add module to ops template to allow module references     BugLink: https://bugs.launchpad.net/bugs/1860125     [ Upstream commit 863fbae929c7a5b64e96b8a3ffb34a29eefb9f8f ]     In nvme-fc: it's possible to have connected active controllers     and as no references are taken on the LLDD, the LLDD can be     unloaded. The controller would enter a reconnect state and as     long as the LLDD resumed within the reconnect timeout, the     controller would resume. But if a namespace on the controller     is the root device, allowing the driver to unload can be problematic.     To reload the driver, it may require new io to the boot device,     and as it's no longer connected we get into a catch-22 that     eventually fails, and the system locks up.     Fix this issue by taking a module reference for every connected     controller (which is what the core layer did to the transport     module). Reference is cleared when the controller is removed.     Acked-by: Himanshu Madhani <hmadhani@marvell.com>     Reviewed-by: Christoph Hellwig <hch@lst.de>     Signed-off-by: James Smart <jsmart2021@gmail.com>     Signed-off-by: Keith Busch <kbusch@kernel.org>     Signed-off-by: Sasha Levin <sashal@kernel.org>     Signed-off-by: Paolo Pisati <paolo.pisati@canonical.com> [REGRESSION RISK] Regression risk is low as this is a new patch introduced upstream and synced into a development release. The patch only exists in our tree from Ubuntu-5.4-5.4.0-13.16 onwards and does not affect older kernels.
2020-03-31 19:48:00 Jeff Lane  description [IMPACT] This upstream commit: 863fbae929c7a5b64e96b8a3ffb34a29eefb9f8f nvme_fc: add module to ops template to allow module references was pulled into Focal as part of this bug: https://bugs.launchpad.net/bugs/1860125 and this commit into Focal: a7441301b20dc8a7772972a092968bb353b7a642 This patch was added to prevent the unloading of lpfc driver when doing NVMe BFS booting. Broadcom has identified that this patch introduces a bug that prevents unloading of lpfc driver in the non-BFS case as well. Because of this, they have asked that we revert this patch and remove it from our kernel, and also said that they are working on a patch upstream to revert this as well. As it may not land upstream in time to meet kernel freeze for Focal, I'm providing a patch to revert this commit in Focal. [FIXES] To resolve, revert the following commit ID in the Focal Tree: a7441301b20dc8a7772972a092968bb353b7a642 nvme_fc: add module to ops template to allow module references     BugLink: https://bugs.launchpad.net/bugs/1860125     [ Upstream commit 863fbae929c7a5b64e96b8a3ffb34a29eefb9f8f ]     In nvme-fc: it's possible to have connected active controllers     and as no references are taken on the LLDD, the LLDD can be     unloaded. The controller would enter a reconnect state and as     long as the LLDD resumed within the reconnect timeout, the     controller would resume. But if a namespace on the controller     is the root device, allowing the driver to unload can be problematic.     To reload the driver, it may require new io to the boot device,     and as it's no longer connected we get into a catch-22 that     eventually fails, and the system locks up.     Fix this issue by taking a module reference for every connected     controller (which is what the core layer did to the transport     module). Reference is cleared when the controller is removed.     Acked-by: Himanshu Madhani <hmadhani@marvell.com>     Reviewed-by: Christoph Hellwig <hch@lst.de>     Signed-off-by: James Smart <jsmart2021@gmail.com>     Signed-off-by: Keith Busch <kbusch@kernel.org>     Signed-off-by: Sasha Levin <sashal@kernel.org>     Signed-off-by: Paolo Pisati <paolo.pisati@canonical.com> [REGRESSION RISK] Regression risk is low as this is a new patch introduced upstream and synced into a development release. The patch only exists in our tree from Ubuntu-5.4-5.4.0-13.16 onwards and does not affect older kernels. [IMPACT] This upstream commit: 863fbae929c7a5b64e96b8a3ffb34a29eefb9f8f nvme_fc: add module to ops template to allow module references was pulled into Focal as part of this bug: https://bugs.launchpad.net/bugs/1860125 and this commit into Focal: a7441301b20dc8a7772972a092968bb353b7a642 This patch was added to prevent the unloading of lpfc driver when doing NVMe BFS booting. Broadcom has identified that this patch introduces a bug that prevents unloading of lpfc driver in the non-BFS case as well. Because of this, they have asked that we revert this patch and remove it from our kernel, and also said that they are working on a patch upstream to revert this as well. As it may not land upstream in time to meet kernel freeze for Focal, I'm providing a patch to revert this commit in Focal. [FIXES] To resolve, revert the following commit ID in the Focal Tree: a7441301b20dc8a7772972a092968bb353b7a642 nvme_fc: add module to ops template to allow module references     BugLink: https://bugs.launchpad.net/bugs/1860125     [ Upstream commit 863fbae929c7a5b64e96b8a3ffb34a29eefb9f8f ]     In nvme-fc: it's possible to have connected active controllers     and as no references are taken on the LLDD, the LLDD can be     unloaded. The controller would enter a reconnect state and as     long as the LLDD resumed within the reconnect timeout, the     controller would resume. But if a namespace on the controller     is the root device, allowing the driver to unload can be problematic.     To reload the driver, it may require new io to the boot device,     and as it's no longer connected we get into a catch-22 that     eventually fails, and the system locks up.     Fix this issue by taking a module reference for every connected     controller (which is what the core layer did to the transport     module). Reference is cleared when the controller is removed.     Acked-by: Himanshu Madhani <hmadhani@marvell.com>     Reviewed-by: Christoph Hellwig <hch@lst.de>     Signed-off-by: James Smart <jsmart2021@gmail.com>     Signed-off-by: Keith Busch <kbusch@kernel.org>     Signed-off-by: Sasha Levin <sashal@kernel.org>     Signed-off-by: Paolo Pisati <paolo.pisati@canonical.com> An updated tree with the commit reverted can be found here: https://code.launchpad.net/~bladernr/ubuntu/+source/linux/+git/focal/+ref/1869947-revert-lpfc-patch [REGRESSION RISK] Regression risk is low as this is a new patch introduced upstream and synced into a development release. The patch only exists in our tree from Ubuntu-5.4-5.4.0-13.16 onwards and does not affect older kernels.
2020-04-01 15:17:49 Jeff Lane  summary revert patch that introduces bug that prevents lpfc driver unloading nvme_fc patch prevents unloading of lpfc module in both BFS and non-BFS scenarios
2020-04-01 15:18:29 Jeff Lane  description [IMPACT] This upstream commit: 863fbae929c7a5b64e96b8a3ffb34a29eefb9f8f nvme_fc: add module to ops template to allow module references was pulled into Focal as part of this bug: https://bugs.launchpad.net/bugs/1860125 and this commit into Focal: a7441301b20dc8a7772972a092968bb353b7a642 This patch was added to prevent the unloading of lpfc driver when doing NVMe BFS booting. Broadcom has identified that this patch introduces a bug that prevents unloading of lpfc driver in the non-BFS case as well. Because of this, they have asked that we revert this patch and remove it from our kernel, and also said that they are working on a patch upstream to revert this as well. As it may not land upstream in time to meet kernel freeze for Focal, I'm providing a patch to revert this commit in Focal. [FIXES] To resolve, revert the following commit ID in the Focal Tree: a7441301b20dc8a7772972a092968bb353b7a642 nvme_fc: add module to ops template to allow module references     BugLink: https://bugs.launchpad.net/bugs/1860125     [ Upstream commit 863fbae929c7a5b64e96b8a3ffb34a29eefb9f8f ]     In nvme-fc: it's possible to have connected active controllers     and as no references are taken on the LLDD, the LLDD can be     unloaded. The controller would enter a reconnect state and as     long as the LLDD resumed within the reconnect timeout, the     controller would resume. But if a namespace on the controller     is the root device, allowing the driver to unload can be problematic.     To reload the driver, it may require new io to the boot device,     and as it's no longer connected we get into a catch-22 that     eventually fails, and the system locks up.     Fix this issue by taking a module reference for every connected     controller (which is what the core layer did to the transport     module). Reference is cleared when the controller is removed.     Acked-by: Himanshu Madhani <hmadhani@marvell.com>     Reviewed-by: Christoph Hellwig <hch@lst.de>     Signed-off-by: James Smart <jsmart2021@gmail.com>     Signed-off-by: Keith Busch <kbusch@kernel.org>     Signed-off-by: Sasha Levin <sashal@kernel.org>     Signed-off-by: Paolo Pisati <paolo.pisati@canonical.com> An updated tree with the commit reverted can be found here: https://code.launchpad.net/~bladernr/ubuntu/+source/linux/+git/focal/+ref/1869947-revert-lpfc-patch [REGRESSION RISK] Regression risk is low as this is a new patch introduced upstream and synced into a development release. The patch only exists in our tree from Ubuntu-5.4-5.4.0-13.16 onwards and does not affect older kernels. [IMPACT] This upstream commit: 863fbae929c7a5b64e96b8a3ffb34a29eefb9f8f nvme_fc: add module to ops template to allow module references was pulled into Focal as part of this bug: https://bugs.launchpad.net/bugs/1860125 and this commit into Focal: a7441301b20dc8a7772972a092968bb353b7a642 This patch was added to prevent the unloading of lpfc driver when doing NVMe BFS booting. Broadcom has identified that this patch introduces a bug that prevents unloading of lpfc driver in the non-BFS case as well. Because of this, they have asked that we revert this patch and remove it from our kernel, and also said that they are working on a patch upstream to revert this as well. As it may not land upstream in time to meet kernel freeze for Focal, I'm providing a patch to revert this commit in Focal. [FIXES] To resolve, revert the following commit ID in the Focal Tree: a7441301b20dc8a7772972a092968bb353b7a642 nvme_fc: add module to ops template to allow module references     BugLink: https://bugs.launchpad.net/bugs/1860125     [ Upstream commit 863fbae929c7a5b64e96b8a3ffb34a29eefb9f8f ]     In nvme-fc: it's possible to have connected active controllers     and as no references are taken on the LLDD, the LLDD can be     unloaded. The controller would enter a reconnect state and as     long as the LLDD resumed within the reconnect timeout, the     controller would resume. But if a namespace on the controller     is the root device, allowing the driver to unload can be problematic.     To reload the driver, it may require new io to the boot device,     and as it's no longer connected we get into a catch-22 that     eventually fails, and the system locks up.     Fix this issue by taking a module reference for every connected     controller (which is what the core layer did to the transport     module). Reference is cleared when the controller is removed.     Acked-by: Himanshu Madhani <hmadhani@marvell.com>     Reviewed-by: Christoph Hellwig <hch@lst.de>     Signed-off-by: James Smart <jsmart2021@gmail.com>     Signed-off-by: Keith Busch <kbusch@kernel.org>     Signed-off-by: Sasha Levin <sashal@kernel.org>     Signed-off-by: Paolo Pisati <paolo.pisati@canonical.com> An updated tree with the commit reverted can be found here: https://code.launchpad.net/~bladernr/ubuntu/+source/linux/+git/focal/+ref/1869947-revert-lpfc-patch [REGRESSION RISK] Regression risk is low as this is a new patch introduced upstream and synced into a development release. The patch only exists in our tree from Ubuntu-5.4-5.4.0-13.16 onwards and does not affect older kernels.
2020-04-01 15:20:11 Jeff Lane  summary nvme_fc patch prevents unloading of lpfc module in both BFS and non-BFS scenarios nvme_fc patch prevents unloading lpfs module in both BFS and non-BFS scenarios
2020-04-01 15:21:47 Jeff Lane  summary nvme_fc patch prevents unloading lpfs module in both BFS and non-BFS scenarios Revert "nvme_fc: add module to ops template to allow module references"
2020-04-01 15:29:03 Jeff Lane  description [IMPACT] This upstream commit: 863fbae929c7a5b64e96b8a3ffb34a29eefb9f8f nvme_fc: add module to ops template to allow module references was pulled into Focal as part of this bug: https://bugs.launchpad.net/bugs/1860125 and this commit into Focal: a7441301b20dc8a7772972a092968bb353b7a642 This patch was added to prevent the unloading of lpfc driver when doing NVMe BFS booting. Broadcom has identified that this patch introduces a bug that prevents unloading of lpfc driver in the non-BFS case as well. Because of this, they have asked that we revert this patch and remove it from our kernel, and also said that they are working on a patch upstream to revert this as well. As it may not land upstream in time to meet kernel freeze for Focal, I'm providing a patch to revert this commit in Focal. [FIXES] To resolve, revert the following commit ID in the Focal Tree: a7441301b20dc8a7772972a092968bb353b7a642 nvme_fc: add module to ops template to allow module references     BugLink: https://bugs.launchpad.net/bugs/1860125     [ Upstream commit 863fbae929c7a5b64e96b8a3ffb34a29eefb9f8f ]     In nvme-fc: it's possible to have connected active controllers     and as no references are taken on the LLDD, the LLDD can be     unloaded. The controller would enter a reconnect state and as     long as the LLDD resumed within the reconnect timeout, the     controller would resume. But if a namespace on the controller     is the root device, allowing the driver to unload can be problematic.     To reload the driver, it may require new io to the boot device,     and as it's no longer connected we get into a catch-22 that     eventually fails, and the system locks up.     Fix this issue by taking a module reference for every connected     controller (which is what the core layer did to the transport     module). Reference is cleared when the controller is removed.     Acked-by: Himanshu Madhani <hmadhani@marvell.com>     Reviewed-by: Christoph Hellwig <hch@lst.de>     Signed-off-by: James Smart <jsmart2021@gmail.com>     Signed-off-by: Keith Busch <kbusch@kernel.org>     Signed-off-by: Sasha Levin <sashal@kernel.org>     Signed-off-by: Paolo Pisati <paolo.pisati@canonical.com> An updated tree with the commit reverted can be found here: https://code.launchpad.net/~bladernr/ubuntu/+source/linux/+git/focal/+ref/1869947-revert-lpfc-patch [REGRESSION RISK] Regression risk is low as this is a new patch introduced upstream and synced into a development release. The patch only exists in our tree from Ubuntu-5.4-5.4.0-13.16 onwards and does not affect older kernels. [IMPACT] This upstream commit: 863fbae929c7a5b64e96b8a3ffb34a29eefb9f8f nvme_fc: add module to ops template to allow module references was pulled into Focal as part of this bug: https://bugs.launchpad.net/bugs/1860125 and this commit into Focal: a7441301b20dc8a7772972a092968bb353b7a642 This patch was added to prevent the unloading of lpfc driver when doing NVMe BFS booting. Broadcom has identified that this patch introduces a bug that prevents unloading of lpfc driver in the non-BFS case as well. Because of this, they have asked that we revert this patch and remove it from our kernel. Additionally, Broadcome have also said that they are working on a patch upstream to revert this as well. As it may not land upstream in time to meet kernel freeze for Focal, I'm providing a patch to revert this commit directly. [FIXES] To resolve, revert the following commit ID in the Focal Tree: a7441301b20dc8a7772972a092968bb353b7a642 nvme_fc: add module to ops template to allow module references     BugLink: https://bugs.launchpad.net/bugs/1860125     [ Upstream commit 863fbae929c7a5b64e96b8a3ffb34a29eefb9f8f ]     In nvme-fc: it's possible to have connected active controllers     and as no references are taken on the LLDD, the LLDD can be     unloaded. The controller would enter a reconnect state and as     long as the LLDD resumed within the reconnect timeout, the     controller would resume. But if a namespace on the controller     is the root device, allowing the driver to unload can be problematic.     To reload the driver, it may require new io to the boot device,     and as it's no longer connected we get into a catch-22 that     eventually fails, and the system locks up.     Fix this issue by taking a module reference for every connected     controller (which is what the core layer did to the transport     module). Reference is cleared when the controller is removed.     Acked-by: Himanshu Madhani <hmadhani@marvell.com>     Reviewed-by: Christoph Hellwig <hch@lst.de>     Signed-off-by: James Smart <jsmart2021@gmail.com>     Signed-off-by: Keith Busch <kbusch@kernel.org>     Signed-off-by: Sasha Levin <sashal@kernel.org>     Signed-off-by: Paolo Pisati <paolo.pisati@canonical.com> An updated tree with the commit reverted can be found here: https://code.launchpad.net/~bladernr/ubuntu/+source/linux/+git/focal/+ref/1869947-revert-lpfc-patch [REGRESSION RISK] Regression risk is low as this is a new patch introduced upstream and synced into a development release. The patch only exists in our tree from Ubuntu-5.4-5.4.0-13.16 onwards and does not affect older kernels.
2020-04-02 21:01:52 Seth Forshee linux (Ubuntu): status In Progress Fix Committed
2020-04-16 11:31:32 Launchpad Janitor linux (Ubuntu): status Fix Committed Fix Released
2020-04-16 11:31:32 Launchpad Janitor cve linked 2020-11494