Comment 1 for bug 1896631

Steven Webster (swebster-wr) wrote :

Note:

This can affect pods on an AIO system that use SR-IOV interfaces on an N3000 FPGA device. There is a race between the Kubernetes processes coming up after the controller manifest is applied and the application of the worker manifest. When the worker manifest resets the FPGA device, the SR-IOV interface inside an already-running pod 'disappears', and it is not plugged back in unless the pod is restarted. A proper fix would be quite extensive: it would require a new AIO manifest, or a separate Kubernetes manifest, to coordinate the bring-up of the k8s services with the worker configuration. To mitigate this in the meantime, we could probably plug into the recently introduced pod recovery mechanism.
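
To make the mitigation idea concrete, here is a rough sketch of how a recovery step might pick out the pods to restart. It is not the actual pod recovery code. The helper name pods_needing_restart and the fpga_reset_time input are hypothetical, and it assumes that affected pods carry the Multus networks annotation and that the time of the FPGA reset is known. The annotation only shows that a pod has secondary network attachments, so this is a heuristic and may select pods that do not use SR-IOV.

```python
from datetime import datetime, timezone

# Multus annotation listing a pod's additional network attachments.
NETWORKS_ANNOTATION = "k8s.v1.cni.cncf.io/networks"


def parse_k8s_time(value):
    """Parse a Kubernetes timestamp such as 2020-09-22T14:05:00Z."""
    parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%SZ")
    return parsed.replace(tzinfo=timezone.utc)


def pods_needing_restart(pods, fpga_reset_time):
    """Return (namespace, name) for pods that may have lost an interface.

    pods: the "items" list from `kubectl get pods --all-namespaces -o json`.
    fpga_reset_time: timezone-aware datetime of the FPGA reset
                     (hypothetical input, not provided by any existing tool).
    """
    stale = []
    for pod in pods:
        meta = pod.get("metadata", {})
        annotations = meta.get("annotations") or {}
        if NETWORKS_ANNOTATION not in annotations:
            continue  # no secondary network attachments, so not a candidate
        start = (pod.get("status") or {}).get("startTime")
        if start is None:
            continue  # pod has not started yet, so there is nothing to recover
        # Pods started before the reset are the ones that lose the interface.
        if parse_k8s_time(start) < fpga_reset_time:
            stale.append((meta.get("namespace", "default"), meta["name"]))
    return stale
```

Each pod returned could then be deleted (for example with kubectl delete pod) so that its controller recreates it and the interface is plugged in again. This only helps for pods that are managed by a controller.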