Comment 4 for bug 1891889

Revision history for this message
Pratik M. (pvmpublic) wrote : Re: kube-sriovdp container in crash loop

OK, I re-installed R4.0 and followed all the commands for OpenStack-only. Which just means that I left out
the two "system host-memory-modify ${NODE} 0|1 -1G 32" above.

Have not done unlock but still see the container is stuck in ContainerCreating.

It seems to be looking for a /etc/pcidp/config.json on host. Maybe a race condition in the setup (unlock will create it)?

Or is it a documentation issue?

The installation guide said:
# system interface-datanetwork-assign ${NODE} ${DATA0IFUUID} ${PHYSNET0}

But do I need to rather follow this:
https://wiki.openstack.org/wiki/StarlingX/Networking
# system host-if-modify -m 1500 -n sriov -c pci-sriov -N 5 ${COMPUTE} ${DATA0IFUUID}

Or
# system host-if-modify -m 1500 -n sriov1 -d datanet1 -c pci-sriov -N 4 --vf-driver=vfio controller-0 ens2f0

Name: kube-sriov-device-plugin-amd64-vpc49
Namespace: kube-system
Priority: 0
Node: controller-0/192.168.206.2
Start Time: Tue, 18 Aug 2020 18:09:40 +0530
Labels: app=sriovdp
              controller-revision-hash=6cfb4bff7b
              pod-template-generation=1
              tier=node
Annotations: <none>
Status: Pending
IP: 192.168.206.2
IPs:
  IP: 192.168.206.2
Controlled By: DaemonSet/kube-sriov-device-plugin-amd64
Containers:
  kube-sriovdp:
    Container ID:
    Image: registry.local:9001/docker.io/starlingx/k8s-plugins-sriov-network-device:stx.4.0-v3.2-16-g4e0302ae
    Image ID:
    Port: <none>
    Host Port: <none>
    Args:
      --log-dir=sriovdp
      --log-level=10
    State: Waiting
      Reason: ContainerCreating
    Ready: False
    Restart Count: 0
    Environment: <none>
    Mounts:
      /etc/pcidp/config.json from config (ro)
      /var/lib/kubelet/ from devicesock (rw)
      /var/log from log (rw)
      /var/run/secrets/kubernetes.io/serviceaccount from sriov-device-plugin-token-nblkw (ro)
Conditions:
  Type Status
  Initialized True
  Ready False
  ContainersReady False
  PodScheduled True
Volumes:
  devicesock:
    Type: HostPath (bare host directory volume)
    Path: /var/lib/kubelet/
    HostPathType:
  log:
    Type: HostPath (bare host directory volume)
    Path: /var/log
    HostPathType:
  config:
    Type: HostPath (bare host directory volume)
    Path: /etc/pcidp/config.json
    HostPathType: File
  sriov-device-plugin-token-nblkw:
    Type: Secret (a volume populated by a Secret)
    SecretName: sriov-device-plugin-token-nblkw
    Optional: false
QoS Class: BestEffort
Node-Selectors: beta.kubernetes.io/arch=amd64
                 sriovdp=enabled
Tolerations: :NoSchedule
                 node.kubernetes.io/disk-pressure:NoSchedule
                 node.kubernetes.io/memory-pressure:NoSchedule
                 node.kubernetes.io/network-unavailable:NoSchedule
                 node.kubernetes.io/not-ready:NoExecute
                 node.kubernetes.io/pid-pressure:NoSchedule
                 node.kubernetes.io/unreachable:NoExecute
                 node.kubernetes.io/unschedulable:NoSchedule
Events:
  Type Reason Age From Message
  ---- ------ ---- ---- -------
  Normal Scheduled 77m default-scheduler Successfully assigned kube-system/kube-sriov-device-plugin-amd64-vpc49 to controller-0
  Warning FailedMount 54m (x5 over 72m) kubelet, controller-0 Unable to attach or mount volumes: unmounted volumes=[config], unattached volumes=[devicesock log config sriov-device-plugin-token-nblkw]: timed out waiting for the condition
  Warning FailedMount 36m (x2 over 57m) kubelet, controller-0 Unable to attach or mount volumes: unmounted volumes=[config], unattached volumes=[config sriov-device-plugin-token-nblkw devicesock log]: timed out waiting for the condition
  Warning FailedMount 32m (x7 over 75m) kubelet, controller-0 Unable to attach or mount volumes: unmounted volumes=[config], unattached volumes=[log config sriov-device-plugin-token-nblkw devicesock]: timed out waiting for the condition
  Warning FailedMount 7m11s (x8 over 70m) kubelet, controller-0 Unable to attach or mount volumes: unmounted volumes=[config], unattached volumes=[sriov-device-plugin-token-nblkw devicesock log config]: timed out waiting for the condition
  Warning FailedMount 114s (x45 over 77m) kubelet, controller-0 MountVolume.SetUp failed for volume "config" : hostPath type check failed: /etc/pcidp/config.json is not a file