DC: helm operation failure during merging helm overrides

Bug #1903090 reported by Yuxing
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
StarlingX
Won't Fix
Medium
Frank Miller

Bug Description

Brief Description
-----------------
During the deployment of a large scale of DC, the armada application apply will be failed during merging helm overrides

Severity
--------
Minor

Steps to Reproduce
------------------
Write down the steps to reproduce the issue

Expected Behavior
------------------
The application should be uploaded and applied successfully

Actual Behavior
----------------
The application is validated successfully by Armada and uploaded, the application failed before the Armada apply cmd

Reproducibility
---------------
It is intermittent, state the frequency of occurrence

System Configuration
--------------------
Distributed cloud with multiple nodes

Branch/Pull Time/Commit
-----------------------
11/02/2020

Last Pass
---------
na

Timestamp/Logs
--------------
In sysinv.log seeing error message:
sysinv.conductor.kube_app HelmTillerFailure: Helm operation failure: Failed to render helm chart: Helm operation failure: install.go:159: [debug] Original chart version: ""

This exception Should be produced by install_helm_chart_with_dry_run()

Test Activity
-------------
Developer test

Workaround
----------
Apply it again with cmd: system application-apply <application name>

Yuxing (yuxing)
Changed in starlingx:
assignee: nobody → Yuxing (yuxing)
Yuxing (yuxing)
summary: - Helm operation failure during mergying helm overrides
+ DC: helm operation failure during merging helm overrides
Revision history for this message
Ghada Khalil (gkhalil) wrote :

stx.5.0 / medium priority - related to DC scaling feature

tags: added: stx.distcloud
Changed in starlingx:
importance: Undecided → Medium
status: New → Confirmed
status: Confirmed → Triaged
tags: added: stx.5.0
Revision history for this message
Yuxing (yuxing) wrote :

Add the sysinv.log to analyze this issue

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to config (master)

Fix proposed to branch: master
Review: https://review.opendev.org/762909

Changed in starlingx:
status: Triaged → In Progress
Revision history for this message
Yuxing (yuxing) wrote :

As the error message is unclear to indicate whether it is a tiller connection(most probably as it will succeed in the 2nd time apply) error or an error in the chart, improve the error handling to offer more info from the helm error message.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Change abandoned on config (master)

Change abandoned by Yuxing Jiang (<email address hidden>) on branch: master
Review: https://review.opendev.org/762909
Reason: The buffering shouldn't impact the incomplete err message

Revision history for this message
Bart Wensley (bartwensley) wrote :

This is an issue in the helm/tiller code (not a DC issue). I am assigning this to Frank so the containers team can prioritize/investigate.

tags: added: stx.containers
removed: stx.distcloud
Changed in starlingx:
assignee: Yuxing (yuxing) → Frank Miller (sensfan22)
Revision history for this message
Frank Miller (sensfan22) wrote :

This issue is not seen recently and was only reported the one time. If the frequency of this issue increases then please open a new LP with a recent load.

Changed in starlingx:
status: In Progress → Won't Fix
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Bug attachments

Remote bug watches

Bug watches keep track of this bug in other bug trackers.