Can't download images in Maas 3.5.0 in HA mode with 3 nodes
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
MAAS |
Fix Committed
|
High
|
Alexsander de Souza | ||
3.5 |
Fix Committed
|
High
|
Alexsander de Souza |
Bug Description
In Maas 3.5.0 in HA mode with 3 nodes the image sync is broken.
How to reproduce: setup Maas with 3 nodes and try to download some images.
From my reproducer:
By looking at the sync-boot-resource workflow some child workflows like `bootresource-
when this happens the top level `sync-boot-
```
{
"type": "workflowExecut
"failure": {
"message": "Child Workflow execution terminated",
"cause": {
"message": "Terminated",
"
},
"childWorkf
"namespace": "default",
"
"runId": "3908106b-
},
"
"name": "download-
},
"
"
"retryState": "RETRY_
}
},
"retryState": "RETRY_
"workflowTask
}
```
in the run id `3908106b-
```
[
"terminated by new runID: 6cd76c58-
]
```
and in the run id `6cd76c58-
```
{
"type": "workflowExecut
"reason": "by parent close policy",
"identity": "temporal-
}
```
in the run id `3908106b-
```
[
"terminated by new runID: 6cd76c58-
]
```
and in the run id `6cd76c58-
```
{
"type": "workflowExecut
"reason": "by parent close policy",
"identity": "temporal-
}
```
in the run id `3908106b-
```
[
"terminated by new runID: 6cd76c58-
]
```
it's like the child workflow is started twice and since the first one is cancelled then the entire workflow is terminated
The two child workflows have the same parent but they are in 2 different task queues (the 2 regions that need to download the images from the region that started the sync-boot-resource s).
This is because the 2 child workflows are sharing the same workflow id, but it must be unique within the temporal namespace unless specified differently using a Reuse Policy https:/
Related branches
- Anton Troyanov: Approve
-
Diff: 13 lines (+1/-1)1 file modifiedsrc/maasserver/workflow/bootresource.py (+1/-1)
- Jacopo Rota: Approve
- MAAS Lander: Approve
-
Diff: 13 lines (+1/-1)1 file modifiedsrc/maasserver/workflow/bootresource.py (+1/-1)
Changed in maas: | |
status: | Triaged → In Progress |
assignee: | nobody → Alexsander de Souza (alexsander-souza) |
Changed in maas: | |
milestone: | 3.5.0 → 3.6.0 |
Changed in maas: | |
status: | In Progress → Fix Committed |