Disaggregation calculator stuck after completing hazard curve calculation
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenQuake Engine |
Expired
|
Critical
|
Unassigned |
Bug Description
When running a disaggregation calculation using the US model for only 2 sites, the calculation seems not to progress after completing the hazard curve calculation.
I attach the log file. The calculation started on 2013-08-27 09:44:05, the hazard curve calculation completed ~ 30 min later ([2013-08-27 10:13:27,397 hazard #15 gemcontrol.ethz.ch PROGRESS MainProcess/26692 root] ** > Hazard curve computation complete), but after one day the calculation did not finish and remains at:
[2013-08-27 10:13:27,397 hazard #15 gemcontrol.ethz.ch PROGRESS MainProcess/26692 root] ** > Starting disaggregation.
My job is still running:
│ dmonelli 26659 openquake --rh=job_disagg.ini │
│ --exports=xml --log-file=
│ --config-
│ dmonelli 26692 openquake --rh=job_disagg.ini │
│ --exports=xml --log-file=
│ --config-
│ dmonelli 26693 openquake --rh=job_disagg.ini │
│ --exports=xml --log-file=
│ --config-
and from the cluster status I see only one active task:
Host: openquake.gemsun01 │
│ Status: Online │
│ Worker processes: 32 │
│ Active tasks: 0 │
│ ========== │
│ Host: openquake.
│ Status: Online │
│ Worker processes: 48 │
│ Active tasks: 0 │
│ ========== │
│ Host: openquake.gemsun03 │
│ Status: Online │
│ Worker processes: 32 │
│ Active tasks: 0 │
│ ========== │
│ │
│ Total workers: 224 │
│ Active tasks: 1 │
│ Cluster utilization: 0.45%
Changed in oq-engine: | |
status: | New → Triaged |
Changed in oq-engine: | |
status: | Triaged → Incomplete |
It would be nice to try this computation with the new disaggregation