Bug #984055 “bug in the classical PSHA calculator” : Bugs : OpenQuake (deprecated)

Revision history for this message

Vitor Silva (vitor-silva) wrote on 2012-04-17:

#1

classical_PSHA_risk_based.zip Edit (141.6 KiB, application/zip)

Vitor Silva (vitor-silva) on 2012-04-17

description:

updated

John Tarter (toh2) on 2012-04-18

Changed in openquake:
milestone:	none → 0.7.0
importance:	Undecided → Critical

Muharem Hrnjadovic (al-maisan) on 2012-04-18

tags:

added: defect enduser-visible risk

Revision history for this message

Andrea Cerisara (acerisara) wrote on 2012-04-19:

#2

From what I understood looking at the data, the problem could arise when we have region borders that are not an exact multiple of the grid spacing. For example, having a region like this,

region = shapes.Region.from_coordinates([(1.0, 2.0), (2.0, 2.0), (2.0, 1.0), (1.0, 1.0)])
grid_spacing = 0.5

and iterating over it, we generate these sites (that are supposed to be the center of the cell):

[Site(1.0, 1.0), Site(1.5, 1.0), Site(2.0, 1.0), Site(1.0, 1.5), Site(1.5, 1.5), Site(2.0, 1.5), Site(1.0, 2.0), Site(1.5, 2.0), Site(2.0, 2.0)]

and these points:

Point(row=0, column=0)
Point(row=0, column=1)
Point(row=0, column=2)
Point(row=1, column=0)
Point(row=1, column=1)
Point(row=1, column=2)
Point(row=2, column=0)
Point(row=2, column=1)
Point(row=2, column=2)

In this case whatever site I use inside the region the component is able to correctly translate to the grid point, for example:

point = region.grid.point_at(shapes.Site(1.9, 2.0))
site = region.grid.site_at(point)

`point` is Point(row=2, column=2) and `site` is Site(2.0, 2.0) (the center of the cell)

Let's now take a similar region:

region = shapes.Region.from_coordinates([(1.0, 2.0), (2.3, 2.0), (2.3, 1.0), (1.0, 1.0)])
grid_spacing = 0.5

In this case the region borders are not an exact multiple of the grid spacing. We always have the same sites:

[Site(1.0, 1.0), Site(1.5, 1.0), Site(2.0, 1.0), Site(1.0, 1.5), Site(1.5, 1.5), Site(2.0, 1.5), Site(1.0, 2.0), Site(1.5, 2.0), Site(2.0, 2.0)]

and I think it's correct because the next generated points/sites would be outside the region (Site(2.5, 2.0), for example). We also have the same points:

Point(row=0, column=0)
Point(row=0, column=1)
Point(row=0, column=2)
Point(row=1, column=0)
Point(row=1, column=1)
Point(row=1, column=2)
Point(row=2, column=0)
Point(row=2, column=1)
Point(row=2, column=2)

Here is the problem. If I take, for example, Site(2.25, 2.0) that is inside the region (note: risk sites are taken from the exposure file that is defined by the user, so every site inside the region is valid, also really closed to the borders) and ask for the point, I get:

point = region.grid.point_at(shapes.Site(2.25, 2.0))
site = region.grid.site_at(point)

`point` is Point(row=2, column=3) and `site` is Site(2.5, 2.0) (the center of the cell)

Site(2.5, 2.0) is not in the set of the sites computed by the region. Since we compute the hazard by iterating over these sites this means we can't find the pre-computed hazard input for the risk site we are interested in.

A solution I see to address this problem is to add an additional row and column in the grid we generate, to be sure we handle correctly sites really closed the region border.