NetApp: available snapshot physically not created
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Shared File Systems Service (Manila) | Fix Released | Low | Maurice Escher | | |
Bug Description
Hi,
I have snapshots in the available state in my Manila installation that, according to their provider location, do not physically exist on the NetApp backend.
So from the frontend/Manila perspective we have a working snapshot, but once we try to use it we notice it is broken; or rather, my customer reported that a share-revert-
I found some more examples: 18 missing snapshots out of a total of 1542, so about 1.2 percent.
I'm still a bit puzzled about how this could happen. Most of the occurrences were on systems under very high load.
I think the NetApp driver should double-check, after sending snapshot-create, that the snapshot has actually been created.
I will try to work on this improvement myself.
BR,
Maurice
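A minimal sketch of the post-create check proposed above. It is not the fix that was merged. The `client` object and its `create_snapshot` / `snapshot_exists` methods are hypothetical placeholders, not the real NetApp driver API, and the retry count and delay are arbitrary assumptions.

```python
import time


class SnapshotNotFound(Exception):
    """Raised when a snapshot is still not visible on the backend after creation."""


def create_snapshot_verified(client, volume_name, snapshot_name,
                             retries=5, delay=2.0, sleep=time.sleep):
    """Create a snapshot, then poll the backend until it is visible.

    `client` is a hypothetical backend client exposing
    create_snapshot(volume, name) and snapshot_exists(volume, name).
    These names are assumptions made for illustration only.
    """
    client.create_snapshot(volume_name, snapshot_name)
    for attempt in range(retries):
        # Ask the backend directly instead of trusting the create call.
        if client.snapshot_exists(volume_name, snapshot_name):
            return snapshot_name
        if attempt < retries - 1:
            sleep(delay)
    # Failing here lets the caller mark the snapshot as errored
    # instead of reporting it as available.
    raise SnapshotNotFound(
        "snapshot %s not found on volume %s after %d checks"
        % (snapshot_name, volume_name, retries))
```

With a check like this, a snapshot that never appears on the backend would surface as an error at creation time rather than at the first revert attempt.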
Changed in manila: | |
importance: | Undecided → Low |
assignee: | nobody → Maurice Escher (maurice-escher) |
milestone: | none → zed-2 |
Changed in manila: | |
milestone: | zed-2 → zed-3 |
Changed in manila: | |
milestone: | zed-3 → zed-rc1 |
Changed in manila: | |
milestone: | zed-rc1 → antelope-1 |
Changed in manila: | |
milestone: | antelope-1 → antelope-2 |
Changed in manila: | |
status: | In Progress → Fix Committed |
Changed in manila: | |
status: | Fix Committed → Fix Released |
To add some more detail from my investigation:
I'm running Xena with DHSS=true.
I know that the snapshot transitioned into the available state seconds after it was created (so it was not a reset-state or any later change in the database). It has also not been involved in a replication scenario.
Some older snapshots (2 years old; 5 of the 18 mentioned above) did not even have a provider_location set.
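For anyone who wants to run a similar audit, here is a rough sketch of the comparison described in this report. It assumes the snapshot records have been exported as plain dicts and that `provider_location` holds the snapshot name as the backend reports it. Both are simplifications, and `audit_snapshots` is a hypothetical helper, not part of Manila.

```python
def audit_snapshots(records, backend_snapshots):
    """Classify 'available' snapshot records against the backend listing.

    records: iterable of dicts with 'id', 'status' and 'provider_location'
             (a simplified stand-in for Manila's snapshot instances).
    backend_snapshots: set of snapshot names that exist on the backend.
    """
    result = {"ok": [], "no_provider_location": [], "missing_on_backend": []}
    for rec in records:
        if rec.get("status") != "available":
            continue  # only 'available' snapshots are expected to exist
        location = rec.get("provider_location")
        if not location:
            # Record never got a backend location at all.
            result["no_provider_location"].append(rec["id"])
        elif location not in backend_snapshots:
            # Record points at a snapshot the backend does not have.
            result["missing_on_backend"].append(rec["id"])
        else:
            result["ok"].append(rec["id"])
    return result
```

The two failure buckets correspond to the two cases seen here: snapshots without any `provider_location`, and snapshots whose location does not exist on the backend.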