OpenStack Object Storage (swift)

foreground auditor skips partitions

Bug #2054723 reported by clayg on 2024-02-22

This bug affects 1 person

Affects		Status	Importance	Assigned to	Milestone
	OpenStack Object Storage (swift)	New	Undecided	Unassigned

Bug Description

SRE noticed that when they try to run the foreground auditor targeting a specific disk (because they want to drain it and reconstructor is hitting io/disk errors) that it won't DO anything.

at the begining of a cycle the auditor workers write down a status file per-disk with all the parts on that disk:

$ cat /srv/node4/sdb4/objects/auditor_status_ALL.json
{"partitions": ["42", "24", "46", "54", "11", "13", "59", "36", "14", "15", "12", "2", "49", "10", "37", "0", "58", "22", "26", "38", "45", "33", "28", "16", "52", "63", "20", "25", "8", "30", "55", "19", "48", "17", "6", "51"]}

every 60s the worker will rewrite the file with the list of partitions it still has left to audit

At the end of the cycle it deletes the file.

Each cycle (or on process startup) WAY down in diskfile the audit location generator will look for a pre-existing status file (left over from a restarted auditor) and ONLY yield out those parts to audit.

If you run a fg-auditor while a bg-auditor is mid-cycle it won't "go back" and audit the parts that had already been processed by the bg-auditor.

If it's possible that the bg-auditor was "hung" for some reason the stale auditor-status file will be cleaned up at the end of the fg-auditor cycle and on the next invocation it should audit all locations again.

Currently the auditor only lets you target a specific device; we could perhaps add support to target specific parts in which case the status file should probably be ignored and definately should be removed at the end of the run.

the plubming into the audit_location_generator to "skip" the status file might be a little tricky... if we "just" added a "–partitions" option (e.g. https://github.com/NVIDIA/swift/blob/master/bin/swift-object-reconstructor#L28-L31) maybe it would look like:

partitions = partitions or get_auidtor_status()

https://github.com/NVIDIA/swift/blob/master/swift/obj/diskfile.py#L590

As a work around SRE can delete the auditor status file before begining a fg auditor run.

An alternative "fix" might at least do the listdir and LOG that some partitions are being skipped; noting that the auditor status file will be removed when it finishes and you can re-run again to audit all partitions.

Report a bug

This report contains Public information

Everyone can see this information.

You are

Subscribing...

Edit bug mail

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.