container-sync apparently stuck on certain objects with status 408 or 409
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
OpenStack Object Storage (swift) | Fix Released | Undecided | Eran Rom |
Bug Description
hi,
we've been trying container-sync again with swift 1.13 and are running into
some issues with (afaict) some objects.
There are two swift clusters, eqiad being the primary and codfw being the
secondary, so we'd like to set up container sync eqiad -> codfw for ~5k out of
~40k containers in eqiad.
Setting up container sync seems to work except that on some containers/objects
it seemingly gets stuck, e.g.
eqiad# swift stat wikipedia-
Account: AUTH_mw
Container: wikipedia-
Objects: 8435
Bytes: 385823776
Read ACL: mw:media,.r:*
Write ACL: mw:media
Sync To: //mw_media/
Sync Key: REDACTED
Accept-Ranges: bytes
X-Timestamp: 1381945180.34197
X-Trans-Id: tx307f2b3824b24
Content-Type: text/plain; charset=utf-8
codfw# swift stat wikipedia-
Account: AUTH_mw
Container: wikipedia-
Objects: 1291
Bytes: 58424838
Read ACL:
Write ACL:
Sync To:
Sync Key: REDACTED
Accept-Ranges: bytes
X-Timestamp: 1414080235.37348
X-Trans-Id: txe6677cd472114
Content-Type: text/plain; charset=utf-8
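For reference, the gap between the two stat outputs above can be computed with a small script. This is only a sketch: the helper names `parse_stat` and `sync_gap` are made up for illustration, and it assumes the `Key: value` layout shown above, with only the Objects and Bytes fields pasted in.

```python
def parse_stat(text):
    """Parse `swift stat`-style 'Key: value' lines into a dict."""
    fields = {}
    for line in text.splitlines():
        # Split on the first colon only, so values such as
        # 'mw:media,.r:*' are kept intact.
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


def sync_gap(source_stat, dest_stat):
    """Return how many objects and bytes the destination is missing."""
    src = parse_stat(source_stat)
    dst = parse_stat(dest_stat)
    return {
        "objects": int(src["Objects"]) - int(dst["Objects"]),
        "bytes": int(src["Bytes"]) - int(dst["Bytes"]),
    }


# Values copied from the eqiad and codfw outputs above.
EQIAD = """Objects: 8435
Bytes: 385823776"""
CODFW = """Objects: 1291
Bytes: 58424838"""

gap = sync_gap(EQIAD, CODFW)
print(gap)
```

With the numbers above, codfw is 7144 objects (327398938 bytes) behind eqiad for this container.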
we have observed from the object-server logs in codfw that for certain objects
(e.g. f/fc/Esistono_
object-server replies with 408 (or 409) to PUT and proxy-server relays that
back and eventually container-sync processes all get stuck on that failing
object.
the attached log below shows when this first happens (i.e. the initial
container-sync)
I've also attached the object/container info for an object that doesn't work
and one that works at the bottom.
I'd be happy to provide more details if needed!
thanks,
filippo
object details