Reconstruction error

Bug #1489587 reported by paul luse
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
OpenStack Object Storage (swift)
Fix Released
Medium
Unassigned

Bug Description

(192.168.91.43):

Aug 27 12:45:29 localhost object-server: ERROR __call__ error with PUT /d24/12/AUTH_BMPA_Intel/ssbench_64MB_000075_ec1014-12/large_324011 :
#012Traceback (most recent call last):#012 File "/usr/local/lib/python2.7/dist-packages/swift/obj/server.py", line 938, in __call__#012
res = method(req)#012 File "/usr/local/lib/python2.7/dist-packages/swift/common/utils.py", line 2668, in wrapped#012 return func(*a, **k
w)#012 File "/usr/local/lib/python2.7/dist-packages/swift/common/utils.py", line 1208, in _timing_stats#012 resp = func(ctrl, *args, **k
wargs)#012 File "/usr/local/lib/python2.7/dist-packages/swift/obj/server.py", line 617, in PUT#012 for chunk in iter(timeout_reader, '')
:#012 File "/usr/local/lib/python2.7/dist-packages/swift/obj/server.py", line 397, in timeout_reader#012 return file_like.read(self.netw
ork_chunk_size)#012 File "/usr/local/lib/python2.7/dist-packages/swift/common/utils.py", line 494, in read#012 chunk = next(self.iterato
r)#012 File "/usr/local/lib/python2.7/dist-packages/swift/obj/ssync_receiver.py", line 346, in subreq_iter#012 min(left, self.app.networ
k_chunk_size))#012 File "/usr/lib/python2.7/dist-packages/eventlet/wsgi.py", line 188, in read#012 return self._chunked_read(self.rfile,
 length)#012 File "/usr/lib/python2.7/dist-packages/eventlet/wsgi.py", line 178, in _chunked_read#012 self.chunk_length = int(rfile.read
line().split(b";", 1)[0], 16)#012ValueError: invalid literal for int() with base 16: ''

Aug 27 12:45:29 localhost object-server: 192.168.91.41/d24/12 EXCEPTION in replication.Receiver: #012Traceback (most recent call last):#012
 File "/usr/local/lib/python2.7/dist-packages/swift/obj/ssync_receiver.py", line 104, in __call__#012 for data in self.updates():#012 Fi
le "/usr/local/lib/python2.7/dist-packages/swift/obj/ssync_receiver.py", line 298, in updates#012 line = self.fp.readline(self.app.networ
k_chunk_size)#012 File "/usr/lib/python2.7/dist-packages/eventlet/wsgi.py", line 193, in readline#012 return self._chunked_read(self.rfi
le, size, True)#012 File "/usr/lib/python2.7/dist-packages/eventlet/wsgi.py", line 178, in _chunked_read#012 self.chunk_length = int(rfi
le.readline().split(b";", 1)[0], 16)#012ValueError: invalid literal for int() with base 16: ''

(192.168.91.41):

Aug 27 12:45:29 localhost object-reconstructor: 192.168.91.43:6003/d24/12 EXCEPTION in replication.Sender: #012Traceback (most recent call last):#012 File "/usr/local/lib/python2.7/dist-packages/swift/obj/ssync_sender.py", line 75, in __call__#012 self.updates()#012 File "/usr/local/lib/python2.7/dist-packages/swift/obj/ssync_sender.py", line 290, in updates#012 self.send_put(url_path, df)#012 File "/usr/local/lib/python2.7/dist-packages/swift/obj/ssync_sender.py", line 344, in send_put#012 for chunk in df.reader():#012 File "/usr/local/lib/python2.7/dist-packages/swift/obj/reconstructor.py", line 97, in reader#012 for chunk in self.rebuilt_fragment_iter:#012 File "/usr/local/lib/python2.7/dist-packages/swift/obj/reconstructor.py", line 331, in fragment_payload_iter#012 policy, fragment_payload, frag_index)#012 File "/usr/local/lib/python2.7/dist-packages/swift/obj/reconstructor.py", line 290, in _reconstruct#012 [frag_index])[0]#012 File "/usr/lib/python2.7/dist-packages/pyeclib/ec_iface.py", line 247, in reconstruct#012 available_fragment_payloads, missing_fragment_indexes)#012 File "/usr/lib/python2.7/dist-packages/pyeclib/core.py", line 116, in reconstruct#012 raise ECDriverError(e)#012ECDriverError: Reconstruct ERROR: Fragment integrity check failed. Please inspect syslog for liberasurecode error report.

(192.168.91.41 syslog):

Aug 27 12:45:29 r7n01 liberasurecode[3413]: Invalid fragment header (get idx)!
Aug 27 12:45:29 r7n01 liberasurecode[3413]: Could not properly partition the fragments!

CVE References

Revision history for this message
clayg (clay-gerrard) wrote :

good logs paul!

I think cleaning up the way that ssync sender disconnects would be very helpful to taking some noise out of the logs.

SSYNC has a (weird?) tunneling protocol (currently) where it wraps HTTP requests inside of an ongoing chunked-transfer HTTP request. When it disconnects that old eventlet chunked transfer ValueError disconnect message pops (I'm not sure why that shows up in the logs unless eventlet_debug = True).

In order to disconnect cleanly ssync_sender can write a chunked transfer termination line to the wire.

Changed in swift:
status: New → Confirmed
importance: Undecided → Medium
Revision history for this message
clayg (clay-gerrard) wrote :

FWIW I have no idea about the pyeclib reconstruct error - so I'm going to pretend this bug report is just about the ValueError

Revision history for this message
Caleb Tennis (ctennis) wrote :

(Clay, I was the one who filed this, Paul was just logged in at the time so it came through his account)

No idea if the pyeclib error was related, it just showed up at the same timestamp so wanted to include it.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to swift (master)

Reviewed: https://review.openstack.org/218078
Committed: https://git.openstack.org/cgit/openstack/swift/commit/?id=05de1305a903ee4ce9c8c50fde53c552d5b90d51
Submitter: Jenkins
Branch: master

commit 05de1305a903ee4ce9c8c50fde53c552d5b90d51
Author: Clay Gerrard <email address hidden>
Date: Thu Aug 27 18:35:09 2015 -0700

    Make ssync_sender send valid chunked requests

    The connect method of ssync_sender tells the remote connection that it's
    going to send a valid HTTP chunked request, but if the remote end needs
    to respond with an error of any kind sender throws HTTP right out the
    window, picks up his ball, and closes the socket down hard - much to the
    surprise of the eventlet.wsgi server who up to this point had been
    playing along quite nicely with this 'SSYNC' nonsense assuming that
    everyone here is consenting mature adults.

    If you're going to make a "Transfer-Encoding: chunked" request have the
    good decency to finish the job with a proper '0\r\n\r\n'. [1]

    N.B. It might be possible to handle an error status during the
    initialize_request phase with some sort of 100-continue support, but
    honestly it's not entirely clear to me when the server isn't going to
    close the connection if the client is still expected to send the body
    [2] - further if the error comes later during missing_check or updates
    we'll for sure want to send the chunk transfer termination line before
    we close down the socket and this way we cover both.

    1. Really, eventlet.wsgi shouldn't be so blasted brittle about this [3]
    2. https://lists.w3.org/Archives/Public/ietf-http-wg/2005AprJun/0007.html
    3. https://github.com/eventlet/eventlet/commit/c3ce3eef0b4d0dfdbfb1ec0186d4bb204fb8ecd5

    Closes-Bug #1489587
    Change-Id: Ic17c6c3075553f8cf6ef6213e62a00282f0d01cf

Changed in swift:
status: Confirmed → Fix Committed
Revision history for this message
Thierry Carrez (ttx) wrote :

(this was merged after the 2.4.0 tag so this will be picked up in next release)

Changed in swift:
milestone: none → 2.4.0
status: Fix Committed → Fix Released
status: Fix Released → Fix Committed
milestone: 2.4.0 → none
milestone: none → 2.4.0
status: Fix Committed → Fix Released
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to swift (feature/crypto)

Fix proposed to branch: feature/crypto
Review: https://review.openstack.org/219775

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to swift (feature/crypto)
Download full text (43.3 KiB)

Reviewed: https://review.openstack.org/219775
Committed: https://git.openstack.org/cgit/openstack/swift/commit/?id=257e468e9bfd1088040419ad408106ac3c77b531
Submitter: Jenkins
Branch: feature/crypto

commit e02609c66a804845672413b06830b87395afef31
Author: Samuel Merritt <email address hidden>
Date: Tue Sep 1 15:19:50 2015 -0700

    Preserve traceback in swift-dispersion-report

    Commit c690bcb fixed a bug in the dispersion report, but changed this
    from a bare "raise" to "raise err", which loses the traceback. Not a
    big deal, but worth putting back IMO.

    Change-Id: Id5b72153a4b8df8e3faaf1fa3fb2040e28ba85cc

commit d06d4ad0fd2dfe69da8008e729651264522c6c06
Author: Minwoo Bae <email address hidden>
Date: Tue Sep 1 15:08:44 2015 -0500

    Included reference in swift.obj.diskfile to enumerate the string
    used for data file paths.

    Change-Id: Ie22caa678bc00dfc43fabec7efbbb9f34490f1b5

commit 524c89b7eeff037b8a6b421888771e15f98c2da2
Author: John Dickinson <email address hidden>
Date: Fri Aug 21 13:39:41 2015 -0700

    Updated CHANGELOG, AUTHORS, and .mailmap for 2.4.0 release.

    Change-Id: Ic6301146b839c9921bb85c4f4c1e585c9ab66661

commit 05de1305a903ee4ce9c8c50fde53c552d5b90d51
Author: Clay Gerrard <email address hidden>
Date: Thu Aug 27 18:35:09 2015 -0700

    Make ssync_sender send valid chunked requests

    The connect method of ssync_sender tells the remote connection that it's
    going to send a valid HTTP chunked request, but if the remote end needs
    to respond with an error of any kind sender throws HTTP right out the
    window, picks up his ball, and closes the socket down hard - much to the
    surprise of the eventlet.wsgi server who up to this point had been
    playing along quite nicely with this 'SSYNC' nonsense assuming that
    everyone here is consenting mature adults.

    If you're going to make a "Transfer-Encoding: chunked" request have the
    good decency to finish the job with a proper '0\r\n\r\n'. [1]

    N.B. It might be possible to handle an error status during the
    initialize_request phase with some sort of 100-continue support, but
    honestly it's not entirely clear to me when the server isn't going to
    close the connection if the client is still expected to send the body
    [2] - further if the error comes later during missing_check or updates
    we'll for sure want to send the chunk transfer termination line before
    we close down the socket and this way we cover both.

    1. Really, eventlet.wsgi shouldn't be so blasted brittle about this [3]
    2. https://lists.w3.org/Archives/Public/ietf-http-wg/2005AprJun/0007.html
    3. https://github.com/eventlet/eventlet/commit/c3ce3eef0b4d0dfdbfb1ec0186d4bb204fb8ecd5

    Closes-Bug #1489587
    Change-Id: Ic17c6c3075553f8cf6ef6213e62a00282f0d01cf

commit 993ee4e37af1961adba2047d5aa2eb210e423eb3
Author: nakagawamsa <email address hidden>
Date: Fri Aug 28 11:49:43 2015 +0900

    Remove duplicate X-Backend-Storage-Policy-Index key

    There is duplicate 'X-Backend-Storage-Policy-Index' dictionary key in unit.obj.test_server.py.
    One key has fixed policy index value, and another ha...

tags: added: in-feature-crypto
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to swift (feature/hummingbird)

Fix proposed to branch: feature/hummingbird
Review: https://review.openstack.org/221410

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to swift (feature/hummingbird)
Download full text (70.7 KiB)

Reviewed: https://review.openstack.org/221410
Committed: https://git.openstack.org/cgit/openstack/swift/commit/?id=eb8f1f83f1cfc63d8452bc30096fd1c145781527
Submitter: Jenkins
Branch: feature/hummingbird

commit cb683d391cb66d0f52830de16760c80fd2afedf9
Author: OpenStack Proposal Bot <email address hidden>
Date: Sat Sep 5 06:17:51 2015 +0000

    Imported Translations from Transifex

    For more information about this automatic import see:
    https://wiki.openstack.org/wiki/Translations/Infrastructure

    Change-Id: I2d92b8e34a665fb0bb4c048cfb0c59de295dfce6

commit e4542455c8a07b7981c247df8b737816062c1655
Author: Emett Speer <email address hidden>
Date: Wed Sep 2 17:18:03 2015 -0700

    [Labs] Update links to Cloud Admin Guide

    Update links to the Cloud Admin Guide after the
    RST conversion of that book altered URLs.

    Change-Id: I899f8938498b744e62887968a65e58c00ef27f1b

commit 58fcc07523978306cd3889ada73af5d9e664cf59
Author: Christian Schwede <email address hidden>
Date: Wed Sep 2 10:52:34 2015 +0000

    Test if container_sweep is executed on unmounted devices

    This change ensures that container_sweep is not run if a device is not mounted
    and mount_check is set to True.

    Change-Id: I823083c8431d9e61fd426508033ec9188503957b

commit e02609c66a804845672413b06830b87395afef31
Author: Samuel Merritt <email address hidden>
Date: Tue Sep 1 15:19:50 2015 -0700

    Preserve traceback in swift-dispersion-report

    Commit c690bcb fixed a bug in the dispersion report, but changed this
    from a bare "raise" to "raise err", which loses the traceback. Not a
    big deal, but worth putting back IMO.

    Change-Id: Id5b72153a4b8df8e3faaf1fa3fb2040e28ba85cc

commit d06d4ad0fd2dfe69da8008e729651264522c6c06
Author: Minwoo Bae <email address hidden>
Date: Tue Sep 1 15:08:44 2015 -0500

    Included reference in swift.obj.diskfile to enumerate the string
    used for data file paths.

    Change-Id: Ie22caa678bc00dfc43fabec7efbbb9f34490f1b5

commit 615c7a204b9386e05c5bab658bfe96766ad1e680
Author: Brian Cline <email address hidden>
Date: Tue Sep 1 10:51:20 2015 -0500

    Adds useful dispersion info from changelog

    Change-Id: I1a45088fc32620b02ff9a754b02ec1eb75a59d6e

commit 3b8755098a1786c5447abf158bd686293a82977c
Author: janonymous <email address hidden>
Date: Sun Aug 2 21:29:13 2015 +0530

    Replace a / b with a // b to use integer division where needed

    Change-Id: I72c81faa62786e140b0de00e3a04934bf1b5adbd

commit 524c89b7eeff037b8a6b421888771e15f98c2da2
Author: John Dickinson <email address hidden>
Date: Fri Aug 21 13:39:41 2015 -0700

    Updated CHANGELOG, AUTHORS, and .mailmap for 2.4.0 release.

    Change-Id: Ic6301146b839c9921bb85c4f4c1e585c9ab66661

commit 05de1305a903ee4ce9c8c50fde53c552d5b90d51
Author: Clay Gerrard <email address hidden>
Date: Thu Aug 27 18:35:09 2015 -0700

    Make ssync_sender send valid chunked requests

    The connect method of ssync_sender tells the remote connection that it's
    going to send a valid HTTP chunked request, but if the remote end needs
    to respond with an error of any kind sender th...

tags: added: in-feature-hummingbird
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.