ZODB

cache GC interacts badly with multiple threads

Bug #113923 reported by Tim McLennan on 2007-05-11

	Status	Importance	Assigned to
ZODB	Fix Released	Undecided	Christian Theune
3.6	Fix Released	Undecided	Christian Theune
3.7	Fix Released	Undecided	Christian Theune
3.8	Fix Released	Undecided	Christian Theune

Bug Description

Okay, I somehow managed to lose my first version of what I was going to say here.

Basically, when you run with a cache-size of 0, (although it is theoretically possible to occur with other sizes as well), you can run into problems if the garbage collection for the cache is run while you are using objects in that thread. Unfortunately the db.open call will runs the GC on the caches for all connections when you create a new connection.

The sorts of errors that can occur can be quite confusing. For example take the code for the persistent mapping __getstate__ routine (as called eventually by Transaction.commit())
    def __getstate__(self):
        state = {}
        state.update(self.__dict__)
        state['_container'] = state['data']
        del state['data']
        return state

if the object is ghostified before the self.__dict__ reference then you'll get a KeyError('data') exception in the 3rd line.

Another case is when you store a reference to a mutable (non-persistent) subobject and then access it.

l = self.list
l.append([xyz])
self._p_changed = 1

then if self is ghostified before settting _p_changed then self.list != l and so you won't actually change the object like you mean to.

In fact if you're really unlucky you can get the case where:
self.member == self.member will be false.

Revision history for this message

Tim McLennan (quixilva) wrote on 2007-05-11:

test case illustrating the bug Edit (3.5 KiB, text/x-python)

I noticed the problem within our quite complex application, but fortunately I was able to reproduce the behaviour in a simple setting.
Attached is a python script that will create a PersistentMapping and populate it will a couple entries and then repeated access the entries while another thread is creating new connections and thus triggering the garbage collection.

Revision history for this message

Jim Fulton (jim-zope) wrote on 2007-05-11: Re: [Bug 113923] cache GC interacts badly with multiple threads

On May 10, 2007, at 10:38 PM, Tim McLennan wrote:

> Public bug reported:
>
> Okay, I somehow managed to lose my first version of what I was
> going to
> say here.
>
> Basically, when you run with a cache-size of 0, (although it is
> theoretically possible to occur with other sizes as well), you can run
> into problems if the garbage collection for the cache is run while you
> are using objects in that thread.

We generally try to avoid this ...

> Unfortunately the db.open call will
> runs the GC on the caches for all connections when you create a new
> connection.

Yipes. It shouldn't do that. Hm.

>
> The sorts of errors that can occur can be quite confusing. For
> example take the code for the persistent mapping __getstate__
> routine (as called eventually by Transaction.commit())
> def __getstate__(self):
> state = {}
> state.update(self.__dict__)
> state['_container'] = state['data']
> del state['data']
> return state
>
> if the object is ghostified before the self.__dict__ reference then
> you'll get a KeyError('data') exception in the 3rd line.

Interesting. I'll need to review the commit code. I would think
that the object would still be marked as changed at this point so it
wouldn't get ghostified, but the dance during commit may have already
marked it as saved -- although that would seem a bit odd.

> Another case is when you store a reference to a mutable (non-
> persistent)
> subobject and then access it.
>
> l = self.list
> l.append([xyz])
> self._p_changed = 1
>
> then if self is ghostified before settting _p_changed then
> self.list !=
> l and so you won't actually change the object like you mean to.
>
> In fact if you're really unlucky you can get the case where:
> self.member == self.member will be false.

I seem to remember a discussion a while back about the proper order
to do this sort of thing, although I think it shouldn't be this
delicate.

This is a very good argument to not do GC on a connection while
application code is using it.

Thanks for a very interesting and informative bug report. I'll dig
into this very soon -- hopefully today or this weekend.

Jim

--
Jim Fulton mailto:<email address hidden> Python Powered!
CTO (540) 361-1714 http://www.python.org
Zope Corporation http://www.zope.com http://www.zope.org

On May 10, 2007, at 10:38 PM, Tim McLennan wrote:

> Public bug reported:
>
> Okay, I somehow managed to lose my first version of what I was  
> going to
> say here.
>
> Basically, when you run with a cache-size of 0, (although it is
> theoretically possible to occur with other sizes as well), you can run
> into problems if the garbage collection for the cache is run while you
> are using objects in that thread.

We generally try to avoid this ...

>   Unfortunately the db.open call will
> runs the GC on the caches for all connections when you create a new
> connection.

Yipes.  It shouldn't do that.  Hm.

>
> The sorts of errors that can occur can be quite confusing.   For  
> example take the code for the persistent mapping __getstate__  
> routine (as called eventually by Transaction.commit())
>     def __getstate__(self):
>         state = {}
>         state.update(self.__dict__)
>         state['_container'] = state['data']
>         del state['data']
>         return state
>
> if the object is ghostified before the self.__dict__ reference then
> you'll get a KeyError('data') exception in the 3rd line.

Interesting.  I'll need to review the commit code.  I would think  
that the object would still be marked as changed at this point so it  
wouldn't get ghostified, but the dance during commit may have already  
marked it as saved  -- although that would seem a bit odd.

> Another case is when you store a reference to a mutable (non- 
> persistent)
> subobject and then access it.
>
> l = self.list
> l.append([xyz])
> self._p_changed = 1
>
> then if self is ghostified before settting _p_changed then  
> self.list !=
> l and so you won't actually change the object like you mean to.
>
> In fact if you're really unlucky you can get the case where:
> self.member == self.member will be false.

I seem to remember a discussion a while back about the proper order  
to do this sort of thing, although I think it shouldn't be this  
delicate.

This is a very good argument to not do GC on a connection while  
application code is using it.

Thanks for a very interesting and informative bug report.  I'll dig  
into this very soon -- hopefully today or this weekend.

Jim

--
Jim Fulton			mailto:jim@zope.com		Python Powered!
CTO 				(540) 361-1714			http://www.python.org
Zope Corporation	http://www.zope.com		http://www.zope.org

Revision history for this message

Christian Theune (ctheune) wrote on 2007-06-06:

I've talked to Jim here at the Potsdam sprint about this. I'm going to tackle this by changing the ZODB that whenever it does automatic garbage collection and iterates over all connections it will not GC connections that are open.

Changed in zodb:
assignee:	nobody → ct-gocept

Christian Theune (ctheune) on 2007-06-06

Changed in zodb:
status:	Unconfirmed → In Progress

Revision history for this message

Christian Theune (ctheune) wrote on 2007-11-09:

Setting to released because we don't do regular ZODB 3.6 releases anymore. If anybody needs it he can make one.

Revision history for this message

Christian Theune (ctheune) wrote on 2007-11-09:

Was released with ZODB 3.7.0c1.

Revision history for this message

Christian Theune (ctheune) wrote on 2007-11-09:

Released in ZODB 3.8.0b4

Report a bug

This report contains Public information

Everyone can see this information.

You are

Subscribing...

Edit bug mail

Other bug subscribers

Bug attachments

test case illustrating the bug Edit

Add attachment

Remote bug watches

Bug watches keep track of this bug in other bug trackers.