Bug #594954 “uBLAS GMRES solver fails on first run” : Bugs : DOLFIN

Garth Wells (garth-wells) on 2010-06-16

Changed in dolfin:
status:	New → Confirmed

Garth Wells (garth-wells) on 2010-06-16

Changed in dolfin:
milestone:	none → 0.9.8

Revision history for this message

Johannes Ring (johannr) wrote on 2010-06-16:

#1

I couldn't reproduce this problem either on my laptop or on any of the buildbots. However, on my desktop computer at home (running Lucid 64 bit), I ran into the same issue. It happens about half the times when I run the solver repeatedly. I tried to reconfigure DOLFIN like this:

scons configure customCxxFlags="-DBOOST_UBLAS_TYPE_CHECK=0"

This got rid of the error message but sometimes the solver completely hangs and then I have to kill the process:

  johannr@ubuntu-htpc:/tmp$ python solve3.py
  Matrix of size 16 x 16 has 82 nonzero entries.
  Sorting sparsity pattern.
  Solving linear system of size 16 x 16 (uBLAS Krylov solver).
  Terminated
  johannr@ubuntu-htpc:/tmp$

Anyone else experiencing this behaviour when DOLFIN is configured with the flag above?

Revision history for this message

Anders Logg (logg) wrote on 2010-06-28:

#2

I get the same error message. Will check what goes wrong.

Revision history for this message

Anders Logg (logg) wrote on 2010-06-28:

#3

And it happens only occasionally, not every time. And it doesn't seem to correlate with instant-clean.

Revision history for this message

Anders Logg (logg) wrote on 2010-06-28:

#4

The problem was that x = Vector(b.size()) created a vector which sometimes contained strange numbers. This has been changed now (in uBLASVector::resize()) so that the vector is always zero. Took me a while to work through all the 29 different axpy_prod that call each other in operation.hpp in uBLAS...

Should be fixed now.

Changed in dolfin:
status:	Confirmed → Fix Committed

Revision history for this message

Mehdi Nikbakht (m-nikbakht) wrote on 2010-06-29: Re: [Dolfin] [Bug 594954] Re: uBLAS GMRES solver fails on first run

#5

Is there any specific reason why we have this type of problems in
resizing uBLAS vectors not other type of vectors?

Note that resizing a vector becomes important in the adaptive methods.
Since when a function updates, we need to resize its underlying vector.
If we clear a vector, then we lose all data that we had there.

Mehdi

On Mon, 2010-06-28 at 15:01 +0000, Anders Logg wrote:
> The problem was that x = Vector(b.size()) created a vector which
> sometimes contained strange numbers. This has been changed now (in
> uBLASVector::resize()) so that the vector is always zero. Took me a
> while to work through all the 29 different axpy_prod that call each
> other in operation.hpp in uBLAS...
>
> Should be fixed now.
>
> ** Changed in: dolfin
> Status: Confirmed => Fix Committed
>

Revision history for this message

Anders Logg (logg) wrote on 2010-06-29:

#7

On Tue, Jun 29, 2010 at 11:50:38AM +0200, Mehdi wrote:
> Is there any specific reason why we have this type of problems in
> resizing uBLAS vectors not other type of vectors?

Yes, because of a bug in the uBLAS resize function (it does not
initialize data).

> Note that resizing a vector becomes important in the adaptive methods.
> Since when a function updates, we need to resize its underlying vector.
> If we clear a vector, then we lose all data that we had there.

There's no point in keeping the old dofs as part of the vector since
the number of the mesh (and the old dofs) may change. What is needed
for adaptive methods is to interpolate the old solution to the new
mesh (which is already implemented in DOLFIN).

--
Anders

> Mehdi
>
> On Mon, 2010-06-28 at 15:01 +0000, Anders Logg wrote:
> > The problem was that x = Vector(b.size()) created a vector which
> > sometimes contained strange numbers. This has been changed now (in
> > uBLASVector::resize()) so that the vector is always zero. Took me a
> > while to work through all the 29 different axpy_prod that call each
> > other in operation.hpp in uBLAS...
> >
> > Should be fixed now.
> >
> > ** Changed in: dolfin
> > Status: Confirmed => Fix Committed
> >
>
>
> _______________________________________________
> Mailing list: https://launchpad.net/~dolfin
> Post to : <email address hidden>
> Unsubscribe : https://launchpad.net/~dolfin
> More help : https://help.launchpad.net/ListHelp

Revision history for this message

Mehdi Nikbakht (m-nikbakht) wrote on 2010-06-29:

#8

On Tue, 2010-06-29 at 12:13 +0200, Anders Logg wrote:
> On Tue, Jun 29, 2010 at 11:50:38AM +0200, Mehdi wrote:
> > Is there any specific reason why we have this type of problems in
> > resizing uBLAS vectors not other type of vectors?
>
> Yes, because of a bug in the uBLAS resize function (it does not
> initialize data).
>
> > Note that resizing a vector becomes important in the adaptive methods.
> > Since when a function updates, we need to resize its underlying vector.
> > If we clear a vector, then we lose all data that we had there.
>
> There's no point in keeping the old dofs as part of the vector since
> the number of the mesh (and the old dofs) may change. What is needed
> for adaptive methods is to interpolate the old solution to the new
> mesh (which is already implemented in DOLFIN).

This is not the case in XFEM where we have a fixed mesh but the number
of degrees of freedom is changing. Although this is not a big challenge,
since we can define some local variables to transfer data from the old
vector to new one.

Mehdi
>
> --
> Anders
>
>
> > Mehdi
> >
> > On Mon, 2010-06-28 at 15:01 +0000, Anders Logg wrote:
> > > The problem was that x = Vector(b.size()) created a vector which
> > > sometimes contained strange numbers. This has been changed now (in
> > > uBLASVector::resize()) so that the vector is always zero. Took me a
> > > while to work through all the 29 different axpy_prod that call each
> > > other in operation.hpp in uBLAS...
> > >
> > > Should be fixed now.
> > >
> > > ** Changed in: dolfin
> > > Status: Confirmed => Fix Committed
> > >
> >
> >
> > _______________________________________________
> > Mailing list: https://launchpad.net/~dolfin
> > Post to : <email address hidden>
> > Unsubscribe : https://launchpad.net/~dolfin
> > More help : https://help.launchpad.net/ListHelp

Revision history for this message

Garth Wells (garth-wells) wrote on 2010-06-29:

#9

On 29/06/10 14:33, Mehdi wrote:
> On Tue, 2010-06-29 at 12:13 +0200, Anders Logg wrote:
>> On Tue, Jun 29, 2010 at 11:50:38AM +0200, Mehdi wrote:
>>> Is there any specific reason why we have this type of problems in
>>> resizing uBLAS vectors not other type of vectors?
>>
>> Yes, because of a bug in the uBLAS resize function (it does not
>> initialize data).
>>
>>> Note that resizing a vector becomes important in the adaptive methods.
>>> Since when a function updates, we need to resize its underlying vector.
>>> If we clear a vector, then we lose all data that we had there.
>>
>> There's no point in keeping the old dofs as part of the vector since
>> the number of the mesh (and the old dofs) may change. What is needed
>> for adaptive methods is to interpolate the old solution to the new
>> mesh (which is already implemented in DOLFIN).
>
> This is not the case in XFEM where we have a fixed mesh but the number
> of degrees of freedom is changing. Although this is not a big challenge,
> since we can define some local variables to transfer data from the old
> vector to new one.
>

No all backends support preservation of data when resizing, so it's not
an option to preserve data when calling GenericVector::resize.

Garth

> Mehdi
>>
>> --
>> Anders
>>
>>
>>> Mehdi
>>>
>>> On Mon, 2010-06-28 at 15:01 +0000, Anders Logg wrote:
>>>> The problem was that x = Vector(b.size()) created a vector which
>>>> sometimes contained strange numbers. This has been changed now (in
>>>> uBLASVector::resize()) so that the vector is always zero. Took me a
>>>> while to work through all the 29 different axpy_prod that call each
>>>> other in operation.hpp in uBLAS...
>>>>
>>>> Should be fixed now.
>>>>
>>>> ** Changed in: dolfin
>>>> Status: Confirmed => Fix Committed
>>>>
>>>
>>>
>>> _______________________________________________
>>> Mailing list: https://launchpad.net/~dolfin
>>> Post to : <email address hidden>
>>> Unsubscribe : https://launchpad.net/~dolfin
>>> More help : https://help.launchpad.net/ListHelp
>
>
> _______________________________________________
> Mailing list: https://launchpad.net/~dolfin
> Post to : <email address hidden>
> Unsubscribe : https://launchpad.net/~dolfin
> More help : https://help.launchpad.net/ListHelp

On 29/06/10 14:33, Mehdi wrote:
> On Tue, 2010-06-29 at 12:13 +0200, Anders Logg wrote:
>> On Tue, Jun 29, 2010 at 11:50:38AM +0200, Mehdi wrote:
>>> Is there any specific reason why we have this type of problems in
>>> resizing uBLAS vectors not other type of vectors?
>>
>> Yes, because of a bug in the uBLAS resize function (it does not
>> initialize data).
>>
>>> Note that resizing a vector becomes important in the adaptive methods.
>>> Since when a function updates, we need to resize its underlying vector.
>>> If we clear a vector, then we lose all data that we had there.
>>
>> There's no point in keeping the old dofs as part of the vector since
>> the number of the mesh (and the old dofs) may change. What is needed
>> for adaptive methods is to interpolate the old solution to the new
>> mesh (which is already implemented in DOLFIN).
>
> This is not the case in XFEM where we have a fixed mesh but the number
> of degrees of freedom is changing. Although this is not a big challenge,
> since we can define some local variables to transfer data from the old
> vector to new one.
>

No all backends support preservation of data when resizing, so it's not 
an option to preserve data when calling GenericVector::resize.

Garth

> Mehdi
>>
>> --
>> Anders
>>
>>
>>> Mehdi
>>>
>>> On Mon, 2010-06-28 at 15:01 +0000, Anders Logg wrote:
>>>> The problem was that x = Vector(b.size()) created a vector which
>>>> sometimes contained strange numbers. This has been changed now (in
>>>> uBLASVector::resize()) so that the vector is always zero. Took me a
>>>> while to work through all the 29 different axpy_prod that call each
>>>> other in operation.hpp in uBLAS...
>>>>
>>>> Should be fixed now.
>>>>
>>>> ** Changed in: dolfin
>>>>         Status: Confirmed =>  Fix Committed
>>>>
>>>
>>>
>>> _______________________________________________
>>> Mailing list: https://launchpad.net/~dolfin
>>> Post to     : dolfin@lists.launchpad.net
>>> Unsubscribe : https://launchpad.net/~dolfin
>>> More help   : https://help.launchpad.net/ListHelp
>
>
> _______________________________________________
> Mailing list: https://launchpad.net/~dolfin
> Post to     : dolfin@lists.launchpad.net
> Unsubscribe : https://launchpad.net/~dolfin
> More help   : https://help.launchpad.net/ListHelp

Garth Wells (garth-wells) on 2010-07-01

Changed in dolfin:
status:	Fix Committed → Fix Released

DOLFIN

uBLAS GMRES solver fails on first run

Bug Description

Other bug subscribers

Remote bug watches