[docs] improve cluster manual recovery documentation

Bug #1250572 reported by Andrew Woodward
28
This bug affects 5 people
Affects Status Importance Assigned to Milestone
Fuel for OpenStack
Fix Released
High
Sergii Golovatiuk

Bug Description

docs/pages/frequently-asked-questions/0040-corosync-crashes.rst is not sufficient to restart the cluster. It needs to be extended and probably moved to it's own section "operations guide". It should be complete enough to recover the cluster from a crashed state.

Andrew Woodward (xarses)
Changed in fuel:
assignee: nobody → Andrew Woodward (xarses)
importance: Undecided → Medium
Andrew Woodward (xarses)
tags: added: cold-restart-inprovments
tags: added: cold-restart-improvements
removed: cold-restart-inprovments
Andrew Woodward (xarses)
Changed in fuel:
milestone: none → 3.2.1
Mike Scherbakov (mihgen)
Changed in fuel:
milestone: 3.2.1 → 4.0
Revision history for this message
Nikolay Markov (nmarkov) wrote :

Is there any progress on this? Seems like a little fix.

Changed in fuel:
status: New → Confirmed
Mike Scherbakov (mihgen)
Changed in fuel:
assignee: Andrew Woodward (xarses) → nobody
Revision history for this message
Andrew Woodward (xarses) wrote :

This is not duplicate, this is to convert internal wiki page to public docs for manual cluster rebuild, the autorebuild process is separate from doing it manually.

Dmitry Pyzhov (dpyzhov)
Changed in fuel:
milestone: 4.0 → 4.1
Mike Scherbakov (mihgen)
Changed in fuel:
assignee: nobody → Miroslav Anashkin (manashkin)
Mike Scherbakov (mihgen)
Changed in fuel:
milestone: 4.1 → 5.0
Revision history for this message
Dmitry Ilyin (idv1985) wrote :

I'll write this

Changed in fuel:
assignee: Miroslav Anashkin (manashkin) → Dmitry Ilyin (idv1985)
Dmitry Ilyin (idv1985)
Changed in fuel:
milestone: 5.0 → 5.1
Revision history for this message
Andrew Woodward (xarses) wrote :

From https://bugs.launchpad.net/fuel/+bug/1290075/comments/1

Mike Scherbakov (mihgen) wrote on 2014-03-10:
From Micah:
> One of the big holes we have is how to recover from a power outage type event. Starting the cloud from it being down, and not shutdown gracefully. We do not have any working steps for that, nor have found anyone who has such steps.

This services list might be helpful: https://docs.google.com/a/mirantis.com/spreadsheet/ccc?key=0Au_vGfPxE7BUdDEtMHB4VlJvV3RjQ2dTMHpILVNfUWc#gid=0wq

Changed in fuel:
importance: Medium → High
Revision history for this message
Bogdan Dobrelya (bogdando) wrote :
Dmitry Ilyin (idv1985)
summary: - improve cluster manual recovery documentation
+ [docs] improve cluster manual recovery documentation
Dmitry Ilyin (idv1985)
Changed in fuel:
importance: High → Medium
Dmitry Ilyin (idv1985)
Changed in fuel:
assignee: Dmitry Ilyin (idv1985) → Fuel Library Team (fuel-library)
Changed in fuel:
milestone: 5.1 → 6.0
Changed in fuel:
milestone: 6.0 → 5.1
importance: Medium → High
Changed in fuel:
assignee: Fuel Library Team (fuel-library) → Sergii Golovatiuk (sgolovatiuk)
Changed in fuel:
status: Confirmed → Fix Committed
Revision history for this message
Pavel Vaylov (pvaylov) wrote :

Hi team !

How the bug was fixed ? Do we have an article in doc's (I do not see it in docs) ?

Even more: We do not have a "cluster full shutdown procedure". So I have created a bug report for it https://bugs.launchpad.net/fuel/+bug/1406523

Revision history for this message
Irina Povolotskaya (ipovolotskaya) wrote :

Started working on a new bug, reported by Pavel V (see the link above, please).

Revision history for this message
Meg McRoberts (dreidellhasa) wrote : Re: [Bug 1250572] Re: [docs] improve cluster manual recovery documentation

Thanks for the heads-up -- this is an important topic and it
should be fun.

But I don't see this on the Trello board -- is there a reason
nobody but me is putting things there? I admit that I don't
completely understand how this is all going to work so I
just thought I would ask...

I look forward to reviewing what you write,
meg

On Tue, Jan 20, 2015 at 10:16 AM, Irina <email address hidden> wrote:

> Started working on a new bug, reported by Pavel V (see the link above,
> please).
>
> --
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/1250572
>
> Title:
> [docs] improve cluster manual recovery documentation
>
> Status in Fuel: OpenStack installer that works:
> Fix Committed
>
> Bug description:
> docs/pages/frequently-asked-questions/0040-corosync-crashes.rst is not
> sufficient to restart the cluster. It needs to be extended and
> probably moved to it's own section "operations guide". It should be
> complete enough to recover the cluster from a crashed state.
>
> To manage notifications about this bug go to:
> https://bugs.launchpad.net/fuel/+bug/1250572/+subscriptions
>

Revision history for this message
Michele Fagan (michelefagan) wrote :

Closing this bug. Duplicate of Bug #1406523. Fix was committed and released. "How to shut down the whole cluster" including starting up the cluster and restarting the OpenStack services is now in the Operations Guide.

Changed in fuel:
status: Fix Committed → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Duplicates of this bug

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.