MySQL OCF: pacemaker resource refuses to start after an op start has timed out

Bug #1580159 reported by Bogdan Dobrelya
8
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Fuel for OpenStack
Fix Committed
Medium
Bogdan Dobrelya

Bug Description

If operation start timed out, mysql resource refuses to start ever, unless cleanup manually.

How to reproduce:
* Deploy a cluster's as the following:
- DB primary role to deploy as always,
- the rest of the nodes with DB role to start with a modified OCF RA action start to sleep for a period of time higher than op start timeout (default 300)

Expected:
After op start timed out in logs, pacemaker shall repeat start attempts for the resources

Actual:
It times out and remains stopped, until cleaned up manually

Logs snippet:
May 10 11:50:36 n3 ocf-mysql-wss: INFO: p_mysql: get_master_timeout(): Setting timeout 300
May 10 11:50:36 n3 ocf-mysql-wss: INFO: p_mysql: check_if_galera_pc(): Checking if Primary Component
May 10 11:50:46 n3 ocf-mysql-wss: INFO: p_mysql: check_if_galera_pc(): Waiting for master. 290 seconds left
May 10 11:50:57 n3 ocf-mysql-wss: INFO: p_mysql: check_if_galera_pc(): Waiting for master. 280 seconds left
May 10 11:51:07 n3 ocf-mysql-wss: INFO: p_mysql: check_if_galera_pc(): Waiting for master. 270 seconds left
May 10 11:51:18 n3 ocf-mysql-wss: INFO: p_mysql: check_if_galera_pc(): Waiting for master. 260 seconds left
May 10 11:51:25 n3 lrmd[157]: warning: p_mysql_start_0 process (PID 1220) timed out
May 10 11:51:25 n3 lrmd[157]: warning: p_mysql_start_0:1220 - timed out after 60000ms
May 10 11:51:31 n3 crmd[160]: notice: Operation p_mysql_monitor_0: not running (node=n3, call=38, rc=7, cib-update=48, confirmed=true)

Solution:
Make MySQL OCF config param master_timeout to be lower than op start timeout

Changed in fuel:
importance: Undecided → Medium
milestone: none → 10.0
description: updated
tags: added: galera pacemaker
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to fuel-library (master)

Fix proposed to branch: master
Review: https://review.openstack.org/314533

Changed in fuel:
assignee: nobody → Bogdan Dobrelya (bogdando)
status: New → In Progress
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to fuel-library (master)

Reviewed: https://review.openstack.org/314533
Committed: https://git.openstack.org/cgit/openstack/fuel-library/commit/?id=30ee96e938584bd6cb2a482ad5d2ac7164b34acf
Submitter: Jenkins
Branch: master

commit 30ee96e938584bd6cb2a482ad5d2ac7164b34acf
Author: Bogdan Dobrelya <email address hidden>
Date: Tue May 10 14:27:34 2016 +0200

    Increase mysql resource op start timeout

    Make MySQL OCF config param master_timeout to be lower
    than op start timeout.

    Closes-bug: #1580159

    Change-Id: I027e2fc114e57f0019bb76b09f1f9081fae7d121
    Signed-off-by: Bogdan Dobrelya <email address hidden>

Changed in fuel:
status: In Progress → Fix Committed
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.