Vanilla Hadoop 2 does not set mapreduce.jobhistory.address config

Bug #1328675 reported by Trevor McKay
This bug affects 1 person
Affects: Sahara
Status: Fix Released
Importance: Medium
Assigned to: Andrew Lazarev

Bug Description

Occasionally, when using the Vanilla 2 Hadoop plugin, a job fails in the Oozie launcher when it tries to contact the jobhistory server at mapreduce.jobhistory.address. The default value for this parameter is 0.0.0.0:10020, so it needs to be set explicitly in mapred-site.xml to point at the host that runs the jobhistory server.
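For illustration, here is a minimal Python sketch of what the resulting mapred-site.xml entry could look like. The helper name render_hadoop_properties and the host name master-001 are assumptions made up for this example; this is not the Sahara code, only a way to show the property name and value format.

```python
import xml.etree.ElementTree as ET


def render_hadoop_properties(props):
    """Render a dict as a Hadoop-style <configuration> XML string.

    Hypothetical helper for illustration; not part of Sahara.
    """
    root = ET.Element("configuration")
    for name, value in sorted(props.items()):
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = str(value)
    return ET.tostring(root, encoding="unicode")


# "master-001" is an assumed host name for the node running the
# jobhistory server; 10020 is the default port from the description.
xml_text = render_hadoop_properties(
    {"mapreduce.jobhistory.address": "master-001:10020"}
)
print(xml_text)
```

The point is only that the value must be a host:port pair that worker nodes can reach, rather than the 0.0.0.0 default.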

It is unclear exactly why the jobhistory server is only contacted sometimes. The failure rate seems to be relatively low, maybe less than 10%, but it is difficult to tell. I do not have a stack trace handy, but when it fails Oozie reports "Connection refused" to 0.0.0.0:10020 in the job log on the worker node.
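A quick way to check the symptom from a worker node is to probe the address directly. The sketch below uses only the Python standard library; the function name can_connect is made up for this example. With the default 0.0.0.0 value a client typically ends up trying the local machine, so the probe below targets 127.0.0.1 as an assumption about how that default behaves.

```python
import socket


def can_connect(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers "Connection refused", timeouts, and resolution errors.
        return False


if __name__ == "__main__":
    # On a worker node without a local jobhistory server this is
    # expected to fail, which matches the error Oozie reports.
    print("jobhistory reachable locally:", can_connect("127.0.0.1", 10020))
```

Running the same probe against the host that actually runs the jobhistory server should succeed if the service is up.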

Note, the HDP 2 plugin already sets this configuration parameter in a method called finalize_configuration, after services are deployed.

The Vanilla 2 plugin should set this as well. There are two potential ways to do this:

1) Set it in the job configuration, the same way we set values for namenode and datanode on each job. (Of course, this is redundant in the case of the HDP plugin and is irrelevant for Hadoop 1, so we would want to check the Hadoop version.) Quick fix maybe, but a little kludgy.

2) Make the Vanilla 2 plugin update this config on the cluster once the jobhistory server is launched, as in the HDP plugin. This is better but more complicated.
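The two options above can be sketched roughly as follows. All function names, the version check, and the host name used in the usage lines are assumptions for illustration; neither function is the actual Sahara or HDP plugin code.

```python
JOBHISTORY_KEY = "mapreduce.jobhistory.address"
JOBHISTORY_PORT = 10020  # default port mentioned in the description


def jobhistory_address(host, port=JOBHISTORY_PORT):
    """Build the host:port string for the jobhistory server."""
    return "%s:%d" % (host, port)


def with_jobhistory_for_job(job_configs, hadoop_version, host):
    """Option 1 (sketch): inject the address into each job's configs.

    Only applies to Hadoop 2, and does not override a value that is
    already present (e.g. one set by another plugin).
    """
    configs = dict(job_configs)
    if hadoop_version.startswith("2."):
        configs.setdefault(JOBHISTORY_KEY, jobhistory_address(host))
    return configs


def mapred_site_overrides(host):
    """Option 2 (sketch): values to write into mapred-site.xml once
    the jobhistory server host is known."""
    if host is None:
        return {}
    return {JOBHISTORY_KEY: jobhistory_address(host)}


# Usage with an assumed host name:
print(with_jobhistory_for_job({}, "2.3.0", "master-001"))
print(mapred_site_overrides("master-001"))
```

Option 1 touches every job submission, while option 2 changes the cluster configuration once, which is why the second approach looks cleaner even though it needs more plumbing in the plugin.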

Changed in sahara:
assignee: nobody → Andrew Lazarev (alazarev)
OpenStack Infra (hudson-openstack) wrote : Fix proposed to sahara (master)

Fix proposed to branch: master
Review: https://review.openstack.org/99227

Changed in sahara:
status: Confirmed → In Progress
Andrew Lazarev (alazarev) wrote :

Wasn't able to reproduce. Made a blind fix.

OpenStack Infra (hudson-openstack) wrote : Fix merged to sahara (master)

Reviewed: https://review.openstack.org/99227
Committed: https://git.openstack.org/cgit/openstack/sahara/commit/?id=78cc4f5665ffef42512052018582a2b8c90b3cd8
Submitter: Jenkins
Branch: master

commit 78cc4f5665ffef42512052018582a2b8c90b3cd8
Author: Andrew Lazarev <email address hidden>
Date: Tue Jun 10 15:44:17 2014 -0700

    Added jobhistory address config to vanilla 2

    Change-Id: Ia04c2e4901cb03bac3487960092fd603c4c32e12
    Closes-Bug: #1328675

Changed in sahara:
status: In Progress → Fix Committed
Changed in sahara:
status: Fix Committed → Fix Released
Thierry Carrez (ttx)
Changed in sahara:
milestone: juno-2 → 2014.2