"clocktest"/"memory/mcelog_check"/"miscellanea/idle_check" get fail result during OCPready test

Bug #1424910 reported by Matthewlin on 2015-02-24
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
The Open Compute Project
Undecided
Unassigned

Bug Description

It gets "clocktest"/"memory/mcelog_check"/"miscellanea/idle_check" get fail result during OCPready test
Follow OCPready page to download the test script.
URL:http://it-div-procurements.web.cern.ch/it-div-procurements/IT-4060/ocp-ready/

Test OS: Ubuntu14.10
Test platfrom: Grantley

Name Result Comment
cpu/clocktest FAILED sched_setaffinity: Invalid argument
memory/mcelog_check FAILED cat: /var/log/mcelog: No such file or directory
 miscellanea/idle_check FAILED

Matthewlin (matthew-lin) wrote :
Jeff Lane (bladernr) wrote :

It is impossible to open that zip file. It appears to be password protected:

bladernr@klaatu:~/Downloads/ocp-tmp$ unzip ../F06A_OCPready.zip
Archive: ../F06A_OCPready.zip
   creating: F06A_OCPready/
   creating: F06A_OCPready/local/
   creating: F06A_OCPready/local/.cache/
   creating: F06A_OCPready/local/.cache/checkbox/
[../F06A_OCPready.zip] F06A_OCPready/local/.cache/checkbox/checkbox.log password:
password incorrect--reenter:
password incorrect--reenter:
   skipping: F06A_OCPready/local/.cache/checkbox/checkbox.log incorrect password
[../F06A_OCPready.zip] F06A_OCPready/local/.cache/checkbox/checkbox.xsl password:
password incorrect--reenter:
password incorrect--reenter:

You can manually re-run the clocktest test by running /usr/lib/checkbox/bin/clocktest (or perhaps /usr/share/checkbox/scripts/clocktest) Having actual script output would be useful, so please re-run the test manually. Clocktest checks for cpu clock jitter. More info about the system you are testing would be nice too, such as what CPU is being used, what version and arch of Ubuntu you have installed, kernel version, etc.

I have no idea why mcelog_check failed, other than that there is no mcelog file for it to find. I also don't know exactly what mcelog does, to be honest, as it was added by engineers at ITRI.

idle check, no idea... unless the system was not idle long enough. according to the test definitiion, the system must be idle for over 12 hours to pass the idle check.

Jeff Lane (bladernr) wrote :

Hah, yeah, I can't read... it was pointed out that the password for hte zip file is in the link...

Jeff Lane (bladernr) wrote :

OK, so looks like a Haswell E5 chip...

To be honest, I'm not sure if that script has support for Haswell (and I wouldn't even begin to know how to patch it). Definitely run the clocktest program manually and see what the actual stdout/stderr is.

Jeff Lane (bladernr) wrote :

So this is the clocktest output from an F06A that passed Ubuntu certification earlier...

Testing for clock jitter on 64 cpus
PASSED, largest jitter seen was 0.004395
clock direction test: start time 1412247133, stop time 1412247193, sleeptime 60, delta 0
PASSED

using Ubuntu 14.04.1

So definitely re-run that test manually and copy the output to a reply here... also, it may be interesting if you ran it multiple times (maybe 10?) to see if it only fails periodically for some reason.

Also, you may want to re-install everything using 14.04.1 and re-try that as well.

Cheers
Jeff

Rod Smith (rodsmith) wrote :

A shot in the dark: The complaint about the lack of a /var/log/mcelog file suggests that it might be dependent upon the "mcelog" package. I'd try installing that and re-running.

Jeff Lane (bladernr) wrote :

Also, for what it's worth, there's a new test package availabel that I'm about to debut at OCP Summit. Could you please try that? To install, you need to do the following:

Install the SUT with Ubuntu 14.04 or 14.10
Add the following PPAs using these commands:
sudo add-apt-repository ppa:hardware-certification/public
sudo add-apt-repository ppa:opencompute-developers/ocp-certification-tools-ppa
sudo apt-get update

Then simply install:

sudo apt-get install opencompute-ready

To execute, run as a normal user on the SUT:

opencompute-ready

Select the local whitelist and once you get to the test selection screen, press T to start testing.

Changed in opencompute:
status: New → Incomplete
To post a comment you must log in.
This report contains Public information  Edit
Everyone can see this information.

Other bug subscribers