slurm sbatch command fails
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
slurm-llnl (Ubuntu) |
Fix Released
|
Undecided
|
Unassigned |
Bug Description
Binary package hint: slurm-llnl
version used is ubuntu 8.04 (hardy), architecture is amd64
source package is https:/
$sbatch jobscript
leads to the following error (reported in slurm-jobnumber
slurmd[hostname]: error: execve(): /var/run/
using srun,
$srun jobscript
the job runs fine if the executable bit is set for jobscript.
If the executable bit is not set for jobscript, the job fails with the error
slurmd[hostname]: error: execve(): jobscript: Permission denied
The executable bit of jobscript has no influence on whether sbatch fails. It looks to me that
/var/run/
The file is always created to have read/execute permission for the owner. script( batch_job_ launch_ msg_t *msg, char *path) slurmstepd/ mgr.c:
Here is an excerpt of the code from
_make_batch_
in src/slurmd/
if (chown(script, (uid_t) msg->uid, (gid_t) -1) < 0) {
error( "chown( %s): %m", path);
goto error;
}
if (chmod(script, 0500) < 0) {
error( "chmod: %m");
}
Is your slurmd daemon running as user root or the person running the job?
Are your uid numbers consistent across the cluster?
Take a look in the slurmd log file (located at
"scontrol show config | grep SlurmdLog").