Comment 7 for bug 1448191

Revision history for this message
alefisico (alefisico) wrote :

Hi Olivier

Thanks for all your help, but I still cannot fix the issue. Can you please give me the steps that you are doing? In case I am doing something wrong.

In any case I found two issues:

- when I try to run the creation of the gridpack in a cluster (lsf but I also try it in condor) I got this error:
generate_events pilotrun
Traceback (most recent call last):
  File "/afs/cern.ch/work/a/algomez/Generation/genproductions/bin/MadGraph5_aMCatNLO/RPVSt100_UDD312_ISR2j_LO/RPVSt100_UDD312_ISR2j_LO_gridpack/work/processtmp/bin/internal/extended_cmd.py", line 879, in onecmd
    return self.onecmd_orig(line, **opt)
  File "/afs/cern.ch/work/a/algomez/Generation/genproductions/bin/MadGraph5_aMCatNLO/RPVSt100_UDD312_ISR2j_LO/RPVSt100_UDD312_ISR2j_LO_gridpack/work/processtmp/bin/internal/extended_cmd.py", line 872, in onecmd_orig
    return func(arg, **opt)
  File "/afs/cern.ch/work/a/algomez/Generation/genproductions/bin/MadGraph5_aMCatNLO/RPVSt100_UDD312_ISR2j_LO/RPVSt100_UDD312_ISR2j_LO_gridpack/work/processtmp/bin/internal/madevent_interface.py", line 1983, in do_generate_events
    postcmd=False)
  File "/afs/cern.ch/work/a/algomez/Generation/genproductions/bin/MadGraph5_aMCatNLO/RPVSt100_UDD312_ISR2j_LO/RPVSt100_UDD312_ISR2j_LO_gridpack/work/processtmp/bin/internal/extended_cmd.py", line 919, in exec_cmd
    stop = Cmd.onecmd_orig(current_interface, line, **opt)
  File "/afs/cern.ch/work/a/algomez/Generation/genproductions/bin/MadGraph5_aMCatNLO/RPVSt100_UDD312_ISR2j_LO/RPVSt100_UDD312_ISR2j_LO_gridpack/work/processtmp/bin/internal/extended_cmd.py", line 872, in onecmd_orig
    return func(arg, **opt)
  File "/afs/cern.ch/work/a/algomez/Generation/genproductions/bin/MadGraph5_aMCatNLO/RPVSt100_UDD312_ISR2j_LO/RPVSt100_UDD312_ISR2j_LO_gridpack/work/processtmp/bin/internal/madevent_interface.py", line 2654, in do_survey
    self.monitor(run_type='All jobs submitted for survey', html=True)
  File "/afs/cern.ch/work/a/algomez/Generation/genproductions/bin/MadGraph5_aMCatNLO/RPVSt100_UDD312_ISR2j_LO/RPVSt100_UDD312_ISR2j_LO_gridpack/work/processtmp/bin/internal/madevent_interface.py", line 3763, in monitor
    self.cluster.wait(self.me_dir, update_status, update_first=update_first)
  File "/afs/cern.ch/work/a/algomez/Generation/genproductions/bin/MadGraph5_aMCatNLO/RPVSt100_UDD312_ISR2j_LO/RPVSt100_UDD312_ISR2j_LO_gridpack/work/processtmp/bin/internal/cluster.py", line 57, in deco_f_interupt
    return f(self, *args, **opt)
  File "/afs/cern.ch/work/a/algomez/Generation/genproductions/bin/MadGraph5_aMCatNLO/RPVSt100_UDD312_ISR2j_LO/RPVSt100_UDD312_ISR2j_LO_gridpack/work/processtmp/bin/internal/cluster.py", line 304, in wait
    idle, run, finish, fail = self.control(me_dir)
  File "/afs/cern.ch/work/a/algomez/Generation/genproductions/bin/MadGraph5_aMCatNLO/RPVSt100_UDD312_ISR2j_LO/RPVSt100_UDD312_ISR2j_LO_gridpack/work/processtmp/bin/internal/misc.py", line 244, in deco_f_retry
    raise error.__class__, '[Fail %i times] \n %s ' % (i+1, error)
ClusterManagmentError: [Fail 5 times]
 Job exhausted all retries.

- In the case that I try to run locally, it seems like it is running fine. I got a gridpack at the end, but I found these messages in the log file:

At line 1070 of file addmothers.f
Fortran runtime error: Index '3' of dimension 1 of array 'potential_index' above upper bound of 2
At line 1070 of file addmothers.f
Fortran runtime error: Index '3' of dimension 1 of array 'potential_index' above upper bound of 2
INFO: Idle: 49, Running: 8, Completed: 99 [ 3h 59m ]
At line 1070 of file addmothers.f
Fortran runtime error: Index '3' of dimension 1 of array 'potential_index' above upper bound of 2
At line 1070 of file addmothers.f
Fortran runtime error: Index '3' of dimension 1 of array 'potential_index' above upper bound of 2
INFO: Idle: 48, Running: 8, Completed: 100 [ 3h 59m ]
INFO: Idle: 47, Running: 8, Completed: 101 [ 3h 59m ]
At line 1070 of file addmothers.f
Fortran runtime error: Index '3' of dimension 1 of array 'potential_index' above upper bound of 2
At line 1070 of file addmothers.f
Fortran runtime error: Index '3' of dimension 1 of array 'potential_index' above upper bound of 2

and after I try to use the run script, I still got this error:

Updating grid_card.dat...
Now generating 100 events with random seed 7894 and granularity -1
P2_gg_su1su1xgg_su1_qq_su1x_qq
P2_gg_su1su1xqq_su1_qq_su1x_qq
P2_gg_su1su1xbbx_su1_qq_su1x_qq
P2_gq_su1su1xgq_su1_qq_su1x_qq
P2_qq_su1su1xqq_su1_qq_su1x_qq
P2_qq_su1su1xgg_su1_qq_su1x_qq
P2_qq_su1su1xbbx_su1_qq_su1x_qq
P1_gg_su1su1xg_su1_qq_su1x_qq
P1_gq_su1su1xq_su1_qq_su1x_qq
P1_qq_su1su1xg_su1_qq_su1x_qq
P0_gg_su1su1x_su1_qq_su1x_qq
P0_qq_su1su1x_su1_qq_su1x_qq
Traceback (most recent call last):
  File "./bin/gridrun", line 84, in <module>
    cmd_line = cmd_interface.GridPackCmd(me_dir=root_path, nb_event=args[0], seed=args[1])
  File "/afs/cern.ch/work/a/algomez/Generation/tmp/CMSSW_7_1_14/src/process/madevent/bin/internal/madevent_interface.py", line 4553, in __init__
    self.launch(nb_event, seed)
  File "/afs/cern.ch/work/a/algomez/Generation/tmp/CMSSW_7_1_14/src/process/madevent/bin/internal/madevent_interface.py", line 4575, in launch
    self.refine4grid(nb_event)
  File "/afs/cern.ch/work/a/algomez/Generation/tmp/CMSSW_7_1_14/src/process/madevent/bin/internal/madevent_interface.py", line 4619, in refine4grid
    cwd=Pdir)
  File "/afs/cern.ch/work/a/algomez/Generation/tmp/CMSSW_7_1_14/src/process/madevent/bin/internal/misc.py", line 551, in deco_f
    % arg
Exception: ['./../../bin/internal/gen_ximprove'] fails with no such file or directory
Error: event file not found !

I hope it helps.
cheers,