Comment 3 for bug 1265068

Matthew Farrellee (mattf) wrote :

Seems like the options are to 0) find a way to detect a functional Oozie or 1) decide on a timeout for PENDING.

I'd rather do (0), but a functioning Oozie may still fail if the cluster itself is not functional. The only way to detect a functioning cluster is to run a simple job through Oozie. However, a cluster that passes that check may still run only one job before crashing, which means ultimately (1) is necessary either way.

If urgent, I'd proceed with (1), after careful thought about how long is too long for a job to wait in PENDING.
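
As a rough illustration of (1), a PENDING timeout could look something like the sketch below. The names wait_for_job and get_status and the 300-second default are hypothetical, not taken from the existing code; picking the real limit is exactly the open question.

```python
import time


class PendingTimeout(Exception):
    """Raised when a job stays in PENDING longer than allowed."""


def wait_for_job(get_status, pending_timeout=300.0, poll_interval=5.0,
                 clock=time.monotonic, sleep=time.sleep):
    """Poll get_status() until the job leaves PENDING or the timeout expires.

    get_status is a hypothetical callable that returns the job's status
    string. clock and sleep are injectable so the loop can be tested
    without real waiting.
    """
    deadline = clock() + pending_timeout
    while True:
        status = get_status()
        if status != "PENDING":
            # The job moved on (running, succeeded, failed, ...).
            return status
        if clock() >= deadline:
            raise PendingTimeout(
                "job still PENDING after %.0f seconds" % pending_timeout)
        sleep(poll_interval)
```

The caller would catch PendingTimeout and mark the job as failed instead of leaving it PENDING forever.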

If not urgent, or as a long-term approach, we should look at doing an Oozie status test and potentially running a simple job.
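
For the status test, one possible starting point is the admin status call in Oozie's web services API, which reports a systemMode of NORMAL when the server is accepting requests. The sketch below assumes that endpoint is reachable from wherever the check runs; the function name, the example host, and the injectable fetch argument are made up for illustration. A NORMAL reply only says Oozie is up, not that the cluster can run a job.

```python
import json
import urllib.request


def oozie_is_normal(base_url, fetch=None, timeout=10):
    """Return True if Oozie's admin status endpoint reports NORMAL mode.

    base_url is something like "http://oozie-host:11000/oozie"
    (placeholder host). fetch is an optional callable(url) -> bytes,
    injectable so the check can be tested without a live server.
    """
    url = base_url.rstrip("/") + "/v1/admin/status"
    if fetch is None:
        def fetch(u):
            with urllib.request.urlopen(u, timeout=timeout) as resp:
                return resp.read()
    try:
        body = fetch(url)
        return json.loads(body).get("systemMode") == "NORMAL"
    except (OSError, ValueError, AttributeError):
        # Unreachable server or unexpected reply: treat as not functional.
        return False
```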