[ET Trac] [Einstein Toolkit] #334: unsuccessful qsub not recognized / submit succeeds for finished simulation

Einstein Toolkit trac-noreply at einsteintoolkit.org
Wed Mar 9 11:48:36 CST 2011


#334: unsuccessful qsub not recognized / submit succeeds for finished simulation
------------------------+---------------------------------------------------
 Reporter:  knarf       |       Owner:  mthomas
     Type:  defect      |      Status:  new    
 Priority:  minor       |   Milestone:         
Component:  SimFactory  |     Version:         
 Keywords:              |  
------------------------+---------------------------------------------------
 python version:

 I submitted a simulation 'sim submit' but the corresponding qsub failed
 due to wrong numbers of procs/node (philip cluster). I changed the number
 given on the command line and did a 'sim submit' again, this time
 successful. Several things happend which I think could be done better:

 - the unsuccessful qsub was not detected during the new submit - it
 attempted a restart and didn't simply clean
 the unsuccessful submit
 - when trying the restart, it went ahead and queued the job, but this
 later failed when run with "cannot rerun a restart that has been
 finished". This could have been caught earlier - without the wait time in
 the queue.

-- 
Ticket URL: <https://trac.einsteintoolkit.org/ticket/334>
Einstein Toolkit <http://einsteintoolkit.org>
The Einstein Toolkit


More information about the Trac mailing list