 One comment on the proposal: how would one handle MPI processes? Those are
 created before Cactus' main process (or at least creation is not under
 Cactus' control). So, given that we usually want number_of_processes *
 number_of_threads_per_process = number_of_cores (or 2*number_of_cores or
 0.5*number_of_cores), it seems that we cannot really choose the number of
 threads once the executable starts, but have to rely on simfactory instead.
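
 To make the arithmetic concrete, here is a rough sketch of the constraint
 above. The function name and the "oversubscription" parameter (1, 2 or 0.5,
 matching the three cases mentioned) are made up for illustration; this is
 not simfactory or Cactus code, just the relation solved for the thread count.

```python
def threads_per_process(num_cores, num_processes, oversubscription=1.0):
    """Solve num_processes * threads = oversubscription * num_cores for threads.

    All names here are hypothetical; 'oversubscription' is the factor in
    front of number_of_cores (1.0, 2.0 or 0.5 in the cases above).
    """
    if num_cores < 1 or num_processes < 1:
        raise ValueError("num_cores and num_processes must be positive")
    total_threads = oversubscription * num_cores
    threads, remainder = divmod(total_threads, num_processes)
    # Refuse layouts where the threads do not divide evenly among processes.
    if threads < 1 or remainder != 0:
        raise ValueError("no whole number of threads per process fits")
    return int(threads)


if __name__ == "__main__":
    # Example: 16 cores shared by 4 MPI processes, one thread per core.
    print(threads_per_process(16, 4))
```

 The point is that num_processes is an input here: it has to be known
 before the thread count can be derived from it.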

