Replies: 5 comments 6 replies
-
I just checked the source code: Nextflow just assumes preempted jobs to have errored and also always adds However, it would make more sense to add def run_scontrol_on_preempted_job(cluster, jobid):
sctrl_res = sp.check_output(
shlex.split(f"scontrol {cluster} -o show job {jobid}")
)
m = re.search(r"Requeue=(\w+)", sctrl_res.decode())
requeueable = m.group(1)
if requeueable == "1":
job_state = "REQUEUED"
else:
job_state = "CANCELLED"
return job_state This would need to be put here: Unfortunately I have no idea how to translate that snippet to Groovy. Could someone from the Nextflow devs do that? |
Beta Was this translation helpful? Give feedback.
-
Nextlow adds |
Beta Was this translation helpful? Give feedback.
-
This issue has a long history: #226, #234, #3422 But to be honest, I'm having a hard time following these discussions. They talk about jobs that are suspended by SLURM due to preemption or node failure, so it seems like you would want to use |
Beta Was this translation helpful? Give feedback.
-
@bentsherman is it possible to have some sidecar process similar to Snakemake's
|
Beta Was this translation helpful? Give feedback.
-
@Hoeze Hi, Hoeze! |
Beta Was this translation helpful? Give feedback.
-
On our SLURM cluster we have a preemptible lowprio partition.
It seems like Nextflow does not recognize job preemtions. What is the correct way to configure Nextflow here?
clusterOptions = --requeue
: Can this cause duplicated job submissions?clusterOptions = --no-requeue
: The whole pipeline breaks all the time due to many job preemptionsBeta Was this translation helpful? Give feedback.
All reactions