-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Slurm doesn't exit on completion #33
Comments
Can you please indicate the snakemake version and the plugin version, as well as the command line you have been using? That would be tremendously helpful. Thank you! |
Snakemake version: 8.4.4
|
Wait a second: Your workflow runs within a github runner? |
Yes, using this https://github.com/koesterlab/setup-slurm-action. I've been seeing the same behavior on a real cluster as well though. |
Well, the Ansible configuration was the solution after we tinkered with vagrant ... I am, however, not sure whether the runner configures the slurmdb correctly and can provide feedback. As for your real cluster: Do you observe the same error message? |
Yea the real cluster has the same never ending refresh of |
What is your slurm version ( PS for me, it's bedtime, soon. ;-) |
Sorry for the delay.
Running Just in case it's helpful, here's the bash script I submit using
|
Oh wait I think I'm seeing the problem now. Snakemake is running |
Ah, yes. The job names are unique strings for a group of jobs (not a snakemake group job). This way, we can limit the number of queries and put less strain on the slurmdbd. However, we have the rule name as a comment. So, you can query the comment (manually) with sacct to. For snakemake we leave it the way it is. Just by chance, I see your comments now, because I have been teaching the whole day. I realized, that attaching the sample name to the comment (and extending the docs, like I write in every second issue thread) would make a lot of sense. Next week, I might find the time. |
Thanks a lot for your help! That would be great, the easier it is to identify each job the better imo. I'm not seeing any comments on the jobs I have running though. I am still setting each rule's |
No, it is not possible to overwrite the comment. Yet, now the comment carries the jobname and wildcards. Whilst it is not sensible to accommodate for every possible string combination, which is the one, you are missing? |
Sorry I think everything I need is covered by #35, once that's merged |
With past versions of snakemake, there have been issues with slurm not exiting the root job once everything was completed. This was solved by including a
--cluster-status
script (e.g. https://github.com/Snakemake-Profiles/slurm/blob/master/%7B%7Bcookiecutter.profile_name%7D%7D/slurm-status.py) but that argument is no longer in snakemake's CLI. Am I right in assuming that means the burden of stopping slurm properly has moved over to this module?This is the snippet of a log from a slurm test I just ran that I had to cancel (along with many others that I let run longer) exhibiting the bad behavior:
The text was updated successfully, but these errors were encountered: