Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

increase object store (jwd05e) weight #1362

Conversation

sanjaysrikakulam
Copy link
Member

It seems that a few jobs using jwd02f are getting stuck in the D state.

I see all the query_tabular jobs (from the micro galaxy hackathon) are using jwd02f, and they are stuck. Each of them is running on different compute nodes, and I also see several other jobs stuck in the D state that are also utilizing the same JWD NFS share (running on the same compute nodes as the others). It could be an NFS issue.

This is a temporary fix to shift the load to jwd05e. jwd02f is still active, though. I also see some jobs utilizing jwd02f running.

@bgruening: Feel free to close this PR if you don't think this is ideal.

@sanjaysrikakulam sanjaysrikakulam changed the title increase object store weight increase object store (jwd05e) weight Nov 21, 2024
@bgruening
Copy link
Member

Last time we did this we had other problems and we got "imbalanced". I will look at jwd02 and merge if there is no other way.

@bgruening bgruening closed this Nov 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants