BEAST WDL emit checkpoint and/or logs/trees in case of premature termination #130

Open
dpark01 opened this issue Jun 24, 2020 · 1 comment

Comments

@dpark01
Member

dpark01 commented Jun 24, 2020

Currently, our BEAST WDL task writes its primary outputs (the BEAST log and trees files) to local files and delocalizes them only at the end of the task, via standard WDL File declarations in the output block.

However, BEAST runs often terminate prematurely, especially because they tend to run for so long. When that happens, we lose the entire (expensive) run.

Ideally, we could funnel the outputs through stdout instead. WDL allows task outputs to be derived from stdout, and most cloud executors, such as PAPIv2 and DNAnexus, make a point of checkpointing stdout frequently. This would mean that if a running BEAST job were prematurely terminated for any reason, we would still have a delocalized copy available in a bucket.

The main challenge, though, is figuring out how to get both of the critical output files (.trees and .log) through a single pipe, leaving stderr untouched for the usual messages.
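One possible way to share a single pipe, sketched here only as an illustration, is to prefix every line with a channel tag before it goes to stdout and split the stream apart again afterwards. The tag names (LOG, TREES) and the helper functions below are hypothetical; nothing here is an existing BEAST or WDL feature, and the sketch assumes both files are line-oriented text.

```python
import io

# Hypothetical channel tags; these are not part of BEAST or WDL.
LOG_TAG = "LOG"
TREES_TAG = "TREES"


def mux_line(tag, line):
    """Prefix one output line with its channel tag and a tab."""
    return f"{tag}\t{line.rstrip(chr(10))}\n"


def demux(stream):
    """Split a tagged stream back into per-channel text.

    Lines without a known tag are ignored, and a final line that was
    cut off before its newline (e.g. by a killed job) is dropped.
    """
    channels = {LOG_TAG: [], TREES_TAG: []}
    for raw in stream:
        if not raw.endswith("\n"):
            continue  # truncated final line
        # Split on the first tab only; BEAST log rows contain tabs themselves.
        tag, sep, payload = raw.partition("\t")
        if sep and tag in channels:
            channels[tag].append(payload)
    return {tag: "".join(lines) for tag, lines in channels.items()}
```

In this sketch, a wrapper around the BEAST process would call mux_line on each new line of the .log and .trees files as they grow, and whatever reads the checkpointed stdout would call demux to recover the two files. stderr is not involved, so the usual messages stay where they are.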

@dpark01
Member Author

dpark01 commented Jan 22, 2021

Brand new features!

On the cromwell/terra side:

On the beast side:

@dpark01 changed the title from "Beast wdl to stdout" to "BEAST WDL emit checkpoint and/or logs/trees in case of premature termination" on Jan 26, 2021