-
Notifications
You must be signed in to change notification settings - Fork 661
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Nextflow to gracefully handle Google gRPC API call failures #5703
Comments
Duplicate of #4537 |
I believe this is already implemented nextflow/plugins/nf-google/src/main/nextflow/cloud/google/batch/client/BatchClient.groovy Lines 181 to 182 in 00376aa
|
Thanks @pditommaso. Per @jorgee's comment, it didn't look like his PR resolved all the call failure modes: |
Closing until we get a more specific (reproducible) bug report. |
New feature
Can Nextflow support retrying of gRPC API failures, specifically the following different failures:
UNAVAILABLE
DEADLINE_EXCEEDED
RESOURCE_EXHARSTED
UNKNOWN
Usage scenario
Running Nextflow pipelines on Google Batch, when there are job failures due to the resource not available (e.g. the requested VM was not be able to start due to exceeding the Google project quota), Nextflow was not able to receive any signal from these jobs and this causes the Nextflow job to not stop, consuming resources.
Suggest implementation
Could you make Nextflow support retrying of gRPC API failures based on failure response codes?
The text was updated successfully, but these errors were encountered: