Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Incomplete tasks due service crash #472

Open
nuclearcat opened this issue Oct 21, 2024 · 0 comments
Open

Incomplete tasks due service crash #472

nuclearcat opened this issue Oct 21, 2024 · 0 comments

Comments

@nuclearcat
Copy link
Member

Often might happen that service crash due various reasons, for example pipeline-tarball might try to use more memory than available at node, then it will restart and we might have checkout node finished and stay in incomplete state.
We need a way to retry last job that was being executed, if it is possible.

Proposal: to do this we need mechanism that will take event, set this event as "current" of service, and on event completion - delete this state. In case service restarted, it will check if there was event that was being processed and if yes, it will retry to process it again.

Question is:
Where to store event? Persistent storage on kubernetes? some additional API endpoint?

Any other suggestions?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant