Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OCTRL-888 [occ] If we see FairMQ's ERROR state, we should exit #565

Merged
merged 1 commit into from
May 22, 2024

Conversation

knopers8
Copy link
Collaborator

FairMQ considers ERROR state as a final state and expects any plugins to exit in such case. This is of course in contradiction to our concept of the ERROR state, which we envisage to RECOVER from. This being said, since FairMQ tries to exit anyway in such cases and we do not plan to work on recovery soon, I think the best is to respect the current status quo. This fixes OCTRL-888, allowing the DPL tasks to exit upon seeing an exception and propagating it to FairMQ.

FairMQ considers ERROR state as a final state and expects any plugins to exit in such case.
This is of course in contradiction to our concept of the ERROR state, which we envisage to RECOVER from.
This being said, since FairMQ tries to exit anyway in such cases and we do not plan to work on recovery soon, I think the best is to respect the current status quo.
This fixes OCTRL-888, allowing the DPL tasks to exit upon seeing an exception and propagating it to FairMQ.
@knopers8 knopers8 requested a review from teo May 22, 2024 15:05
@teo teo merged commit c32f70b into AliceO2Group:master May 22, 2024
2 checks passed
@teo
Copy link
Member

teo commented May 22, 2024

@knopers8 good catch! I guess our RECOVER, if/when implemented, might at best redeploy the task.

@knopers8 knopers8 deleted the fix-tasks-stuck-exiting branch May 23, 2024 07:07
@knopers8
Copy link
Collaborator Author

I'm afraid so, I think we might be bound by what FairMQ will allow us to do.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

2 participants