-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Nomad Restart and Reschedule Policy #611
Conversation
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## main #611 +/- ##
==========================================
+ Coverage 76.92% 76.98% +0.05%
==========================================
Files 41 41
Lines 3498 3498
==========================================
+ Hits 2691 2693 +2
+ Misses 592 590 -2
Partials 215 215 ☔ View full report in Codecov by Sentry. |
0964910
to
7b9cd3f
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks!
I would not add an e2e for this scenario. We would mainly test Nomad (and not something truly Poseidon specific), which doesn't feel right. Further the long increase is really not ideal. If at all, I would opt to add another template job with shorter wait times specifically for the given scenario. Still, I don't think it's the right thing to do...
because our current Poseidon-Nomad architecture has a 1:1 runner-job relationship and there is no need to have more than the one task per job.
to limit the number of reschedules as we cannot guarantee an error-free job definition. We also set reasonable delays and intervals to increase the likelihood that one deployment has enough time, but there is no second deployment within the interval.
7b9cd3f
to
2dd0d16
Compare
Related to #587
Fix restart and reschedule configuration to not reschedule infinitely (e.g. due to an invalid image specifier).
ToDo:
Should we write an e2e test checking that a wrong environment definition does not loop infinitely? With the current configuration, the test will last almost 10 minutes..No