Nomad Agent "Too many open files" #675

Open
mpass99 opened this issue Sep 4, 2024 · 2 comments
Labels
bug (Something isn't working), pending

Comments

@mpass99
Contributor

mpass99 commented Sep 4, 2024

Related to #612

Investigate the Linux error "Failed to allocate directory watch: Too many open files", which appears when a single agent hosts a high number of runners.

  • In "Handle permanently dead Nomad jobs" (#612), the high number of runners was caused by migrations between the agents on restarts. Check whether this issue can be reproduced with just one agent and without a restart.
mpass99 added the bug label Sep 4, 2024
@mpass99
Contributor Author

mpass99 commented Sep 9, 2024

The warning is not emitted by Nomad itself, but by systemctl restart commands.
It is not triggered by systemctl status nomad.service or systemctl start nomad.service. The error can also be triggered by restarting only Docker instead of Nomad.

In our staging environment, the error does not occur with 160 Runners (aka 108 Nomad subprocesses), but it does occur with 180 Runners.
Maybe the fs.inotify.max_user_instances=128 limit is exceeded.
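
One way to check this hypothesis is to count the inotify instances currently open per user and compare them with the configured limit. The following is a minimal sketch, not something taken from the agents themselves: it assumes a Linux host with /proc mounted, relies on inotify file descriptors showing up as links to "anon_inode:inotify" under /proc/<pid>/fd, and needs root to see the descriptors of other users' processes. The function names count_inotify_instances and read_limit are hypothetical helpers introduced only for illustration.

```python
import os
from collections import Counter


def count_inotify_instances(proc_root="/proc"):
    """Return a Counter mapping uid -> number of open inotify instances.

    Assumption: each inotify instance appears as a file descriptor whose
    link target is "anon_inode:inotify" under <proc_root>/<pid>/fd.
    """
    counts = Counter()
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # skip non-process entries such as "self" or "sys"
        pid_dir = os.path.join(proc_root, entry)
        fd_dir = os.path.join(pid_dir, "fd")
        try:
            uid = os.stat(pid_dir).st_uid
            fds = os.listdir(fd_dir)
        except OSError:
            continue  # process exited or permission denied
        for fd in fds:
            try:
                target = os.readlink(os.path.join(fd_dir, fd))
            except OSError:
                continue  # descriptor was closed in the meantime
            if target == "anon_inode:inotify":
                counts[uid] += 1
    return counts


def read_limit(path="/proc/sys/fs/inotify/max_user_instances"):
    """Read the per-user inotify instance limit from procfs."""
    with open(path) as f:
        return int(f.read().strip())


if __name__ == "__main__":
    limit = read_limit()
    for uid, used in count_inotify_instances().most_common():
        print(f"uid={uid} inotify instances={used}/{limit}")
```

Running this once with 160 Runners and once with 180 Runners should show whether any user's count approaches the limit of 128 before the systemctl restart fails.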

I would consider this low priority, as it seems to be an error that only occurs when using systemctl. Also, investigating this issue is only a minor side path compared to the Nomad agents crashing over multiple restarts.

@MrSerth
Member

MrSerth commented Sep 10, 2024

Thanks for digging further. Based on your discoveries I agree about the relatively lower priority of this issue 👍. Shall we set it to pending?
