Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

wait_for_workers got stuck when to create cluster but application failed on yarn #146

Open
FANLONGFANLONG opened this issue Apr 9, 2021 · 3 comments

Comments

@FANLONGFANLONG
Copy link

I put 500 GB for a worker accidentally to start yarn cluster. But looks likely

  1. I can create cluster(application) on yarn because i got cluster.app_id and cluster.scheduler_address.

  2. i use client.wait_for_workers to wait for worker ready

  3. but dask got stuck.
    image

  4. i checked application on yarn and it is failed already
    image

the issue is reproduced.

May I know what I could do to avoid the problem?

@fjetter
Copy link
Member

fjetter commented Apr 12, 2021

This sounds like a yarn specific problem and I suggest to repost this on the issue tracker of https://github.com/dask/dask-yarn

@jrbourbeau jrbourbeau transferred this issue from dask/distributed Apr 12, 2021
@jrbourbeau
Copy link
Member

Just transferred this issue over to the dask-yarn repo

@FANLONGFANLONG
Copy link
Author

@jrbourbeau thanks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants