Replies: 2 comments 9 replies
-
You would expect the two tasks to run simultaneously, which may require copying of data between workers. Dask doesn't like copying data, but it also doesn't like idle CPUs, so it has heuristics in place to decide when to copy and when not. The dashboard will show you what is going on. |
Beta Was this translation helpful? Give feedback.
9 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi there,
I have the following question. Imagine we have two Dask worker processes and two sets of data (for example, two pandas DataFrames), which are located in one of the workers. Suppose we submit a long-running task into the worker, which manages both the DataFrames, whereas we submit another task into other worker, which requires the data from the first worker to execute the task.
My question is that will the second worker wait for completion of the first task in the first worker in order for it to submit required data into the second worker?
Thanks in advance!
Beta Was this translation helpful? Give feedback.
All reactions