Add draft of Dask NEP-18 advances #15
Conversation
@mrocklin let me know what you think of this.

cc also @jakirkham
I think it's a fine blogpost. Some large-scale comments/questions:

Thanks for the comments, replying:
Dask adds some overhead. I think it's worth trying out NumPy in isolation as well.
I don't think that users care. All's fair in love, war, and performance. Scikit-Learn uses solvers that are asymptotically better than those used in Dask-GLM. Dask-GLM's solvers were designed to scale well, but they're not particularly good on a single node.
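For concreteness, a rough sketch of what timing "NumPy in isolation" against CuPy could look like — the toy logistic-regression step and the data sizes here are made up for illustration, not taken from the post's actual benchmark:

```python
# Hypothetical sketch (not the post's benchmark): time the same NumPy-API
# code on plain NumPy and CuPy arrays, with no Dask in the loop.
# Assumes CuPy is installed and a CUDA GPU is available.
import time

import numpy as np
import cupy


def fit_logistic(X, y, beta, lr=0.1, n_iter=100):
    # Written against the NumPy API only; np.exp dispatches to the input
    # array's own implementation (NEP-13/NEP-18), so the same function
    # runs on CPU (NumPy) or GPU (CuPy) unchanged.
    for _ in range(n_iter):
        p = 1 / (1 + np.exp(-(X @ beta)))
        beta = beta - lr * (X.T @ (p - y)) / len(y)
    return beta


for xp, name in [(np, "NumPy"), (cupy, "CuPy")]:
    X = xp.random.random((100000, 100))
    y = (xp.random.random(100000) > 0.5).astype(X.dtype)
    beta = xp.zeros(100)
    start = time.time()
    fit_logistic(X, y, beta)
    if xp is cupy:
        cupy.cuda.Stream.null.synchronize()  # wait for queued GPU kernels
    print(f"{name} took {1000 * (time.time() - start):.3f} ms to fit")
```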
Well, there are larger scale things here that I think are very exciting. You've gotten a pre-existing codebase to mostly work on GPUs without doing that much work. From an ecosystem perspective that's pretty interesting.
I didn't mean compute performance exclusively, but quality in general: the results could differ substantially, and I was trying to avoid leaving a misleading conclusion for the reader based on information I don't currently have, for example: "CuPy is slower than Scikit-Learn, so its quality must be much higher".
Ok, so you're talking more from a design perspective now; in that respect you're right. I was thinking more from the performance and application points of view. I'll have to rethink a bit what to write, and how, to emphasize that perspective.
Sure, you can give the context above, saying that Scikit-Learn just uses better algorithms that converge more quickly. You may also want to read through: http://blog.dask.org/2017/03/22/dask-glm-1
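For reference, a minimal sketch (with made-up data sizes) of the Scikit-Learn baseline being referred to; its lbfgs solver typically needs far fewer iterations than the first-order methods in Dask-GLM:

```python
# Illustrative Scikit-Learn baseline (data sizes are placeholders).
import time

import numpy as np
from sklearn.linear_model import LogisticRegression

X = np.random.random((100000, 100))
y = (np.random.random(100000) > 0.5).astype(int)

clf = LogisticRegression(solver="lbfgs")
start = time.time()
clf.fit(X, y)
print(f"Scikit-Learn lbfgs took {1000 * (time.time() - start):.3f} ms to fit")
```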
Alright, I updated this, please tell me what you think @mrocklin.

Looking forward to reading through this. Also, cc @jakirkham for review
Some feedback.
In general this looks great to me. The examples are very motivating and it's a fun read.
I could be wrong, but I don't think the estimators have changed substantially in dask-ml:
https://github.com/dask/dask-ml/commits/master/dask_ml/linear_model/glm.py
Quoting @pentschev's review comment on _posts/2019-03-11-dask-nep18.md:
> +
+```
+Solver admm with Dask took 22444.307 ms to fit and 41.299 ms to predict
+Solver admm with CuPy took 263.742 ms to fit and 1.146 ms to predict
+
+Solver lbfgs with Dask took 1561.047 ms to fit and 41.063 ms to predict
+Solver lbfgs with CuPy took 13.935 ms to fit and 0.260 ms to predict
+
+Solver newton with Dask took 785.150 ms to fit and 42.822 ms to predict
+Solver newton with CuPy took 22.480 ms to fit and 0.248 ms to predict
+
+Solver proximal_grad with Dask took 1902.620 ms to fit and 45.349 ms to predict
+Solver proximal_grad with CuPy took 11.700 ms to fit and 0.257 ms to predict
+
+Solver gradient_descent with Dask took 2483.603 ms to fit and 42.605 ms to predict
+Solver gradient_descent with CuPy took 14.580 ms to fit and 0.257 ms to predict
Also, Dask-GLM estimators were deprecated in dask/dask-glm#66. Maybe they indeed have a bug which may or may not have been fixed in Dask-ML.
@TomAugspurger that's possible, I'll try to investigate causes later, but it's probably useful to try out those in Dask-ML too.
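A hedged sketch of what trying those out in Dask-ML might look like — it assumes dask_ml.linear_model.LogisticRegression accepts the same solver names as the deprecated Dask-GLM estimators, and the data sizes are placeholders:

```python
# Hypothetical sketch: repeat the benchmark with Dask-ML's estimator.
import time

import dask.array as da
from dask_ml.linear_model import LogisticRegression

X = da.random.random((100000, 100), chunks=(10000, 100))
y = (da.random.random(100000, chunks=10000) > 0.5).astype(int)

for solver in ["admm", "lbfgs", "newton", "proximal_grad", "gradient_descent"]:
    est = LogisticRegression(solver=solver)
    start = time.time()
    est.fit(X, y)
    fit_ms = 1000 * (time.time() - start)
    start = time.time()
    est.predict(X).compute()  # predict returns a lazy dask array
    predict_ms = 1000 * (time.time() - start)
    print(f"Solver {solver} took {fit_ms:.3f} ms to fit "
          f"and {predict_ms:.3f} ms to predict")
```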
For the time being, I don't plan any more changes. Ready for another review.
I think that this looks great. The one suggestion I would make is in the axes of the timing plots. The main point of these plots seems to be to compare the relative performance between NumPy and CuPy. To this end, it would be useful to have the NumPy and CuPy performance numbers on the same axes, so that one can see visually when the CuPy numbers are higher or lower than NumPy's. Currently they look visually similar at first, until you look at the numbers on the y-axes. I could imagine rearranging the plots so that there is a different plot per solver, throwing all of the solvers into one plot, or keeping the two separate plots but fixing the axes of the second plot to match the axes of the first.
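For instance, a sketch of the one-plot-per-solver arrangement with a shared y-axis — the timings below are placeholder numbers, not the post's measurements:

```python
# Illustrative only: one subplot per solver, shared y-axis so the relative
# NumPy/CuPy performance is visible at a glance.
import matplotlib.pyplot as plt

solvers = ["admm", "lbfgs", "newton", "proximal_grad", "gradient_descent"]
numpy_ms = [250, 160, 90, 180, 230]  # placeholder fit times (ms)
cupy_ms = [26, 14, 22, 12, 15]       # placeholder fit times (ms)

fig, axes = plt.subplots(1, len(solvers), sharey=True, figsize=(15, 3))
for ax, solver, n_ms, c_ms in zip(axes, solvers, numpy_ms, cupy_ms):
    ax.bar(["NumPy", "CuPy"], [n_ms, c_ms])
    ax.set_title(solver)
axes[0].set_ylabel("Fit time (ms)")
fig.tight_layout()
plt.show()
```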
(I'm also happy to just push the merge button, this is great as it is)
@mrocklin I think the split by solver is the better solution. When first plotting the graphs I thought it would be too cluttered, but I did it now and it looks fine to me. So if there are no more comments, from my side it's ok to merge.
Those plots do look nice :)

I plan to merge this PR in an hour if there are no objections.

I've merged this in and it's live. It's a bit late in the week to start publicizing it, though; I'll tweet about it early next week.

Now that you've mentioned it, maybe we should have waited until Monday to publish it. Well, too late now I guess. :)

I've just made it a draft, which should keep it off of the TOC for now.

Nice, please publish it at the most appropriate time; you probably know better than I do which time is best.
Thanks @pentschev. This is really great. I read an earlier version of this and really like the improvements you have made since. @mrocklin and I played with a benchmark based on your work earlier this week and it really shows the improvements you have made here! 😄 Sorry I was late to review. I have submitted some small tweaks to the text in PR #18. These are mostly minor syntax changes, though please take a look and make sure I haven't lost something in the changes. As another note, it would be nice to list the BLAS and LAPACK library used with NumPy for the CPU case (can just run …)
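One way to report which BLAS/LAPACK build NumPy is linked against — this may or may not be the exact command meant above:

```python
# Prints NumPy's build configuration, including the BLAS/LAPACK
# libraries it was linked against.
import numpy as np

np.show_config()
```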