NMS in GPU #28

MaskVulcan · 2018-01-09T08:20:57Z

I found that the NMS operation runs on the CPU. Is there any way to move it to the GPU?

RobertCsordas · 2018-01-09T10:28:42Z

As far as I know, TF only has a CPU-based NMS implementation. I don't think it can be parallelized well enough to get any significant performance gain on a GPU.

machanic · 2018-01-12T07:20:19Z

@xdever I think that if you compute the IoU for all pairs of boxes first, a single for-loop pass over the boxes is then enough, which should speed things up considerably. There are some tricks involved; see the source code at https://github.com/rbgirshick/py-faster-rcnn/tree/master/lib/nms
I think the CUDA version of NMS would be faster than the CPU version if we compile it against TensorFlow. The code above still has one issue when used with TensorFlow: it copies the data from CPU to GPU memory. We would just need to define a new op and copy rbgirshick's source code into it.
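
To make the two-step idea concrete, here is a rough pure-Python sketch (not the py-faster-rcnn code, and not a TensorFlow op). The names `iou` and `nms` and the `(x1, y1, x2, y2)` box layout are my own assumptions for illustration. Step 1 fills a pairwise overlap mask, where every entry is independent of the others and so is the part that could map onto GPU threads. Step 2 is the single sequential loop that reads the mask.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes, scores, iou_threshold=0.5):
    """Return indices of kept boxes, highest score first."""
    # Visit boxes in order of decreasing score.
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    n = len(order)

    # Step 1: pairwise overlap mask. Each entry depends only on two boxes,
    # so all n*n entries could be computed in parallel.
    overlaps = [
        [iou(boxes[order[i]], boxes[order[j]]) > iou_threshold for j in range(n)]
        for i in range(n)
    ]

    # Step 2: one sequential pass. A box that survived suppresses every
    # lower-scored box that overlaps it too much.
    suppressed = [False] * n
    keep = []
    for i in range(n):
        if suppressed[i]:
            continue
        keep.append(order[i])
        for j in range(i + 1, n):
            if overlaps[i][j]:
                suppressed[j] = True
    return keep


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
print(nms(boxes, scores))
```

In this example the second box overlaps the first with an IoU of about 0.68, so it is dropped at the default threshold, while the third box does not overlap anything and is kept. Only step 2 is inherently sequential, which is why precomputing the mask is what makes a GPU version worthwhile.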
