how to speed up the post-process? #53

Ivan-VV · 2021-08-23T02:09:14Z

Thx for your great work!
I found the speed of post-process is too slow, and the bottleneck is torch::masked_select() function ,as the picture shows.

And then I set the environment variable CUDA_LAUNCH_BLOCKING=1 as #3, I found the speed of inference is too slow.

So would you like to give me any advice about solving this problem? Thank you very much!

The text was updated successfully, but these errors were encountered:

lvdonghan5 · 2021-11-01T06:50:58Z

The problem also troubles me.The post-processing part takes a lot of time only using CPU instead of GPU.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

how to speed up the post-process? #53

how to speed up the post-process? #53

Ivan-VV commented Aug 23, 2021

lvdonghan5 commented Nov 1, 2021

how to speed up the post-process? #53

how to speed up the post-process? #53

Comments

Ivan-VV commented Aug 23, 2021

lvdonghan5 commented Nov 1, 2021