Skip to content

how to speed up the post-process? #53

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
Ivan-VV opened this issue Aug 23, 2021 · 1 comment
Open

how to speed up the post-process? #53

Ivan-VV opened this issue Aug 23, 2021 · 1 comment

Comments

@Ivan-VV
Copy link

Ivan-VV commented Aug 23, 2021

Thx for your great work!
I found the speed of post-process is too slow, and the bottleneck is torch::masked_select() function ,as the picture shows.
1
And then I set the environment variable CUDA_LAUNCH_BLOCKING=1 as #3, I found the speed of inference is too slow.
2
So would you like to give me any advice about solving this problem? Thank you very much!

@lvdonghan5
Copy link

The problem also troubles me.The post-processing part takes a lot of time only using CPU instead of GPU.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants