Processing time is much higher with TensorRT EP than with CUDA EP

When running SuperGlue with the TensorRT Execution Provider, I see significantly higher latency compared to using the CUDA Execution Provider.