[BYOC] Fix build with TensorRT 8 #9047
I got MyPy lint errors on files that I didn't touch: https://ci.tlcpack.ai/blue/organizations/jenkins/tvm/detail/PR-9047/1/pipeline
@masahi Question on this. I noticed you deleted the deprecating changes from
As long as config_ has the FP16 or INT8 flag set when building the engine, we don't need to call builder_->setFp16Mode(true) or builder_->setInt8Mode(true); both functions were removed in the TRT 8 API.
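To illustrate the point in the comment above, here is a minimal sketch of choosing precision once on a config object, with no extra builder-level call. It does not use the TensorRT headers: PrecisionFlag, BuilderConfigSketch and ConfigurePrecision are hypothetical stand-ins that only imitate the shape of a setFlag-style API, and the bit layout is an assumption made for the example.

```cpp
#include <cstdint>

// Hypothetical stand-in for a builder flag enum (not the real TensorRT type).
enum class PrecisionFlag : std::uint32_t { kFP16 = 0, kINT8 = 1 };

// Hypothetical stand-in for a builder config that stores flags as a bitmask.
class BuilderConfigSketch {
 public:
  // Turn a single flag on.
  void setFlag(PrecisionFlag flag) noexcept { flags_ |= bit(flag); }

  // Report whether a single flag is on.
  bool getFlag(PrecisionFlag flag) const noexcept {
    return (flags_ & bit(flag)) != 0;
  }

 private:
  // Map an enum value to its bit position (assumed layout for this sketch).
  static std::uint32_t bit(PrecisionFlag flag) noexcept {
    return 1u << static_cast<std::uint32_t>(flag);
  }

  std::uint32_t flags_ = 0;
};

// Precision is selected only through the config; nothing is set on the
// builder itself, which is the pattern the comment above relies on.
inline void ConfigurePrecision(BuilderConfigSketch& config, bool use_fp16,
                               bool use_int8) {
  if (use_fp16) config.setFlag(PrecisionFlag::kFP16);
  if (use_int8) config.setFlag(PrecisionFlag::kINT8);
}
```

With this shape, the code that builds the engine only needs to inspect the config, so a second precision switch on the builder would carry no extra information.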
I think this PR should be fine. @Laurawly @trevor-m @comaniac
LGTM
btw, it would be good if you could also update the TensorRT document (https://tvm.apache.org/docs/deploy/tensorrt.html) about the supported versions.
Has anyone seen any performance increases with TRT 8?
I tested several models, such as ResNet and a super-resolution model, on a T4, and they showed a slight performance gain (TRT 8 probably doesn't add much optimization for vision models). I didn't test large models such as BERT, so I'm not sure how much they gain, but the NVIDIA team says BERT models see a large performance gain with TRT 8. We also need to check which ops TRT 8 newly supports, so that we can register them in https://github.com/apache/tvm/blob/main/src/runtime/contrib/tensorrt/tensorrt_ops.cc.
* fix compile error missing noexcept in overridden methods * remove deprecated builder method call
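The noexcept part of the commit message can be sketched as follows. When a base-class virtual method is declared noexcept, an override without noexcept is rejected by the compiler, which matches the kind of build error described here. LoggerInterface and CountingLogger are hypothetical stand-ins written for this example and are not the TensorRT or TVM classes.

```cpp
#include <utility>

// Hypothetical stand-in for an interface whose virtual method is noexcept.
class LoggerInterface {
 public:
  enum class Severity { kError, kWarning, kInfo };

  virtual ~LoggerInterface() = default;

  // The base declaration promises not to throw.
  virtual void log(Severity severity, const char* msg) noexcept = 0;
};

// Hypothetical implementation that counts non-info messages.
class CountingLogger : public LoggerInterface {
 public:
  // Removing `noexcept` from this line is a compile error, because an
  // override may not have a looser exception specification than the base.
  void log(Severity severity, const char* /*msg*/) noexcept override {
    if (severity != Severity::kInfo) ++count_;
  }

  int count() const noexcept { return count_; }

 private:
  int count_ = 0;
};

// Compile-time check that the override really is noexcept.
static_assert(noexcept(std::declval<CountingLogger&>().log(
                  LoggerInterface::Severity::kError, "")),
              "log override must be noexcept");
```

The fix in a case like this is mechanical: add noexcept to each override so its signature matches the base declaration.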
I tried to use TRT BYOC with the latest TensorRT version, but got a build error. The code is using functions that were deprecated in TRT 7 and removed in 8.
These function calls should be redundant, since we are also using the config_->setFlag(...) API anyway.
tvm/src/runtime/contrib/tensorrt/tensorrt_builder.cc, lines 171 to 172 in 6f5b674
@trevor-m @comaniac @tiandiao123