refine EngineIOConverter, and use io_convert in test_trt_activation_op #10495

luotao1 · 2018-05-08T10:20:45Z

change EngineInputConverter to EngineIOConverter, which has two function: ConvertInput(LoDTensor->ITensor) and ConvertOutput(ITensor -> LoDTensor)
improve the unit-test test_io_converter.cc
use EngineIOConverter in unit-test test_activation_op.cc
remove duplicated cudaMemcpyAsync in SetInputFromCPU

Superjomn · 2018-05-09T02:54:06Z

paddle/fluid/inference/tensorrt/convert/io_converter.cc

-                                        cudaMemcpyHostToHost, *stream_));
-
+      PADDLE_ENFORCE_EQ(0, cudaMemcpyAsync(out, in.data<float>(), size,
+                                           cudaMemcpyHostToHost, *stream_));


not DeiveceToDevice?

Superjomn · 2018-05-09T03:02:25Z

paddle/fluid/inference/tensorrt/convert/io_converter.h

 */
-class EngineInputConverter {
+class EngineIOConverter {


Add comments about how to register a convter from fluid to TRT or from TRT to fluid.

For example, a converter for a special Op called x that from fluid to TRT will register as Fluid(x)->TRT.

Or a special Layer in TRT called y that convert output from TRT to fluid will register as TRT(y)->Fluid.

add some comments at first.

Superjomn

LGTM

Superjomn · 2018-05-15T05:37:08Z

paddle/fluid/inference/tensorrt/convert/test_activation_op.cc

@@ -26,7 +27,7 @@ namespace paddle {
 namespace inference {
 namespace tensorrt {

-void Compare(float input, float expect) {
+void Compare(const std::string op_type, float input, float expect) {


const string&

I will correct in next PR.

refine EngineIOConverter, and use io_convert in test_trt_activation_op

89dcb0b

luotao1 added the 预测原名Inference，包含Capi预测问题等 label May 8, 2018

luotao1 requested a review from Superjomn May 8, 2018 11:21

Merge branch 'develop' into refine_relu_test

0ae97e8

Superjomn reviewed May 9, 2018

View reviewed changes

Merge branch 'develop' into refine_relu_test

40b8b63

luotao1 added 4 commits May 14, 2018 17:53

Merge branch 'develop' into refine_relu_test

a3ba264

use the latest buffer to update the convert

4f5f0be

Merge branch 'develop' into refine_relu_test

be41c2f

Merge branch 'develop' into refine_relu_test

1992f70

Superjomn approved these changes May 15, 2018

View reviewed changes

luotao1 merged commit 6cbe597 into PaddlePaddle:develop May 15, 2018

luotao1 deleted the refine_relu_test branch May 15, 2018 05:53

luotao1 mentioned this pull request May 22, 2018

Relu op TRT Converter #10630

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refine EngineIOConverter, and use io_convert in test_trt_activation_op #10495

refine EngineIOConverter, and use io_convert in test_trt_activation_op #10495

luotao1 commented May 8, 2018

Superjomn May 9, 2018

luotao1 May 9, 2018

Superjomn May 9, 2018

luotao1 May 14, 2018

Superjomn left a comment

Superjomn May 15, 2018

luotao1 May 15, 2018

refine EngineIOConverter, and use io_convert in test_trt_activation_op #10495

refine EngineIOConverter, and use io_convert in test_trt_activation_op #10495

Conversation

luotao1 commented May 8, 2018

Superjomn May 9, 2018

Choose a reason for hiding this comment

luotao1 May 9, 2018

Choose a reason for hiding this comment

Superjomn May 9, 2018

Choose a reason for hiding this comment

luotao1 May 14, 2018

Choose a reason for hiding this comment

Superjomn left a comment

Choose a reason for hiding this comment

Superjomn May 15, 2018

Choose a reason for hiding this comment

luotao1 May 15, 2018

Choose a reason for hiding this comment