Softmax MKLDNN FLUID operator #9214
Conversation
Force-pushed from 083b8bd to 29b68ef
LGTM. @tensor-tang Could you help review the

using mkldnn::stream;

template <typename T>
class SoftmaxMkldnnKernel : public paddle::framework::OpKernel<T> {
SoftmaxMkldnnKernel
==> SoftmaxMKLDNNKernel
Ok
@@ -81,6 +81,7 @@ def fc(input,
           num_flatten_dims=1,
           param_attr=None,
           bias_attr=None,
+          use_mkldnn=False,
Why would a softmax PR change fc?
        attrs={
            "x_num_col_dims": num_flatten_dims,
            "y_num_col_dims": 1,
            'use_mkldnn': use_mkldnn
Same as above: this should be added in the FC PR, right?
@tensor-tang You are correct, but without those changes to FC, Softmax MKLDNN would remain unused code. Until MKLDNN FC is ready (it is under development), there is no other way for users to take advantage of the softmax MKLDNN operator, since it is usually used as the activation of an FC layer. Do you suggest removing the use_mkldnn attribute/param from FC?
@tensor-tang, @luotao1 Should I remove the use_mkldnn attribute from FC in this PR?
I think there should be some other way to use MKLDNN softmax without changing fc. But since we will implement MKLDNN FC soon, I think it's fine; you do not need to revert it now.
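For context, here is a minimal usage sketch of the path discussed above, assuming the use_mkldnn flag added to fc in this PR propagates to its softmax activation; the input shape and layer size are illustrative only.

import paddle.fluid as fluid

# An FC layer whose softmax activation can be dispatched to the MKLDNN
# softmax operator when use_mkldnn is enabled (illustrative shapes/sizes).
data = fluid.layers.data(name="x", shape=[32], dtype="float32")
prediction = fluid.layers.fc(input=data,
                             size=10,
                             act="softmax",
                             use_mkldnn=True)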
Force-pushed from 9df2a09 to 228e0dc
- Removed diagnostic
- Added unit tests for Softmax MKLDNN forward
- Added fix for div by 0 in cross_entropy backward (conflicts: paddle/fluid/operators/CMakeLists.txt)
- Cosmetic fixes to the SoftMax MKLDNN fluid operator
- Added missing softmax fluid operator file
- Disabled the MKLDNN softmax operator by default
- Fix to the softmax op unittest after merge
- clang-format fixes
- Renamed the softmax MKLDNN operator to maintain consistency across the codebase
- Updated comment; fix to comment
Force-pushed from 228e0dc to 3b95b55
This PR provides an MKLDNN-based Softmax op implementation.
Performance and testing:
On the tested models, the softmax MKLDNN op is roughly 10x faster than the plain CPU version.
RNN Search (https://github.com/dzhwinter/benchmark/blob/master/fluid/machine_translation.py)
executes training in ~90% of the plain CPU time. RNN Search does converge with the softmax MKLDNN op in use.
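As a complement to the unit tests mentioned in the commit list above ("Added unit tests for Softmax MKLDNN forward"), below is a minimal sketch of the kind of NumPy reference check such a forward test might perform; the class and helper names are illustrative, and the repo's actual OpTest-based tests may differ.

import unittest
import numpy as np


def stable_softmax(x):
    # Numerically stable softmax over a 1-D vector.
    shifted = x - np.max(x)
    exps = np.exp(shifted)
    return exps / np.sum(exps)


class TestSoftmaxForwardReference(unittest.TestCase):
    def test_rows_sum_to_one(self):
        # Row-wise softmax of a random batch; every output row must sum to 1.
        x = np.random.uniform(0.1, 1.0, [10, 10]).astype("float32")
        out = np.apply_along_axis(stable_softmax, 1, x)
        self.assertTrue(np.allclose(out.sum(axis=1), np.ones(10), atol=1e-5))


if __name__ == "__main__":
    unittest.main()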
Notes