We plan to release one modified architecture implemented by MXNet for image classification.
A MXNet implementation of Modified CBAMnet.
In this part, we implement a modified CBAMnet (CBAM Resnet 100) architecture via MXNet. The original one is described in the paper CBAM: Convoluational Block Attention Module proposed by Sanghyun Woo, Jongchan Park, Joon-Young Lee, and In So Kweon. This paper is accepted by ECCV 2018.
This is an overview of a convolutional block attention module (CBAM).
Each attention sub-module is illustrated as following diagram:
The Residual building block integrated with CBAM is demonstrated as the following figure:
We implement the modified CBAMnet based on original CBAMnet 100 (ResNet 100 + CBAM).
In our implementation, we use 1x1 convolution layer to replace the fully connected layer in MLP.
1. The size of input data is 112x112 not 224x224. In order to preserve higher feature map resolution, we follow the setting of input in [2]. Specifically, The first convolution layer with 7x7 kernel size and 2 stride is replaced by 3x3 kernel size and 1 stride. Moreover, we remove the following max pooling layer with 3x3 kernel size and 2 stride.
2. We adopt the improved residual unit mentioned in [2]. Specifically, the improved residual unit is constructed by BN-Conv-BN-PReLu-Conv-BN, where BN denotes batch normalization layer, PReLu is Parametric Rectified Linear Unit activation layer and Conv means convolution layer.
3. We replace all ReLu activation layers with PReLu activation layers in our whole architecture.
4. We follow the output setting mentioned in [2]. Specifically, we choose Option-E with structure of BN-Dropout-FC-BN after the last convolutional layer, where Dropout means dropout layer and FC denotes fully connected layer.
This modified Convolutional Block Attention Module based Residual Network architecture can be directly integrated into the library of insightface.
[1] Sanghyun Woo, Jongchan Park, Joon-Young Lee, and In So Kweon. CBAM: Convoluational Block Attention Module ECCV 2018.
[2] Jiankang Deng, Jia Guo, Stefanos Zafeiriou. "ArcFace: Additive Angular Margin Loss for Deep Face Recognition"
[3] Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun. "Identity Mappings in Deep Residual Networks"
[4] Jie Hu, Li Shen and Gang Sun. "Squeeze-and-Excitation Networks"