-
Notifications
You must be signed in to change notification settings - Fork 2.6k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[Feature] Support MobileNetV2 backbone (#86)
* [Feature] Support MobileNetV2 backbone * Fixed import * Fixed test * Fixed test * Fixed dilate * upload model * update table * update table * update bibtex * update MMCV requirement
- Loading branch information
Showing
15 changed files
with
433 additions
and
4 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,32 @@ | ||
# MobileNetV2: Inverted Residuals and Linear Bottlenecks | ||
|
||
## Introduction | ||
|
||
``` | ||
@inproceedings{sandler2018mobilenetv2, | ||
title={Mobilenetv2: Inverted residuals and linear bottlenecks}, | ||
author={Sandler, Mark and Howard, Andrew and Zhu, Menglong and Zhmoginov, Andrey and Chen, Liang-Chieh}, | ||
booktitle={Proceedings of the IEEE conference on computer vision and pattern recognition}, | ||
pages={4510--4520}, | ||
year={2018} | ||
} | ||
``` | ||
|
||
|
||
## Results and models | ||
|
||
### Cityscapes | ||
| Method | Backbone | Crop Size | Lr schd | Mem (GB) | Inf time (fps) | mIoU | mIoU(ms+flip) | download | | ||
|------------|----------|-----------|--------:|---------:|----------------|------:|---------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | ||
| FCN | M-V2-D8 | 512x1024 | 80000 | 3.4 | 14.2 | 61.54 | - | [model](https://openmmlab.oss-accelerate.aliyuncs.com/mmsegmentation/v0.5/mobilenet_v2/fcn_m-v2-d8_512x1024_80k_cityscapes/fcn_m-v2-d8_512x1024_80k_cityscapes_20200825_124817-d24c28c1.pth) | [log](https://openmmlab.oss-accelerate.aliyuncs.com/mmsegmentation/v0.5/mobilenet_v2/fcn_m-v2-d8_512x1024_80k_cityscapes/fcn_m-v2-d8_512x1024_80k_cityscapes-20200825_124817.log.json) | | ||
| PSPNet | M-V2-D8 | 512x1024 | 80000 | 3.6 | 11.2 | 70.23 | - | [model](https://openmmlab.oss-accelerate.aliyuncs.com/mmsegmentation/v0.5/mobilenet_v2/pspnet_m-v2-d8_512x1024_80k_cityscapes/pspnet_m-v2-d8_512x1024_80k_cityscapes_20200825_124817-19e81d51.pth) | [log](https://openmmlab.oss-accelerate.aliyuncs.com/mmsegmentation/v0.5/mobilenet_v2/pspnet_m-v2-d8_512x1024_80k_cityscapes/pspnet_m-v2-d8_512x1024_80k_cityscapes-20200825_124817.log.json) | | ||
| DeepLabV3 | M-V2-D8 | 512x1024 | 80000 | 3.9 | 8.4 | 73.84 | - | [model](https://openmmlab.oss-accelerate.aliyuncs.com/mmsegmentation/v0.5/mobilenet_v2/deeplabv3_m-v2-d8_512x1024_80k_cityscapes/deeplabv3_m-v2-d8_512x1024_80k_cityscapes_20200825_124836-bef03590.pth) | [log](https://openmmlab.oss-accelerate.aliyuncs.com/mmsegmentation/v0.5/mobilenet_v2/deeplabv3_m-v2-d8_512x1024_80k_cityscapes/deeplabv3_m-v2-d8_512x1024_80k_cityscapes-20200825_124836.log.json) | | ||
| DeepLabV3+ | M-V2-D8 | 512x1024 | 80000 | 5.1 | 8.4 | 75.20 | - | [model](https://openmmlab.oss-accelerate.aliyuncs.com/mmsegmentation/v0.5/mobilenet_v2/deeplabv3plus_m-v2-d8_512x1024_80k_cityscapes/deeplabv3plus_m-v2-d8_512x1024_80k_cityscapes_20200825_124836-d256dd4b.pth) | [log](https://openmmlab.oss-accelerate.aliyuncs.com/mmsegmentation/v0.5/mobilenet_v2/deeplabv3plus_m-v2-d8_512x1024_80k_cityscapes/deeplabv3plus_m-v2-d8_512x1024_80k_cityscapes-20200825_124836.log.json) | | ||
|
||
### ADE20k | ||
| Method | Backbone | Crop Size | Lr schd | Mem (GB) | Inf time (fps) | mIoU | mIoU(ms+flip) | download | | ||
|------------|----------|-----------|--------:|---------:|----------------|------:|---------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | ||
| FCN | M-V2-D8 | 512x512 | 160000 | 6.5 | 64.4 | 19.71 | - | [model](https://openmmlab.oss-accelerate.aliyuncs.com/mmsegmentation/v0.5/mobilenet_v2/fcn_m-v2-d8_512x512_160k_ade20k/fcn_m-v2-d8_512x512_160k_ade20k_20200825_214953-c40e1095.pth) | [log](https://openmmlab.oss-accelerate.aliyuncs.com/mmsegmentation/v0.5/mobilenet_v2/fcn_m-v2-d8_512x512_160k_ade20k/fcn_m-v2-d8_512x512_160k_ade20k-20200825_214953.log.json) | | ||
| PSPNet | M-V2-D8 | 512x512 | 160000 | 6.5 | 57.7 | 29.68 | - | [model](https://openmmlab.oss-accelerate.aliyuncs.com/mmsegmentation/v0.5/mobilenet_v2/pspnet_m-v2-d8_512x512_160k_ade20k/pspnet_m-v2-d8_512x512_160k_ade20k_20200825_214953-f5942f7a.pth) | [log](https://openmmlab.oss-accelerate.aliyuncs.com/mmsegmentation/v0.5/mobilenet_v2/pspnet_m-v2-d8_512x512_160k_ade20k/pspnet_m-v2-d8_512x512_160k_ade20k-20200825_214953.log.json) | | ||
| DeepLabV3 | M-V2-D8 | 512x512 | 160000 | 6.8 | 39.9 | 34.08 | - | [model](https://openmmlab.oss-accelerate.aliyuncs.com/mmsegmentation/v0.5/mobilenet_v2/deeplabv3_m-v2-d8_512x512_160k_ade20k/deeplabv3_m-v2-d8_512x512_160k_ade20k_20200825_223255-63986343.pth) | [log](https://openmmlab.oss-accelerate.aliyuncs.com/mmsegmentation/v0.5/mobilenet_v2/deeplabv3_m-v2-d8_512x512_160k_ade20k/deeplabv3_m-v2-d8_512x512_160k_ade20k-20200825_223255.log.json) | | ||
| DeepLabV3+ | M-V2-D8 | 512x512 | 160000 | 8.2 | 43.1 | 34.02 | - | [model](https://openmmlab.oss-accelerate.aliyuncs.com/mmsegmentation/v0.5/mobilenet_v2/deeplabv3plus_m-v2-d8_512x512_160k_ade20k/deeplabv3plus_m-v2-d8_512x512_160k_ade20k_20200825_223255-465a01d4.pth) | [log](https://openmmlab.oss-accelerate.aliyuncs.com/mmsegmentation/v0.5/mobilenet_v2/deeplabv3plus_m-v2-d8_512x512_160k_ade20k/deeplabv3plus_m-v2-d8_512x512_160k_ade20k-20200825_223255.log.json) | |
12 changes: 12 additions & 0 deletions
12
configs/mobilenet_v2/deeplabv3_m-v2-d8_512x1024_80k_cityscapes.py
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,12 @@ | ||
_base_ = '../deeplabv3/deeplabv3_r101-d8_512x1024_80k_cityscapes.py' | ||
model = dict( | ||
pretrained='mmcls://mobilenet_v2', | ||
backbone=dict( | ||
_delete_=True, | ||
type='MobileNetV2', | ||
widen_factor=1., | ||
strides=(1, 2, 2, 1, 1, 1, 1), | ||
dilations=(1, 1, 1, 2, 2, 4, 4), | ||
out_indices=(1, 2, 4, 6)), | ||
decode_head=dict(in_channels=320), | ||
auxiliary_head=dict(in_channels=96)) |
12 changes: 12 additions & 0 deletions
12
configs/mobilenet_v2/deeplabv3_m-v2-d8_512x512_160k_ade20k.py
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,12 @@ | ||
_base_ = '../deeplabv3/deeplabv3_r101-d8_512x512_160k_ade20k.py' | ||
model = dict( | ||
pretrained='mmcls://mobilenet_v2', | ||
backbone=dict( | ||
_delete_=True, | ||
type='MobileNetV2', | ||
widen_factor=1., | ||
strides=(1, 2, 2, 1, 1, 1, 1), | ||
dilations=(1, 1, 1, 2, 2, 4, 4), | ||
out_indices=(1, 2, 4, 6)), | ||
decode_head=dict(in_channels=320), | ||
auxiliary_head=dict(in_channels=96)) |
12 changes: 12 additions & 0 deletions
12
configs/mobilenet_v2/deeplabv3plus_m-v2-d8_512x1024_80k_cityscapes.py
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,12 @@ | ||
_base_ = '../deeplabv3plus/deeplabv3plus_r101-d8_512x1024_80k_cityscapes.py' | ||
model = dict( | ||
pretrained='mmcls://mobilenet_v2', | ||
backbone=dict( | ||
_delete_=True, | ||
type='MobileNetV2', | ||
widen_factor=1., | ||
strides=(1, 2, 2, 1, 1, 1, 1), | ||
dilations=(1, 1, 1, 2, 2, 4, 4), | ||
out_indices=(1, 2, 4, 6)), | ||
decode_head=dict(in_channels=320, c1_in_channels=24), | ||
auxiliary_head=dict(in_channels=96)) |
12 changes: 12 additions & 0 deletions
12
configs/mobilenet_v2/deeplabv3plus_m-v2-d8_512x512_160k_ade20k.py
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,12 @@ | ||
_base_ = '../deeplabv3plus/deeplabv3plus_r101-d8_512x512_160k_ade20k.py' | ||
model = dict( | ||
pretrained='mmcls://mobilenet_v2', | ||
backbone=dict( | ||
_delete_=True, | ||
type='MobileNetV2', | ||
widen_factor=1., | ||
strides=(1, 2, 2, 1, 1, 1, 1), | ||
dilations=(1, 1, 1, 2, 2, 4, 4), | ||
out_indices=(1, 2, 4, 6)), | ||
decode_head=dict(in_channels=320, c1_in_channels=24), | ||
auxiliary_head=dict(in_channels=96)) |
12 changes: 12 additions & 0 deletions
12
configs/mobilenet_v2/fcn_m-v2-d8_512x1024_80k_cityscapes.py
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,12 @@ | ||
_base_ = '../fcn/fcn_r101-d8_512x1024_80k_cityscapes.py' | ||
model = dict( | ||
pretrained='mmcls://mobilenet_v2', | ||
backbone=dict( | ||
_delete_=True, | ||
type='MobileNetV2', | ||
widen_factor=1., | ||
strides=(1, 2, 2, 1, 1, 1, 1), | ||
dilations=(1, 1, 1, 2, 2, 4, 4), | ||
out_indices=(1, 2, 4, 6)), | ||
decode_head=dict(in_channels=320), | ||
auxiliary_head=dict(in_channels=96)) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,12 @@ | ||
_base_ = '../fcn/fcn_r101-d8_512x512_160k_ade20k.py' | ||
model = dict( | ||
pretrained='mmcls://mobilenet_v2', | ||
backbone=dict( | ||
_delete_=True, | ||
type='MobileNetV2', | ||
widen_factor=1., | ||
strides=(1, 2, 2, 1, 1, 1, 1), | ||
dilations=(1, 1, 1, 2, 2, 4, 4), | ||
out_indices=(1, 2, 4, 6)), | ||
decode_head=dict(in_channels=320), | ||
auxiliary_head=dict(in_channels=96)) |
12 changes: 12 additions & 0 deletions
12
configs/mobilenet_v2/pspnet_m-v2-d8_512x1024_80k_cityscapes.py
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,12 @@ | ||
_base_ = '../pspnet/pspnet_r101-d8_512x1024_80k_cityscapes.py' | ||
model = dict( | ||
pretrained='mmcls://mobilenet_v2', | ||
backbone=dict( | ||
_delete_=True, | ||
type='MobileNetV2', | ||
widen_factor=1., | ||
strides=(1, 2, 2, 1, 1, 1, 1), | ||
dilations=(1, 1, 1, 2, 2, 4, 4), | ||
out_indices=(1, 2, 4, 6)), | ||
decode_head=dict(in_channels=320), | ||
auxiliary_head=dict(in_channels=96)) |
12 changes: 12 additions & 0 deletions
12
configs/mobilenet_v2/pspnet_m-v2-d8_512x512_160k_ade20k.py
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,12 @@ | ||
_base_ = '../pspnet/pspnet_r101-d8_512x512_160k_ade20k.py' | ||
model = dict( | ||
pretrained='mmcls://mobilenet_v2', | ||
backbone=dict( | ||
_delete_=True, | ||
type='MobileNetV2', | ||
widen_factor=1., | ||
strides=(1, 2, 2, 1, 1, 1, 1), | ||
dilations=(1, 1, 1, 2, 2, 4, 4), | ||
out_indices=(1, 2, 4, 6)), | ||
decode_head=dict(in_channels=320), | ||
auxiliary_head=dict(in_channels=96)) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,10 +1,11 @@ | ||
from .fast_scnn import FastSCNN | ||
from .hrnet import HRNet | ||
from .mobilenet_v2 import MobileNetV2 | ||
from .resnest import ResNeSt | ||
from .resnet import ResNet, ResNetV1c, ResNetV1d | ||
from .resnext import ResNeXt | ||
|
||
__all__ = [ | ||
'ResNet', 'ResNetV1c', 'ResNetV1d', 'ResNeXt', 'HRNet', 'FastSCNN', | ||
'ResNeSt' | ||
'ResNeSt', 'MobileNetV2' | ||
] |
Oops, something went wrong.