when I use four rtx2080ti to train the maskrcnn as a baseline,the F-measure is only about 65%,is it normal？ #7

kapness · 2019-07-18T10:05:49Z

How can I get a good performance on four 11G GPUS ?

JingChaoLiu · 2019-07-21T16:12:09Z

In our training, the original Mask R-CNN indeed only achieve a F-measure of 66%. The 10% improvement in our baseline may come from: (no ablation study, no guarantee, just based on memories)

Data Augmentation +6%
OHEM +2%
Train->Test extends to Train+Validation-> Test +1%
Use the Ignore Annotation +1%

Note: the first three tricks have been elaborated in our paper. Recently，I noticed the implementation of Use the Ignore Annotation was not a part of the official implementation but from an open source repository matterport/Mask_RCNN which our private framework followed.

The main idea of Use the Ignore Annotation is when a predicted box overlaps with the groundtruth box at a high ratio, then this predicted box is labeled as ignore, in other words, neither positive nor negative. The details can be referred in build_rpn_targets of RPN and detection_targets_graph of Bbox branch. And the only difference taken from cocoapi is that the evaluation criteria, intersection / (gt_ignore_area + pred_area - intersection) < 0.001, is replaced to intersection / pred_area < 0.5 .

kapness · 2019-07-22T13:20:07Z

Thanks very much for your reply.now I have a new question，in ohem process，the paper says you select 512 difficult samples to update the network，does it mean you only provide 512 samples to ROI heads，or you only compute 512 samples as RPN loss？

…

---Original--- From: "JingChaoLiu"<[email protected]> Date: Mon, Jul 22, 2019 00:12 AM To: "STVIR/PMTD"<[email protected]>; Cc: "kapness"<[email protected]>;"Author"<[email protected]>; Subject: Re: [STVIR/PMTD] when I use four rtx2080ti to train the maskrcnn as a baseline,the F-measure is only about 65%,is it normal？ (#7) In our training, the original Mask R-CNN indeed only achieve a F-measure of 66%. The 10% improvement in our baseline may come from: (no ablation study, no guarantee, just based on memories) Data Augmentation +6% OHEM +2% Train->Test extends to Train+Validation-> Test +1% Use the Ignore Annotation +1% Note: the first three tricks have been elaborated in our paper. Recently，I noticed the implementation of Use the Ignore Annotation was not a part of the official implementation but from an open source repository matterport/Mask_RCNN which our private framework followed. The main idea of Use the Ignore Annotation is when a predicted box overlaps with the groundtruth box at a high ratio, then this predicted box is labeled as ignore, in other words, neither positive nor negative. The details can be referred in build_rpn_targets of RPN and detection_targets_graph of Bbox branch. And the only difference taken from cocoapi is that the evaluation criteria, intersection / (gt_ignore_area + pred_area - intersection) < 0.001, is replaced to intersection / pred_area < 0.5 . — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or mute the thread.

JingChaoLiu · 2019-07-22T13:28:56Z

only compute 512 samples as box_cls and box_reg loss, not in RPN

zuokai · 2019-07-23T08:24:26Z

@JingChaoLiu hi, how many Data Augmentation methods do you use?

kapness · 2019-08-08T13:50:20Z

hi ,now I have a small problem,in random crop process , do you make sure that every cropped region has at least one clear GT box ？because I find that maskrcnn can't compute loss on a picture with no GT box.. thanks for your kindness again!

…

---Original--- From: "JingChaoLiu"<[email protected]> Date: Mon, Jul 22, 2019 00:12 AM To: "STVIR/PMTD"<[email protected]>; Cc: "kapness"<[email protected]>;"Author"<[email protected]>; Subject: Re: [STVIR/PMTD] when I use four rtx2080ti to train the maskrcnn as a baseline,the F-measure is only about 65%,is it normal？ (#7) In our training, the original Mask R-CNN indeed only achieve a F-measure of 66%. The 10% improvement in our baseline may come from: (no ablation study, no guarantee, just based on memories) Data Augmentation +6% OHEM +2% Train->Test extends to Train+Validation-> Test +1% Use the Ignore Annotation +1% Note: the first three tricks have been elaborated in our paper. Recently，I noticed the implementation of Use the Ignore Annotation was not a part of the official implementation but from an open source repository matterport/Mask_RCNN which our private framework followed. The main idea of Use the Ignore Annotation is when a predicted box overlaps with the groundtruth box at a high ratio, then this predicted box is labeled as ignore, in other words, neither positive nor negative. The details can be referred in build_rpn_targets of RPN and detection_targets_graph of Bbox branch. And the only difference taken from cocoapi is that the evaluation criteria, intersection / (gt_ignore_area + pred_area - intersection) < 0.001, is replaced to intersection / pred_area < 0.5 . — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or mute the thread.

kapness · 2019-08-08T14:03:04Z

or just do random crop and set mask loss and box reg loss as 0？ because on icdar15 dataset,if I only do random crop, there are too many croppped area with no GT box,and the loss becomes bad.

…

---Original--- From: "JingChaoLiu"<[email protected]> Date: Mon, Jul 22, 2019 21:29 PM To: "STVIR/PMTD"<[email protected]>; Cc: "kapness"<[email protected]>;"Author"<[email protected]>; Subject: Re: [STVIR/PMTD] when I use four rtx2080ti to train the maskrcnn as a baseline,the F-measure is only about 65%,is it normal？ (#7) only compute 512 samples as RPN loss — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or mute the thread.

JingChaoLiu · 2019-08-09T13:08:40Z

when no GT after cropping (though it rarely happens), just skip any steps involving positive ROIs (bbox regression and mask generation), set the corresponding losses to 0 (just for logging) and not backward them. I guess here is a good position for ignoring these zero losses.

JingChaoLiu · 2019-08-09T13:17:38Z

By the way, all the images in ICDAR 2015 shares a same shape of 1280x720, so as mentioned in the paper, it is recommened to crop image by preserving the aspect ratio.

JingChaoLiu mentioned this issue Aug 14, 2019

Train error #13

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

when I use four rtx2080ti to train the maskrcnn as a baseline,the F-measure is only about 65%,is it normal？ #7

when I use four rtx2080ti to train the maskrcnn as a baseline,the F-measure is only about 65%,is it normal？ #7

kapness commented Jul 18, 2019

JingChaoLiu commented Jul 21, 2019

kapness commented Jul 22, 2019 via email

JingChaoLiu commented Jul 22, 2019 •

edited

Loading

zuokai commented Jul 23, 2019

kapness commented Aug 8, 2019 via email

kapness commented Aug 8, 2019 via email

JingChaoLiu commented Aug 9, 2019

JingChaoLiu commented Aug 9, 2019

when I use four rtx2080ti to train the maskrcnn as a baseline,the F-measure is only about 65%,is it normal？ #7

when I use four rtx2080ti to train the maskrcnn as a baseline,the F-measure is only about 65%,is it normal？ #7

Comments

kapness commented Jul 18, 2019

JingChaoLiu commented Jul 21, 2019

kapness commented Jul 22, 2019 via email

JingChaoLiu commented Jul 22, 2019 • edited Loading

zuokai commented Jul 23, 2019

kapness commented Aug 8, 2019 via email

kapness commented Aug 8, 2019 via email

JingChaoLiu commented Aug 9, 2019

JingChaoLiu commented Aug 9, 2019

JingChaoLiu commented Jul 22, 2019 •

edited

Loading