Training speed become slower iteration by iteration #10

philokey · 2016-10-19T09:53:40Z

Hello,

When I training the model, the speed of per iteration slow down, eg. in the beginning, the speed is about 0.4s/iter, after 10000 iterations, the speed reduce to about 1s/iter. However, the time of tensorflow session
rpn_loss_cls_value, rpn_loss_box_value,loss_cls_value, loss_box_value, _ = sess.run([rpn_cross_entropy, rpn_loss_box, cross_entropy, loss_box, train_op], feed_dict=feed_dict)
does not increase.

What's more, the CPU time seems much more than the beginning, the usage of GPU is often 0%. Therefore, I suspect that there are something wrong in roi_data_layer which run in CPU.

I have check the code, but I can not find any bug. Has anyone meets this problem and how to solve this problem.

Thank you.

The text was updated successfully, but these errors were encountered:

philokey · 2016-10-19T11:45:52Z

I find the reason.
In training processing, the speed of the following code will slow down (I still do not know why).

if iter >= cfg.TRAIN.STEPSIZE:
     sess.run(tf.assign(lr, cfg.TRAIN.LEARNING_RATE * cfg.TRAIN.GAMMA))
else:
    sess.run(tf.assign(lr, cfg.TRAIN.LEARNING_RATE))

Hence, set the learning rate out of the loop can avoid this problem.

flowice · 2016-11-29T08:59:59Z

@philokey Hi, I also find the problem. What do you mean by "set the learning rate out of the loop can avoid this problem"? I think it is weird. Have you found the reason?

philokey · 2016-11-29T16:26:51Z

@flowice when you set learning rate, it will add a new node in tensorflow's graph, therefore, if you set learning rate in every iteration, it will add many nodes in the graph and become very slow.

flowice · 2016-11-30T02:45:24Z

@philokey Wonderful ! I will have a try. Thank you.

philokey changed the title ~~Is there something wrong in roi_data_layer?~~ Is there anything wrong in roi_data_layer? Oct 19, 2016

philokey changed the title ~~Is there anything wrong in roi_data_layer?~~ Training speed become slower iteration by iteration Oct 19, 2016

philokey mentioned this issue Oct 19, 2016

Update the way of decaying the learning rate #11

Open

smallcorgi closed this as completed Mar 12, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Training speed become slower iteration by iteration #10

Training speed become slower iteration by iteration #10

philokey commented Oct 19, 2016 •

edited

Loading

philokey commented Oct 19, 2016 •

edited

Loading

flowice commented Nov 29, 2016

philokey commented Nov 29, 2016

flowice commented Nov 30, 2016

Training speed become slower iteration by iteration #10

Training speed become slower iteration by iteration #10

Comments

philokey commented Oct 19, 2016 • edited Loading

philokey commented Oct 19, 2016 • edited Loading

flowice commented Nov 29, 2016

philokey commented Nov 29, 2016

flowice commented Nov 30, 2016

philokey commented Oct 19, 2016 •

edited

Loading

philokey commented Oct 19, 2016 •

edited

Loading