
Load weights from multiple caffemodels. #1456

Conversation

jyegerlehner
Contributor

At least one use case requiring this is doing layerwise or "stacked" autoencoder training: First I train the newly-added encoder and decoder layers by themselves (using features extracted from the net having only the previously-trained layers). Then when I begin to train the combined network, it needs to pull weights from two different caffemodel files. So this change allows the --weights parameter to be a comma-separated list of caffemodels instead of just a single caffemodel.

The other code change is that the test nets are also initialized from the provided caffemodels, not just the train net. So if the trained net is a subset of the test net, then some of the test nets' layers' weights would be uninitialized, whereas with this change they are initialized from the specified models.
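With this change, the flag would be invoked like this (the model filenames here are illustrative, not from the patch):

```shell
# Initialize both the train net and the test nets from two pretrained
# models; layers are matched by name across the listed caffemodels.
caffe train --solver solver.prototxt \
    --weights pretrained_encoder.caffemodel,new_decoder.caffemodel
```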

@shelhamer
Member

This is a helpful generalization for certain uses. Note that this can currently be done by loading several models in Python and assigning weights between them, as in net surgery, but making the caffe tool understand multi-model weight loading could be a nice convenience.

The level and stage rules for layer inclusion / exclusion are helpful for layer-wise learning and model variations too.

@shelhamer
Member

With the advances of pycaffe one can copy weights from several models by Net.copy_from.

I like preparing the nets through Python for its generality, but copying weights from multiple nets could be a useful special case. However, I'm inclined to keep the caffe tool simple -- thoughts @longjon @jeffdonahue?
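The effect of chaining `Net.copy_from` once per model can be sketched without caffe itself (plain Python; layer parameters are modeled as a dict and the helper name is hypothetical):

```python
def copy_from_many(net_params, *model_params):
    """Mimic calling Net.copy_from once per pretrained model:
    each model overwrites the layers it shares with the net, applied
    in order, so later models take precedence on name collisions."""
    for model in model_params:
        for layer_name, blobs in model.items():
            # Only layers whose names match the target net are copied;
            # layers absent from the net are silently ignored.
            if layer_name in net_params:
                net_params[layer_name] = blobs
    return net_params
```

This is just the name-matching semantics; real `copy_from` copies blob data layer by layer rather than rebinding dict entries.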

@jeffdonahue
Contributor

I think it's useful and non-intrusive. I'm not a huge fan of the interface (commas in a flag argument) but I can't think of anything better (gflags doesn't let you specify the same flag multiple times and give you a vector<string>, does it?). I did this at one point by adding a repeated weights field to SolverParameter, but SolverParameter isn't a very good place for it...
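Since gflags delivers the flag as a single string, the comma-splitting itself is trivial; a plain-Python stand-in for the C++ flag handling (function name illustrative) might look like:

```python
def parse_weights_flag(flag_value):
    """Split a comma-separated --weights value into individual
    caffemodel paths, dropping empty entries left by stray commas."""
    return [path for path in flag_value.split(",") if path]
```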

shelhamer added a commit that referenced this pull request Mar 8, 2015
…-caffemodels

Load weights from multiple models by listing comma separated caffemodels
as the `-weights` arg to the caffe command.
@shelhamer
Member

@jyegerlehner thanks for the convenient multi-model fine-tuning initialization. I merged this to master in a9bf7b9 (and collapsed this to a single commit).
