Add ScalarLayer to multiply two Blobs with broadcasting #3021
This adds `ScalarLayer`, which takes two Blobs and (in effect) multiplies them elementwise, after broadcasting the axes of the second Blob to match the first as necessary.

For example, if `bottom[0]` has shape `(2, 3, 4, 5)`, `bottom[1]` has shape `(3, 4)`, and `axis == 1`, then the computation of this layer is equivalent to reshaping `bottom[1]` to `(1, 3, 4, 1)`, tiling it to `(2, 3, 4, 5)`, and then multiplying the result elementwise with `bottom[0]`.
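A minimal NumPy sketch (not the layer's actual C++ code) of the forward broadcasting described above, using the example shapes from this description:

```python
import numpy as np

x = np.random.randn(2, 3, 4, 5)   # bottom[0]
s = np.random.randn(3, 4)         # bottom[1], broadcast starting at axis == 1

# Reshape bottom[1] to (1, 3, 4, 1), tile it to (2, 3, 4, 5),
# then multiply the result elementwise with bottom[0].
tiled = np.tile(s.reshape(1, 3, 4, 1), (2, 1, 1, 5))
top = x * tiled

# NumPy's own broadcasting computes the same thing without
# materializing the tiled array.
assert np.allclose(top, x * s.reshape(1, 3, 4, 1))
```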
In the most general case, `Backward` to `bottom[1]` is accomplished with elementwise multiplication followed by 2 `gemv`s. For special cases (when `bottom[1]`'s shape corresponds to the beginning or end of `bottom[0]`'s shape, e.g. if it were instead shape `(2, 3)` and `axis == 0`, or shape `(4, 5)` with `axis == 2`), one or both of the `gemv`s is skipped (or replaced with a dot product).
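A minimal NumPy sketch (again, not the actual `Backward` implementation) of the gradient with respect to `bottom[1]` in the general case, reusing the shapes above; the two axis sums are what the two `gemv` calls against a vector of ones compute:

```python
import numpy as np

x = np.random.randn(2, 3, 4, 5)         # bottom[0]
s = np.random.randn(3, 4)               # bottom[1]
top_diff = np.random.randn(2, 3, 4, 5)  # gradient flowing in from the top

# Elementwise multiplication ...
prod = top_diff * x
# ... followed by summing out the broadcast axes: the leading axis (0) and
# the trailing axis (3). These two reductions correspond to the two gemvs.
s_diff = prod.sum(axis=(0, 3))          # shape (3, 4), matches bottom[1]

# If bottom[1] instead had shape (2, 3) with axis == 0, only the trailing
# reduction is needed; with shape (4, 5) and axis == 2, only the leading one.
assert s_diff.shape == s.shape
```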
My use case for this comes from #2033 -- I am replacing the hacky `coeff_blob` I added to `Eltwise` to perform the binary multiplications with this layer. It could also replace the channel-wise scalar in `PReLU` (I think this backward implementation is faster), or be used to learn a channel-wise scalar after batch normalization.

Thanks to @longjon for the name for this layer and the initial implementation of a previous version.