Hi,

Great work here! I have a quick question:
Based on my understanding of your code and the MSRA paper, shouldn't your type B projection shortcuts in the full and bottleneck pre-activation models take `bn_pre_relu` as their input, rather than `l` (excluding the first block)?
As the identity mapping paper states, "For the bottleneck ResNets, when reducing the feature map size we use projection shortcuts [1] for increasing dimensions, and when pre-activation is used, these projection shortcuts are also with pre-activation."
Did I misunderstand something? Thanks for your time!
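For reference, here is a minimal Lasagne-style sketch of the two wirings I mean. The block structure, the helper name `preact_residual_block`, and the hyperparameters are my own assumptions for illustration, not code from your repository; only the names `l` and `bn_pre_relu` mirror the ones above.

```python
from lasagne.layers import (Conv2DLayer, BatchNormLayer,
                            NonlinearityLayer, ElemwiseSumLayer)
from lasagne.nonlinearities import rectify
from lasagne.init import HeNormal


def preact_residual_block(l, num_filters, stride=1, projection=False):
    # Shared pre-activation: BN + ReLU applied before the first convolution.
    bn_pre_relu = NonlinearityLayer(BatchNormLayer(l), rectify)

    conv1 = Conv2DLayer(bn_pre_relu, num_filters, (3, 3), stride=stride,
                        pad='same', W=HeNormal(gain='relu'),
                        nonlinearity=None)
    conv2 = Conv2DLayer(NonlinearityLayer(BatchNormLayer(conv1), rectify),
                        num_filters, (3, 3), pad='same',
                        W=HeNormal(gain='relu'), nonlinearity=None)

    if projection:
        # Wiring suggested by the identity-mapping paper: the 1x1 type B
        # projection also receives the pre-activated signal.
        shortcut = Conv2DLayer(bn_pre_relu, num_filters, (1, 1),
                               stride=stride, pad='same', b=None,
                               W=HeNormal(gain='relu'), nonlinearity=None)
        # Wiring I am asking about: projecting from the raw block input.
        # shortcut = Conv2DLayer(l, num_filters, (1, 1), stride=stride,
        #                        pad='same', b=None, nonlinearity=None)
    else:
        # Identity shortcut; valid when stride == 1 and the channel count
        # is unchanged, so shapes match for the elementwise sum.
        shortcut = l

    return ElemwiseSumLayer([conv2, shortcut])
```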