Normalize the mean and scale of the per-channel activations of the previous layer, typically before another nonlinear activation.

<~/.org/references.bib>