Batch normalization

Riccardo Viviano edited this page Apr 27, 2021 · 1 revision

Don't use it. Use adaptive gradient clipping plus a regularization technique instead.
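
To make the suggestion concrete, here is a minimal sketch of adaptive gradient clipping. It assumes the unit-wise variant, where each output unit's gradient is rescaled whenever its norm exceeds a fraction of that unit's weight norm. The function name and the `clip` and `eps` defaults are illustrative assumptions, not this library's API.

```python
import math


def adaptive_gradient_clip(weights, grads, clip=0.01, eps=1e-3):
    """Unit-wise adaptive gradient clipping (illustrative sketch).

    weights, grads: lists of rows, one row per output unit.
    clip: maximum allowed ratio ||g_row|| / ||w_row|| (assumed value).
    eps: floor on the weight norm so zero-initialized units can still learn.
    Returns a new list of gradient rows; inputs are not modified.
    """
    clipped = []
    for w_row, g_row in zip(weights, grads):
        # Norm of this unit's weights, floored at eps.
        w_norm = max(math.sqrt(sum(w * w for w in w_row)), eps)
        # Norm of this unit's gradient.
        g_norm = math.sqrt(sum(g * g for g in g_row))
        max_norm = clip * w_norm
        if g_norm > max_norm:
            # Rescale so the gradient norm equals clip * weight norm.
            scale = max_norm / g_norm
            clipped.append([g * scale for g in g_row])
        else:
            # Gradient is small relative to the weights: leave it unchanged.
            clipped.append(list(g_row))
    return clipped


if __name__ == "__main__":
    weights = [[3.0, 4.0], [1.0, 0.0]]
    grads = [[0.3, 0.4], [0.001, 0.0]]
    print(adaptive_gradient_clip(weights, grads))
```

The first row's gradient is too large relative to its weights and gets scaled down, while the second row passes through untouched. The regularization technique to pair with it (for example dropout or weight decay) is left open here, since the note does not name one.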