Sign in

Batch Normalization Is Blind to the First and Second Derivatives of the Loss.

Zhanpeng ZhouWen ShenHuixin ChenLing TangQuanshi Zhang
Published in: CoRR (2022)
Keyphrases
  • higher order
  • preprocessing
  • machine learning
  • steady state
  • real world
  • data mining
  • video sequences
  • worst case
  • information loss
  • critical points
  • batch mode
  • normalization method
  • batch learning
  • quasi invariant