Whitening and Second Order Optimization Both Make Information in the Dataset Unusable During Training, and Can Reduce or Prevent Generalization.
Neha S. WadiaDaniel DuckworthSamuel S. SchoenholzEthan DyerJascha Sohl-DicksteinPublished in: ICML (2021)