Implicit Regularisation, Large Stepsizes and Edge of Stability for (S)GD over Diagonal Linear Networks

Publication
Neural Information Processing Systems (NeurIPS), 2023
Mathieu Even
Postdoc at Inria Montpellier