"(S)GD over Diagonal Linear Networks: Implicit Regularisation, Large ..."
Mathieu Even et al. (2023)
- Mathieu Even, Scott Pesme, Suriya Gunasekar, Nicolas Flammarion:
(S)GD over Diagonal Linear Networks: Implicit Regularisation, Large Stepsizes and Edge of Stability. CoRR abs/2302.08982 (2023)