Abstract
Using the statistical physics framework, we study the online learning dynamics in a particular case of shallow feed-forward neural networks with ReLU activation. By expanding the activation function in terms of Hermite polynomials we derive analytical results for the evolution of order parameters for any learning rate. Moreover, we compare our results with online gradient descent simulations and show how our method describes the typical learning curves. We also present results on how the learning rate affects the overall behavior of the network and its equilibria,
showing different learning regimes and critical values of the learning rate.
showing different learning regimes and critical values of the learning rate.
Original language | English |
---|---|
Title of host publication | European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning |
Subtitle of host publication | 33rd ESANN 2025 |
Editors | Michel Verleysen |
Publisher | Ciaco - i6doc.com |
Pages | 437-442 |
Number of pages | 6 |
DOIs | |
Publication status | Published - Apr-2025 |
Event | European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning - Brugge, Belgium Duration: 23-Apr-2025 → 25-Apr-2025 Conference number: 35 https://www.esann.org |
Conference
Conference | European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning |
---|---|
Abbreviated title | ESANN 2025 |
Country/Territory | Belgium |
City | Brugge |
Period | 23/04/2025 → 25/04/2025 |
Internet address |