Abstract
We demonstrate that the dynamics of neural networks (NNs) trained with gradient descent and the dynamics of scalar fields in a flat, vacuum energy dominated Universe are structurally profoundly related. This duality provides the framework for synergies between these systems, to understand and explain NN dynamics and new ways of simulating and describing early Universe models. Working in the continuous-time limit of NNs, we analytically match the dynamics of the mean background and the dynamics of small perturbations around the mean field, highlighting potential differences in separate limits. We perform empirical tests of this analytic description and quantitatively show the dependence of the effective field theory parameters on hyperparameters of the NN. As a result of this duality, the cosmological constant is matched inversely to the learning rate in the gradient descent update.
Dokumententyp: | Zeitschriftenartikel |
---|---|
Fakultät: | Physik |
Themengebiete: | 500 Naturwissenschaften und Mathematik > 530 Physik |
URN: | urn:nbn:de:bvb:19-epub-93789-4 |
ISSN: | 2632-2153 |
Sprache: | Englisch |
Dokumenten ID: | 93789 |
Datum der Veröffentlichung auf Open Access LMU: | 28. Nov. 2022, 07:19 |
Letzte Änderungen: | 04. Jan. 2024, 11:01 |
DFG: | Gefördert durch die Deutsche Forschungsgemeinschaft (DFG) - 491502892 |