[2402.06184] The boundary of neural network trainability is fractal

source link: https://arxiv.org/abs/2402.06184

Computer Science > Machine Learning

[Submitted on 9 Feb 2024]

The boundary of neural network trainability is fractal


Some fractals -- for instance those associated with the Mandelbrot and quadratic Julia sets -- are computed by iterating a function, and identifying the boundary between hyperparameters for which the resulting series diverges or remains bounded. Neural network training similarly involves iterating an update function (e.g. repeated steps of gradient descent), can result in convergent or divergent behavior, and can be extremely sensitive to small changes in hyperparameters. Motivated by these similarities, we experimentally examine the boundary between neural network hyperparameters that lead to stable and divergent training. We find that this boundary is fractal over more than ten decades of scale in all tested configurations.
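To make the analogy in the abstract concrete, below is a minimal illustrative sketch (not the paper's code) of an escape-time-style experiment: a tiny network is trained with full-batch gradient descent for each point on a 2D grid of hyperparameters (here, per-layer learning rates), and each point is colored by how quickly the loss blows up. The network size, toy data, grid ranges, step budget, and blow-up threshold are all assumptions chosen for illustration only.

```python
# Illustrative sketch, assuming a 1-hidden-layer tanh regression network and
# two per-layer learning rates as the image axes. Not the paper's code.
import numpy as np

def train_escape_time(lr0, lr1, n_steps=200, blowup=1e6, seed=0):
    """Train a tiny tanh network on a fixed toy regression problem; return
    the step at which the loss first exceeds `blowup`, or n_steps if
    training stays bounded."""
    rng = np.random.default_rng(seed)
    x = rng.normal(size=(16, 1))           # toy inputs
    y = np.sin(2.0 * x)                    # toy targets
    w0 = rng.normal(size=(1, 4))           # input -> hidden weights
    w1 = rng.normal(size=(4, 1))           # hidden -> output weights
    for step in range(n_steps):
        h = np.tanh(x @ w0)                # hidden activations
        pred = h @ w1
        err = pred - y
        loss = np.mean(err ** 2)
        if not np.isfinite(loss) or loss > blowup:
            return step                    # diverged at this step
        # Backprop for the mean-squared-error loss.
        g_pred = 2.0 * err / len(x)
        g_w1 = h.T @ g_pred
        g_h = g_pred @ w1.T
        g_w0 = x.T @ (g_h * (1.0 - h ** 2))
        # Separate learning rates per layer: the two axes of the image.
        w0 -= lr0 * g_w0
        w1 -= lr1 * g_w1
    return n_steps                         # remained bounded

# Sweep a grid of (lr0, lr1) pairs; plotting `image` gives the kind of
# escape-time picture in which the paper reports fractal structure.
lrs = np.linspace(0.01, 5.0, 64)
image = np.array([[train_escape_time(a, b) for b in lrs] for a in lrs])
```

Zooming into the boundary between bounded and divergent regions of such an image (and re-running the sweep at finer resolution) is what reveals the fractal structure the paper reports over many decades of scale.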
Comments: 3 pages, mesmerizing fractals
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Chaotic Dynamics (nlin.CD)
Cite as: arXiv:2402.06184 [cs.LG]
  (or arXiv:2402.06184v1 [cs.LG] for this version)
  https://doi.org/10.48550/arXiv.2402.06184

Submission history

From: Jascha Sohl-Dickstein
[v1] Fri, 9 Feb 2024 04:46:48 UTC (36,948 KB)
