The Separation Capacity of Random Neural Networks

Dirksen, Sjoerd; Genzel, Martin; Jacques, Laurent; Stollenwerk, Alexander

doi:https://doi.org/10.48550/arXiv.2108.00207

The Separation Capacity of Random Neural Networks

Files

2108.00207v1.pdf (1.87 MB)

Publication date

2021-07-31

Authors

Dirksen, Sjoerd

Genzel, Martin

Jacques, Laurent

Stollenwerk, Alexander

DOI

https://doi.org/10.48550/arXiv.2108.00207

Document Type

/dk/atira/pure/researchoutput/researchoutputtypes/workingpaper/preprint

Metadata

Show full item record

Collections

Utrecht University Repository

License

cc_by

Abstract

Neural networks with random weights appear in a variety of machine learning applications, most prominently as the initialization of many deep learning algorithms and as a computationally cheap alternative to fully learned neural networks. In the present article we enhance the theoretical understanding of random neural nets by addressing the following data separation problem: under what conditions can a random neural network make two classes X−,X+⊂Rd (with positive distance) linearly separable? We show that a sufficiently large two-layer ReLU-network with standard Gaussian weights and uniformly distributed biases can solve this problem with high probability. Crucially, the number of required neurons is explicitly linked to geometric properties of the underlying sets X−,X+ and their mutual arrangement. This instance-specific viewpoint allows us to overcome the usual curse of dimensionality (exponential width of the layers) in non-pathological situations where the data carries low-complexity structure. We quantify the relevant structure of the data in terms of a novel notion of mutual complexity (based on a localized version of Gaussian mean width), which leads to sound and informative separation guarantees. We connect our result with related lines of work on approximation, memorization, and generalization.

Keywords

cs.LG, math.ST, stat.TH, Random neural networks, classification, hyperplane separation, high-dimensional geometry, Gaussian mean width

Citation

Dirksen, S, Genzel, M, Jacques, L & Stollenwerk, A 2021 'The Separation Capacity of Random Neural Networks' arXiv, pp. 1-34. https://doi.org/10.48550/arXiv.2108.00207

URI

https://dspace.library.uu.nl/handle/1874/415129

The Separation Capacity of Random Neural Networks

Files

Publication date

Authors

Editors

Advisors

Supervisors

DOI

Document Type

Metadata

Collections

License

Abstract

Keywords

Citation

URI