March 8, 2017Open Access

Nearly-tight VC-dimension and pseudodimension bounds for piecewise linear neural networks

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

We prove new upper and lower bounds on the VC-dimension of deep neural networks with the ReLU activation function. These bounds are tight for almost the entire range of parameters. Letting W be the number of weights and L be the number of layers, we prove that the VC-dimension is O (W L (W) ), and provide examples with VC-dimension Ω (W L (W/L) ). This improves both the previously known upper bounds and lower bounds. In terms of the number U of non-linear units, we prove a tight bound Θ (W U) on the VC-dimension. All of these bounds generalize to arbitrary piecewise linear activation functions, and also hold for the pseudodimensions of these function classes. Combined with previous results, this gives an intriguing range of dependencies of the VC-dimension on depth for networks with different non-linearities: there is no dependence for piecewise-constant, linear dependence for piecewise-linear, and no more than quadratic dependence for general piecewise-polynomial.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Bartlett et al. (Wed,) studied this question.

synapsesocial.com/papers/6a0f91c101be78fe815fd63d — DOI: https://doi.org/10.48550/arxiv.1703.02930

Authors

Peter L. Bartlett

Australian National University

Nick Harvey

Christopher Liaw

Google (United States)

Journals

Journal of Machine Learning Research

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Nearly-tight VC-dimension and pseudodimension bounds for piecewise linear neural networks

Puntos clave

Resumen

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Journals

Actions

References and Citations

Citation Network

Connected Papers

Discussion