April 2, 2014Open Access

Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

We present techniques for speeding up the test-time evaluation of large convolutional networks, designed for object recognition tasks. These models deliver impressive accuracy but each image evaluation requires millions of floating point operations, making their deployment on smartphones and Internet-scale clusters problematic. The computation is dominated by the convolution operations in the lower layers of the model. We exploit the linear structure present within the convolutional filters to derive approximations that significantly reduce the required computation. Using large state-of-the-art models, we demonstrate we demonstrate speedups of convolutional layers on both CPU and GPU by a factor of 2x, while keeping the accuracy within 1% of the original model.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Emily Denton

Google (United States)

Wojciech Zaremba

Supélec

Joan Bruna

Courant Institute of Mathematical Sciences

Actions

Institutions

New York University

Courant Institute of Mathematical Sciences

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation

Puntos clave

Resumen

Citation Network

Connected Papers

Discussion

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study