December 1, 2015

Deep Fried Convnets

Key Points

Key points are not available for this paper at this time.

Abstract

The fully-connected layers of deep convolutional neural networks typically contain over 90% of the network parameters. Reducing the number of parameters while preserving predictive performance is critically important for training big models in distributed systems and for deployment in embedded devices. In this paper, we introduce a novel Adaptive Fastfood transform to reparameterize the matrix-vector multiplication of fully connected layers. Reparameterizing a fully connected layer with d inputs and n outputs with the Adaptive Fastfood transform reduces the storage and computational costs costs from O(nd) to O(n) and O(n log d) respectively. Using the Adaptive Fastfood transform in convolutional networks results in what we call a deep fried convnet. These convnets are end-to-end trainable, and enable us to attain substantial reductions in the number of parameters without affecting prediction accuracy on the MNIST and ImageNet datasets.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Zichao Yang

Ocean University of China

Marcin Moczulski

University of Michigan

Misha Denil

Google (United States)

Actions

Institutions

University of Oxford

Carnegie Mellon University

Georgia Institute of Technology

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Deep Fried Convnets

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study