October 1, 2016

Stripes: Bit-serial deep neural network computing

Key Points

Key points are not available for this paper at this time.

Abstract

Motivated by the variance in the numerical precision requirements of Deep Neural Networks (DNNs) 1, 2, Stripes (STR), a hardware accelerator is presented whose execution time scales almost proportionally with the length of the numerical representation used. STR relies on bit-serial compute units and on the parallelism that is naturally present within DNNs to improve performance and energy with no accuracy loss. In addition, STR provides a new degree of adaptivity enabling on-the-fly trade-offs among accuracy, performance, and energy. Experimental measurements over a set of DNNs for image classification show that STR improves performance over a state-of-the-art accelerator 3 from 1.30x to 4.51x and by 1.92x on average with no accuracy loss. STR is 57% more energy efficient than the baseline at a cost of 32% additional area. Additionally, by enabling configurable, per-layer and per-bit precision control, STR allows the user to trade accuracy for further speedup and energy efficiency.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Institutions

University of Toronto

University of British Columbia

References and Citations

Add This Paper to Your Research Feed

Any time a new paper drops it will be there.