May 1, 2018

Motivation for and Evaluation of the First Tensor Processing Unit

Key Points

Key points are not available for this paper at this time.

Abstract

The first-generation tensor processing unit (TPU) runs deep neural network (DNN) inference 15-30 times faster with 30-80 times better energy efficiency than contemporary CPUs and GPUs in similar semiconductor technologies. This domain-specific architecture (DSA) is a custom chip that has been deployed in Google datacenters since 2015, where it serves billions of people.

Bookmark

Motivation for and Evaluation of the First Tensor Processing Unit

Key Points

Abstract

Cite This Study

Also Consider

Also Consider