Los puntos clave no están disponibles para este artículo en este momento.
Computation-in-memory (CIM) is a promising avenue to improve the energy efficiency of multiply-and-accumulate (MAC) operations in AI chips. Multi-bit CNNs are required for high-inference accuracy in many applications 1–5. There are challenges and tradeoffs for SRAM-based CIM: (1) tradeoffs between signal margin, cell stability and area overhead; (2) the high-weighted bit process variation dominates the end-result error rate; (3) trade-off between input bandwidth, speed and area. Previous SRAM CIM macros were limited to binary MAC operations for fully connected networks 1, or they used CIM for multiplication 2 or weight-combination operations 3 with additional large-area near-memory computing (NMC) logic for summation or MAC operations.
Si et al. (Fri,) studied this question.