April 24, 2024

Characterizing Power Management Opportunities for LLMs in the Cloud

Key Points

Key points are not available for this paper at this time.

Abstract

Recent innovation in large language models (LLMs), and their myriad use cases have rapidly driven up the compute demand for datacenter GPUs. Several cloud providers and other enterprises plan to substantially grow their datacenter capacity to support these new workloads. A key bottleneck resource in datacenters is power, which LLMs are quickly saturating due to their rapidly increasing model sizes.

Perguntar à IA

Bookmark