March 3, 2024Open Access

Automatic Static Analysis-Guided Optimization of CUDA Kernels

Key Points

Key points are not available for this paper at this time.

Abstract

We propose a framework for using static resource analysis to guide the automatic optimization of general-purpose GPU (GPGPU) kernels written in CUDA, NVIDIA's framework for GPGPU programming. In our proposed framework, optimizations are applied to the kernel and candidate kernels are evaluated for performance by running a static analysis that predicts the execution cost of GPU kernels. The use of static analysis, in contrast to many existing frameworks for performance tuning GPU kernels, lends itself to high-level, hardware-independent optimizations that can be of particular benefit to novice programmers unfamiliar with CUDA's performance pitfalls. As a proof of concept, we have implemented two example optimizations and a simple search strategy in a tool called COpPER (CUDA Optimization through Programmatic Estimation of Resources), which makes use of a static resource analysis tool for CUDA from prior work. The prototype tool automatically improves the performance of sample kernels by 2-4% in initial experiments, and demonstrates the feasibility of using static analysis as part of automated performance tuning for GPU kernels.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Lou et al. (Sun,) studied this question.

www.synapsesocial.com/papers/68e75efdb6db6435876d6169 — DOI: https://doi.org/10.1145/3649169.3649249

Authors

Mark Lou

Stefan K. Muller

Actions

Institutions

Illinois Institute of Technology

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Automatic Static Analysis-Guided Optimization of CUDA Kernels

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion