Problems and opportunities in training deep learning software systems

Key Points

Key points are not available for this paper at this time.

Abstract

Deep learning (DL) training algorithms utilize nondeterminism to improve models' accuracy and training efficiency. Hence, multiple identical training runs (e.g., identical training data, algorithm, and network) produce different models with different accuracies and training times. In addition to these algorithmic factors, DL libraries (e.g., TensorFlow and cuDNN) introduce additional variance (referred to as implementation-level variance) due to parallelism, optimization, and floating-point computation.

Bookmark

Cite This Study

Pham et al. (Mon,) studied this question.

synapsesocial.com/papers/6a0fe6704fb650da4ffea316 https://doi.org/https://doi.org/10.1145/3324884.3416545

Bookmark