May 18, 2021 · Open Access

Warm Starting CMA-ES for Hyperparameter Optimization

Abstract

Hyperparameter optimization (HPO), formulated as black-box optimization (BBO), is recognized as essential for the automation and high performance of machine learning approaches. The CMA-ES is a promising BBO approach with a high degree of parallelism; it has been applied to HPO tasks, often in parallel implementations, and has shown superior performance to other approaches, including Bayesian optimization (BO). However, if the budget of hyperparameter evaluations is severely limited, which is often the case for end users who do not have access to parallel computing, the CMA-ES exhausts the budget without improving performance because of its long adaptation phase, and is consequently outperformed by BO approaches. To address this issue, we propose transferring prior knowledge from similar HPO tasks through the initialization of the CMA-ES, which significantly shortens the adaptation time. The knowledge transfer is designed around a novel definition of task similarity, and the correlation between this similarity and the performance of the proposed approach is confirmed on synthetic problems. The proposed warm-starting CMA-ES, called WS-CMA-ES, is applied to different HPO tasks where some prior knowledge is available, and shows superior performance over the original CMA-ES as well as over BO approaches with or without the prior knowledge.
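
The abstract does not spell out how the initialization is computed, so the following is only an illustrative sketch of the general idea of warm starting: fit an initial Gaussian (mean and covariance) to the best solutions found on a previous, similar task, and start the search from that distribution. The function name `warm_start_gaussian` and the parameters `gamma` (fraction of top source solutions kept) and `alpha` (isotropic spread added around each kept solution) are assumptions made for this example, not values or names taken from this page.

```python
import numpy as np


def warm_start_gaussian(source_solutions, gamma=0.1, alpha=0.1):
    """Estimate an initial search distribution from a previous task.

    source_solutions: list of (x, value) pairs, lower value is better.
    gamma and alpha are hypothetical knobs chosen for illustration.
    """
    # Rank the source-task solutions from best to worst.
    ranked = sorted(source_solutions, key=lambda pair: pair[1])
    n_top = max(1, int(np.floor(gamma * len(ranked))))
    top = np.array([x for x, _ in ranked[:n_top]], dtype=float)

    # Initial mean: centroid of the best source solutions.
    mean = top.mean(axis=0)

    # Initial covariance: spread of the best solutions around the mean,
    # plus alpha^2 * I so the distribution never collapses to a point.
    centered = top - mean
    dim = top.shape[1]
    cov = alpha**2 * np.eye(dim) + centered.T @ centered / n_top
    return mean, cov


# Toy source task: a quadratic whose optimum is at (0.6, 0.6).
rng = np.random.default_rng(0)
xs = rng.uniform(0.0, 1.0, size=(200, 2))
values = np.sum((xs - 0.6) ** 2, axis=1)
mean, cov = warm_start_gaussian(list(zip(xs, values)), gamma=0.1, alpha=0.1)
```

The resulting `mean` and `cov` would then replace the default initial mean and covariance of a CMA-ES run on the new task, so that the first generations already sample near the region that was promising on the source task.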

Authors

Masahiro Nomura

Tokyo Institute of Technology

Shuhei Watanabe

Preferred Networks (Japan)

Youhei Akimoto

University of Tsukuba

Institutions

University of Freiburg

University of Tsukuba

RIKEN Center for Advanced Intelligence Project

Cite this study

Nomura et al. (2021) studied this question.

synapsesocial.com/papers/6a2207be505988242b4938b2 — DOI: https://doi.org/10.1609/aaai.v35i10.17109
