Los puntos clave no están disponibles para este artículo en este momento.
Deep neural networks often suffer from poor performance or even training failure due to the ill-conditioned problem, the vanishing/exploding gradient problem, and the saddle point problem. In this article, a novel method by acting the gradient activation function (GAF) on the gradient is proposed to handle these challenges. Intuitively, the GAF enlarges the tiny gradients and restricts the large gradient. Theoretically, this article gives conditions that the GAF needs to meet and, on this basis, proves that the GAF alleviates the problems mentioned above. In addition, this article proves that the convergence rate of SGD with the GAF is faster than that without the GAF under some assumptions. Furthermore, experiments on CIFAR, ImageNet, and PASCAL visual object classes confirm the GAF's effectiveness. The experimental results also demonstrate that the proposed method is able to be adopted in various deep neural networks to improve their performance. The source code is publicly available at https://github.com/LongJin-lab/Activated-Gradients-for-Deep-Neural-Networks.
Building similarity graph...
Analyzing shared references across papers
Loading...
Liu et al. (Wed,) studied this question.
synapsesocial.com/papers/69dd3fef7808b00a4799bbcc — DOI: https://doi.org/10.1109/tnnls.2021.3106044
Mei Liu
Shanghai University
Liangming Chen
Lanzhou University
Xiaohao Du
Lanzhou University
IEEE Transactions on Neural Networks and Learning Systems
University of Chinese Academy of Sciences
Lanzhou University
Chongqing Institute of Green and Intelligent Technology
Building similarity graph...
Analyzing shared references across papers
Loading...
Synapse has enriched 4 closely related papers on similar clinical questions. Consider them for comparative context: