Key points are not available for this paper at this time.
Machine learning, especially deep neural networks, has developed rapidly in fields, including computer vision, speech recognition, and reinforcement learning. Although minibatch stochastic gradient descent (SGD) is one of the most popular stochastic optimization methods for training deep networks, it shows a slow convergence rate due to the large noise in the gradient approximation. In this article, we attempt to remedy this problem by building a more efficient batch selection method based on typicality sampling, which reduces the error of gradient estimation in conventional minibatch SGD. We analyze the convergence rate of the resulting typical batch SGD algorithm and compare the convergence properties between the minibatch SGD and the algorithm. Experimental results demonstrate that our batch selection scheme works well and more complex minibatch SGD variants can benefit from the proposed batch selection strategy.
Building similarity graph...
Analyzing shared references across papers
Loading...
Xinyu Peng
China Jiliang University
Li Li
Ningbo University
Fei‐Yue Wang
Chinese Academy of Sciences
IEEE Transactions on Neural Networks and Learning Systems
Chinese Academy of Sciences
Tsinghua University
Shandong Institute of Automation
Building similarity graph...
Analyzing shared references across papers
Loading...
Peng et al. (Tue,) studied this question.
synapsesocial.com/papers/6a1c0700b33628da419d20f5 — DOI: https://doi.org/10.1109/tnnls.2019.2957003
Synapse has enriched 3 closely related papers on similar clinical questions. Consider them for comparative context: