非凸随机优化的随机梯度哈密顿蒙特卡洛的全局收敛性：非渐近性能界与基于动量的加速

Global Convergence of Stochastic Gradient Hamiltonian Monte Carlo for Nonconvex Stochastic Optimization: Nonasymptotic Performance Bounds and Momentum-Based Acceleration

Operations Research · 2021

被引 18

人大 AFT50UTD24ABS 4*

Xuefeng Gao · 香港中文大学
Mert Gürbüzbalaban · 新泽西州立大学
Lingjiong Zhu · 佛罗里达州立大学

中文导读

首次给出了随机梯度哈密顿蒙特卡洛方法在非凸随机优化中的有限时间性能界，证明动量加速在全局非凸优化中是可行的，对解决深度学习等机器学习问题有参考价值。

Abstract

Nonconvex Stochastic Optimization Nonconvex stochastic optimization problems arise in many machine learning problems, including deep learning. The stochastic gradient Hamiltonian Monte Carlo (SGHMC) is a variant of stochastic gradients with a momentum method in which a controlled and properly scaled Gaussian noise is added to the stochastic gradients to steer the iterates toward a global minimum. SGHMC has shown empirical success in practice for solving nonconvex stochastic optimization problems. In “Global convergence of stochastic gradient Hamiltonian Monte Carlo for nonconvex stochastic optimization: Nonasymptotic performance bounds and momentum-based acceleration,” Gao, Gürbüzbalaban, and Zhu provide, for the first time, the finite-time performance bounds for the global convergence of SGHMC in the context of both population and empirical risk minimization problems and show that acceleration with momentum is possible in the context of global nonconvex stochastic optimization.

随机优化蒙特卡洛方法机器学习深度学习

阅读原文 ↗