New perspective on the convergence to a global solution of finite-sum optimization

Nguyen, Lam M.; Tran, Trang; van Dijk, Marten

Deep neural networks have shown great success in many machine learning tasks. Their training is challenging since the loss surface of the network architecture is generally non-convex, or even non-smooth. We propose a reformulation of the minimization problem allowing for a new recursive algorithmic framework. By using bounded style assumptions, we prove convergence to an \epsilon-(global) minimum using O(1/\epsilon^3) gradient computations. Our theoretical foundation motivates further study, implementation, and optimization of the new algorithmic framework and further investigation of its non-standard bounded style assumptions.
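For readers unfamiliar with the terminology, the standard finite-sum setting that the title refers to can be sketched as follows. The symbols w, d, n, f_i, F and F_* are notational assumptions introduced here for illustration and may differ from those used in the work itself; the reformulation and the bounded style assumptions mentioned in the abstract are not reproduced.

```latex
% Standard finite-sum minimization problem (notation assumed for illustration)
\min_{w \in \mathbb{R}^d} \; F(w) = \frac{1}{n} \sum_{i=1}^{n} f_i(w)

% A point \hat{w} is commonly called an \epsilon-global minimum when its
% objective value is within \epsilon of the infimum F_* of F
F(\hat{w}) - F_* \le \epsilon, \qquad F_* = \inf_{w \in \mathbb{R}^d} F(w)

% Complexity stated in the abstract: the total number of gradient
% computations needed to reach such a point is of order
\mathcal{O}\!\left( \epsilon^{-3} \right)
```

Under this reading, each f_i would typically be the loss of the network on the i-th training example, and the O(1/\epsilon^3) figure counts evaluations of the individual gradients of the f_i.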

Additional Metadata
Stakeholder	IBM Research, Thomas J. Watson Research Center, USA
Organisation	Computer Security
Citation	Nguyen, L., Tran, T., & van Dijk, M. (2022). New perspective on the convergence to a global solution of finite-sum optimization. In INFORMS Annual Meeting.

