Using curvature information for fast stochastic search

Genevieve B. Orr, Todd K. Leen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

24 Scopus citations

Abstract

We present an algorithm for fast stochastic gradient descent that uses a nonlinear adaptive momentum scheme to optimize the late time convergence rate. The algorithm makes effective use of curvature information, requires only O(n) storage and computation, and delivers convergence rates close to the theoretical optimum. We demonstrate the technique on linear and large nonlinear back-prop networks.

Original languageEnglish (US)
Title of host publicationAdvances in Neural Information Processing Systems 9 - Proceedings of the 1996 Conference, NIPS 1996
PublisherNeural information processing systems foundation
Pages606-612
Number of pages7
ISBN (Print)0262100657, 9780262100656
StatePublished - 1997
Event10th Annual Conference on Neural Information Processing Systems, NIPS 1996 - Denver, CO, United States
Duration: Dec 2 1996Dec 5 1996

Publication series

NameAdvances in Neural Information Processing Systems
ISSN (Print)1049-5258

Other

Other10th Annual Conference on Neural Information Processing Systems, NIPS 1996
Country/TerritoryUnited States
CityDenver, CO
Period12/2/9612/5/96

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Information Systems
  • Signal Processing

Fingerprint

Dive into the research topics of 'Using curvature information for fast stochastic search'. Together they form a unique fingerprint.

Cite this