Optimization

  • Understanding the difficulty of training deep feedforward neural networks (2010)

  • On the difficulty of training Recurrent Neural Networks (2012. 11)

  • Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification (2015. 2)

  • A Simple Way to Initialize Recurrent Networks of Rectified Linear Units (2015. 4)

  • Cyclical Learning Rates for Training Neural Networks (2015. 6)

  • On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima (2016. 9)

  • Neural Optimizer Search with Reinforcement Learning (2017. 9)

  • On the Convergence of Adam and Beyond (2018. 2)

  • Adafactor: Adaptive Learning Rates with Sublinear Memory Cost (2018. 4)

  • Revisiting Small Batch Training for Deep Neural Networks (2018. 4)

Last updated