On Convergence and Generalization of Dropout Training

Mianjy, Poorya; Arora, Raman

On Convergence and Generalization of Dropout Training

Poorya Mianjy, Raman Arora

Advances in Neural Information Processing Systems 33 (NeurIPS 2020)

AuthorFeedback Bibtex MetaReview Paper Review Supplemental

Abstract

We study dropout in two-layer neural networks with rectified linear unit (ReLU) activations. Under mild overparametrization and assuming that the limiting kernel can separate the data distribution with a positive margin, we show that the dropout training with logistic loss achieves $\epsilon$-suboptimality in the test error in $O(1/\epsilon)$ iterations.

Abstract

Name Change Policy