Rate of convergence of an over-parametrized convolutional neural network image classifier learned by gradient descent
Michael Kohler, Technical University of Darmstadt, Germany; Adam Krzyzak, Concordia University, Canada; Benjamin Walter, Technical University of Darmstadt, Germany
Session:
Classification and Regression
Track:
8: Machine Learning
Location:
Ypsilon I-II-III
Presentation Time:
Mon, 8 Jul, 15:35 - 15:55
Session Chair:
Adam Krzyzak, Concordia University
Abstract
Image classifiers based on over-parametrized deep convolutional neural networks with an average-pooling are proposed. The weights of the network are learned by gradient descent. We present the bound on the rate of convergence of the difference between the expected misclassification risk of the plug-in classifier and the Bayes risk. The obtained rate of convergence is independent of image dimension under appropriate constraints on the image distribution.