The Stochastic Delta Rule: Faster and More Accurate Deep Learning Through Adaptive Weight Noise
暂无分享,去创建一个
[1] Geoffrey E. Hinton. Reducing the Dimensionality of Data with Neural , 2008 .
[2] Yoshua Bengio,et al. Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.
[3] Zoubin Ghahramani,et al. Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning , 2015, ICML.
[4] Surya Ganguli,et al. Identifying and attacking the saddle point problem in high-dimensional non-convex optimization , 2014, NIPS.
[5] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009 .
[6] Yann LeCun,et al. Regularization of Neural Networks using DropConnect , 2013, ICML.
[7] Geoffrey E. Hinton,et al. Reducing the Dimensionality of Data with Neural Networks , 2006, Science.
[8] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.
[9] Nitish Srivastava,et al. Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.
[10] Kenji Kawaguchi,et al. Deep Learning without Poor Local Minima , 2016, NIPS.
[11] Lorien Y. Pratt,et al. Comparing Biases for Minimal Network Construction with Back-Propagation , 1988, NIPS.
[12] Quoc V. Le,et al. GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism , 2018, ArXiv.
[13] Kilian Q. Weinberger,et al. Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[14] Alex Graves,et al. Practical Variational Inference for Neural Networks , 2011, NIPS.
[15] Benedict Delisle Burns,et al. The uncertain nervous system , 1968 .
[16] Quoc V. Le,et al. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks , 2019, ICML.
[17] Shane Legg,et al. Noisy Networks for Exploration , 2017, ICLR.
[18] L. Pinneo. On noise in the nervous system. , 1966, Psychological review.
[19] Alok Aggarwal,et al. Regularized Evolution for Image Classifier Architecture Search , 2018, AAAI.
[20] Pierre Baldi,et al. Understanding Dropout , 2013, NIPS.
[21] Stephen José Hanson,et al. A stochastic version of the delta rule , 1990 .
[22] Pierre Baldi,et al. The dropout learning algorithm , 2014, Artif. Intell..