梯度下降法
CSC321 Winter 2014: lecture notes (toronto.edu)
[1212.5701] ADADELTA: An Adaptive Learning Rate Method (arxiv.org)
cs229.stanford.edu/proj2015/054_report.pdf
[1212.5701] ADADELTA: An Adaptive Learning Rate Method (arxiv.org)
本博客所有文章除特别声明外,均采用 CC BY-NC-SA 4.0 许可协议。转载请注明来自 Obito Blog!
评论