"Entropy-SGD: Biasing Gradient Descent Into Wide Valleys."

Pratik Chaudhari et al. (2016)
a service of Schloss Dagstuhl - Leibniz Center for Informatics