Abstrakti
Automatic variational inference has recently become feasible as a scalable inference tool for probabilistic programming. The state-of-the-art algorithms are stochastic in two respects: they use stochastic gradient descent to optimize an expectation that is estimated with stochastic approximation. The core computation of such algorithms involves evaluating the loss and its automatically differentiated gradient for random parameters sampled from the approximation. We study ways of re-using some of the gradient computations during optimization, to speed up learning for large-scale applications. We present a stochastic average gradient algorithm that uses gradients computed for past mini-batches to reduce noise, and explain how importance sampling allows re-using gradients computed for data samples visited again during the optimization loop.
| Alkuperäiskieli | englanti |
|---|---|
| Otsikko | Advances in Approximate Bayesian Inference; NIPS 2016 Workshop; December 9, 2016 |
| Julkaisupäivä | 9 jouluk. 2016 |
| Tila | Julkaistu - 9 jouluk. 2016 |
| OKM-julkaisutyyppi | A4 Artikkeli konferenssijulkaisuussa |
| Tapahtuma | NIPS 2016 Workshop; December 9, 2016 - Barcelona, Espanja Kesto: 1 tammik. 1800 → … |
Tieteenalat
- 113 Tietojenkäsittely- ja informaatiotieteet
Siteeraa tätä
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver