Learning to Compose Words into Sentences with Reinforcement Learning
Authors:
D Yogatama,
P Blunsom,
C Dyer,
E Grefenstette,
W Ling
arXiv 2016
Sample Efficient Actor-Critic with Experience Replay
Authors:
Z Wang,
V Bapst,
N Heess,
V Mnih,
R Munos,
K Kavukcuoglu,
N de Freitas
arXiv 2016
Local minima in training of deep networks
Authors:
G Swirszcz,
W M Czarnecki,
R Pascanu
ICLR 2017
Reinforcement Learning with Unsupervised Auxiliary Tasks
Authors:
M Jaderberg,
V Mnih,
W M Czarnecki,
T Schaul,
J Z Leibo,
D Silver,
K Kavukcuoglu
arXiv 2016
PGQ: Combining policy gradient and Q-learning
Authors:
B O'Donoghue,
R Munos,
K Kavukcuoglu,
V Mnih
arXiv 2016
Learning to Navigate in Complex Environments
Authors:
P Mirowski,
R Pascanu,
F Viola,
H Soyer,
A Ballard,
A Banino,
M Denil,
R Goroshin,
L Sifre,
K Kavukcuoglu,
D Kumaran,
R Hadsell
arXiv 2016
The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables
Authors:
C J Maddison,
A Mnih,
Y W Teh
arXiv 2016
Reference-Aware Language Models
Authors:
Z Yang,
P Blunsom,
C Dyer,
W Ling
arXiv 2016
The Neural Noisy Channel
Authors:
L Yu,
P Blunsom,
C Dyer,
E Grefenstette,
T Kočiský
arXiv 2016
Learning to Perform Physics Experiments via Deep Reinforcement Learning
Authors:
M Denil,
P Agrawal,
T D Kulkarni,
T Erez,
P Battaglia,
N de Freitas
Nature 2016
Mastering the game of Go with Deep Neural Networks & Tree Search
Authors:
D Silver,
A Huang,
C J Maddison,
A Guez,
L Sifre,
G van den Driessche,
J Schrittwieser,
I Antonoglou,
V Panneershelvam,
M Lanctot,
S Dieleman,
D Grewe,
J Nham,
N Kalchbrenner,
I Sutskever,
T Graepel,
T Lillicrap,
M Leach,
K Kavukcuoglu,
D Hassabis
Neuron 2016
Computations Underlying Social Hierarchy Learning: Distinct Neural Mechanisms for Updating and Representing Self-Relevant Information
Authors:
D Kumaran,
A Banino,
C Blundell,
D Hassabis,
P Dayan
NIPS 2016
Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Authors:
J Foerster,
Y M Assael,
N de Freitas,
S Whiteson
NIPS 2016
Unsupervised Learning of 3D Structure from Images
Authors:
D J Rezende,
S M A Eslami,
S Mohamed,
P Battaglia,
M Jaderberg,
N Heess
NIPS 2016
Sequential Neural Models with Stochastic Layers
Authors:
M Fraccaro,
S K Sønderby,
U Paquet,
O Winther
NIPS 2016
Conditional Image Generation with PixelCNN Decoders
Authors:
A van den Oord,
N Kalchbrenner,
O Vinyals,
L Espeholt,
A Graves,
K Kavukcuoglu
NIPS 2016
Strategic Attentive Writer for Learning Macro-Actions
Authors:
A Vezhnevets,
V Mnih,
J Agapiou,
S Osindero,
A Graves,
O Vinyals,
K Kavukcuoglu
NIPS 2016
Learning to Learn by Gradient Descent by Gradient Descent
Authors:
M Andrychowicz,
M Denil,
S Gomez Colmenarejo,
M W Hoffman,
D Pfau,
T Schaul,
N de Freitas
NIPS 2016
Matching Networks for One Shot Learning
Authors:
O Vinyals,
C Blundell,
T Lillicrap,
K Kavukcuoglu,
D Wierstra
NIPS 2016
Memory-Efficient Backpropagation through Time
Authors:
A Gruslys,
R Munos,
I Danihelka,
M Lanctot,
A Graves
DeepMind.com uses cookies to help give you the best possible user experience and to allow us to see how the site is used. By using this site, you agree that we can set and use these cookies. For more information on cookies and how to change your settings, see our Privacy Policy.