时序差分学习

【时序差分学习】《Temporal-Difference Learning - YouTube》by Richard Sutton [DeepMind] 
PDF幻灯片http://videolectures.net/site/normal_dl/tag=1137922/deeplearning2017_sutton_td_learning_01.pdf

Video with the slides: http://videolectures.net/deeplearning2017_sutton_td_learning/

DeepMind announced in July, 2017 that Prof. Richard Sutton would be leading DeepMind Alberta. Richard S. Sutton is a

Canadian computer scientist. Currently he is professor of Computer Science and iCORE chair at the University of Alberta.

Dr. Sutton is considered one of the founding fathers of modern computational reinforcement learning, having several significant

contributions to the field, including temporal difference learning, policy gradient methods, the Dyna architecture.

 

原文地址:https://www.cnblogs.com/edisp/p/8045773.html