CS294-112 深度强化学习 秋季学期(伯克利)NO.7 Optimal control and planning

 

transition possibility is unknown and we even don't need to estimate the possibility

 

 

 

 

  

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

原文地址:https://www.cnblogs.com/ecoflex/p/9094689.html