Apprenticeship Learning via Inverse Reinforcement Learning

[source] ICML

[year] 2004

The paper directly proposes two algorithms: a max-margin method and a projection method.

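The projection method is the easier of the two, and existing implementations can be found.

Each iteration amounts to finding a descent direction: a weight vector w that points from the current estimate toward the expert's feature expectations.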
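As a rough illustration of that step, here is a minimal sketch of one projection update on feature-expectation vectors. The function name `projection_step` and its argument names are hypothetical and chosen for this note; the step that computes a new policy for the reward w (the RL solver) is assumed to exist elsewhere and is not shown.

```python
import math


def projection_step(mu_bar_prev, mu_new, mu_expert):
    """One projection update (illustrative sketch, names are assumptions).

    mu_bar_prev: previous projected point in feature-expectation space
    mu_new:      feature expectations of the most recently computed policy
    mu_expert:   (estimated) feature expectations of the expert
    Returns the new projected point, the next weight vector w, and the
    margin t = ||mu_expert - mu_bar||.
    """
    # Direction from the previous projected point to the newest policy.
    d = [n - b for n, b in zip(mu_new, mu_bar_prev)]
    # Vector from the previous projected point to the expert.
    gap = [e - b for e, b in zip(mu_expert, mu_bar_prev)]

    denom = sum(x * x for x in d)
    # Guard against a degenerate update when the new policy adds nothing.
    step = 0.0 if denom == 0.0 else sum(x * g for x, g in zip(d, gap)) / denom

    # Orthogonal projection of the expert onto the line through
    # mu_bar_prev and mu_new.
    mu_bar = [b + step * x for b, x in zip(mu_bar_prev, d)]

    # Next reward weights point from the projected point to the expert.
    w = [e - m for e, m in zip(mu_expert, mu_bar)]
    t = math.sqrt(sum(x * x for x in w))
    return mu_bar, w, t
```

In a full loop this would be repeated, solving the MDP for the reward defined by w each time, until t falls below a chosen tolerance.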

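The hard part is modeling: working out each component of the MDP\R (an MDP without a reward function). The state space S, the action space A, and the feature map \phi are all far from obvious.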
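To make the role of \phi more concrete, the sketch below estimates the expert's feature expectations as the average discounted sum of features over demonstrated trajectories. The names `feature_expectations`, `trajectories`, and `phi` are assumptions made for this illustration; choosing what a state is and what `phi` returns is exactly the modeling problem described above.

```python
def feature_expectations(trajectories, phi, gamma):
    """Empirical discounted feature expectations (illustrative sketch).

    trajectories: non-empty list of state sequences from the demonstrator
    phi:          hypothetical feature map, state -> list of floats
    gamma:        discount factor in [0, 1)
    """
    m = len(trajectories)
    k = len(phi(trajectories[0][0]))
    mu = [0.0] * k
    for traj in trajectories:
        discount = 1.0
        for state in traj:
            features = phi(state)
            for j in range(k):
                mu[j] += discount * features[j]
            discount *= gamma
    # Average over the m demonstrated trajectories.
    return [x / m for x in mu]
```

For example, with states given as plain numbers and a toy feature map `phi = lambda s: [1.0, float(s)]`, calling `feature_expectations([[0, 1], [2, 2]], phi, 0.5)` averages the two discounted feature sums.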

Original post: https://www.cnblogs.com/justin_s/p/2073113.html