阅读背景:

Q-Learning vs SARSA贪婪选择

来源:互联网 

The difference between Q-Learning and SARSA is that Q-Learning compares the current state vs. the best possible next state where as SARSA compares the current state vs. the actual next state.The difference between Q-Learning and SARSA is




你的当前访问异常,请进行认证后继续阅读剩余内容。

分享到: