
Machine Learning with C#: Punishment and Reward (Reinforcement Learning) - 王振耀

 5 years ago
source link: https://www.cnblogs.com/wangzhenyao1994/p/10259854.html
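Overview of reinforcement learning: As mentioned earlier, reinforcement learning is a form of learning in which a computer learns by trial and error. The rewards it obtains by interacting with its environment guide its behavior, and the goal is for the program to maximize its total reward. Reinforcement learning differs from supervised learning, and the difference lies mainly in the reinforcement signal …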
