3 skills to master before reinforcement learning (RL)

栏目: IT技术 · 发布时间: 6年前

内容简介：Modern reinforcement learning is almost entirely focused onAlmost all of the courses and tutorials will assume you

1. Supervised learning

Modern reinforcement learning is almost entirely focused on deep reinforcement learning . The word in the “ deep ” in the phrase deep reinforcement learning implies the use of a neural network in a core aspect of the algorithm. The neural network does some high-dimensional approximation in the learning process. That being said, the model does not need to have many layers and features, which is a common misconception that deep implies many layers.

Almost all of the courses and tutorials will assume you can fine-tune simple neural networks to approximate state values or create a final policy . These models are historically highly sensitive to all of the following training parameters: learning rate, batch size, model parameters, data normalization, and more. Doubled with tasks that are difficult to solve, debugging RL can be very difficult, and just seem like a binary it works or it doesn’t . Eliminating tails of confusing by knowing that all the sub approximations made are up to par. The best way to do this would be to learn supervised learning, then let an AutoML tool finish the job for you.

以上就是本文的全部内容，希望本文的内容对大家的学习或者工作能带来一定的帮助，也希望大家多多支持码农网

查看所有标签

本站部分资源来源于网络，本站转载出于传递更多信息之目的，版权归原作者或者来源机构所有，如转载稿涉及版权问题，请联系我们。

码农书籍

信息论基础

Thomas M.Cover、Joy A.Thomas / 清华大学出版社 / 2003-11-1 / 65.00元

《国际知名大学原版教材•信息论基础》系统介绍了信息论基本原理及其在通信理论、统计学、计算机科学、概率论以及投资理论等领域的应用。作者以循序渐进的方式，介绍了信息量的基本定义、相对熵、互信息以及他们如何自然地用来解决数据压缩、信道容量、信息率失真、统计假设、网络信息流等问题。一起来看看《信息论基础》这本书的介绍吧!

码农工具

html转js在线工具

HEX HSV 转换工具

HEX HSV 互换工具