Deepmind: the existence proof for RL at scale

栏目: IT技术 · 发布时间: 6年前

DeepMind: the existence proof for RL at scale

The brain is the existence proof for general intelligence — Google’s DeepMind is the proof we are making progress to replicating it.

Deepmind: the existence proof for RL at scale

DeepMind’s reinforcement learning successes with AlphaGo, AlphaZero, etc. are paving the way for the next generation of technology companies deploying large-scale AI projects. DeepMind is far from a profitable entity on paper (they do give Google an advantage, though), but they are showing the world how to use artificial intelligence, and more impressively reinforcement learning , to obsolete humans in a particular task. Now that it exists, it’ll be done for profitable tasks.

What does DeepMind do?

DeepMind wants to solve intelligence. In recent years they have taken the world’s best researchers in machine learning and merged them with the best computer scientists to create computers that beat humans at games. Any game they have tried, to be exact. AlphaZero solved numerous games with self-play and no human input. DeepMind targeted a system and gave a computer the tools to master it. At a high level, it is a beautiful implication: what computers can do with no human input expands every day.

As a researcher, I have a love-hate affair with the technology giants coming to play in cutting edge machine learning research. DeepMind swings their weight around in the reinforcement learning area with more intent and more impact. People should copy their approach.


以上就是本文的全部内容,希望对大家的学习有所帮助,也希望大家多多支持 码农网

查看所有标签

本站部分资源来源于网络,本站转载出于传递更多信息之目的,版权归原作者或者来源机构所有,如转载稿涉及版权问题,请联系我们

流量池

流量池

杨飞 / 中信出版集团 / 2018-4 / 68.00

移动互联网时代,信息日益冗余,新闻速朽; 整体流量增长速度放缓,而竞争者数量高速增加; 流量呈现变少、变贵、欺诈频繁的现状; 品效合一的营销策略成为共识,而实现路径成为痛点; 多次开创各营销渠道效果之最的营销人、各种刷屏级营销事件操盘手、神州专车CMO杨飞,这一次倾囊相授,诚恳讲述如何实现流量获取、营销转化以及流量的运营和再挖掘。一起来看看 《流量池》 这本书的介绍吧!

Base64 编码/解码
Base64 编码/解码

Base64 编码/解码

XML 在线格式化
XML 在线格式化

在线 XML 格式化压缩工具

Markdown 在线编辑器
Markdown 在线编辑器

Markdown 在线编辑器