DeepMind 的 AI 从 AlphaGo Zero 进化到 AlphaZero

6 years ago

source link: http://www.solidot.org/story?sid=54778
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

DeepMind 的 AI 从 AlphaGo Zero 进化到 AlphaZero

pigsrollaroundinthem (39396)发表于 2017年12月07日 21时00分星期四新浪微博分享豆瓣分享 来自下个天网

Google AI 子公司 DeepMind 的研究人员本周在预印本网站 arxiv 发表论文（PDF），称他们的 AI 程序从 AlphaGo Zero 进化到了 AlphaZero，通过自对弈在数小时内打败了最出色的国际象棋和日本将棋程序。AlphaGo Zero 是通过强化学习方法训练花了 40 天时间成为超越人类的最强大围棋选手。AlphaZero 应用了类似但更通用的算法，它只掌握最基本的棋类规则，然后通过自对弈反复训练强化学习逐渐进化。它用了 8 小时超越了打败李世石的版本 AlphaGo Lee，用了 4 小时打败了最出色的国际象棋程序 Stockfish，用了 2 小时打败了将棋程序 Elmo。AlphaZero 和 AlphaGo Zero 一样都只使用 4 个 TPU。

Recommend

DeepMind 的 AI 从 AlphaGo Zero 进化到 AlphaZero

DeepMind 的 AI 从 AlphaGo Zero 进化到 AlphaZero

Recommend

机票投资失利，酒店急功近利，狂奔的美团为何突然遇阻？

永辉超市承认超级物种拟引战投：二马正面交锋新零售

艾萨克森：乔布斯的14节管理课

36氪独家|美团升级了出行事业部，滴滴就用做外卖来反击

电脑报2017年第48期

Architecture for Multiplatform native development in Kotlin

马云师范生计划10年投3亿:让最优秀的学生做乡村老师

GitHub - celrenheit/sandglass: Sandglass is a distributed, horizontally scalable...

ZH奶酪：编程语言入门经典100例 Python版

GitHub - parkouss/webmacs: webmacs - keyboard driven (emacs key bindings) browse...

About Joyk