ELF OpenGo

ELF OpenGo
Original authors	Yuandong Tian; Jerry Ma; Qucheng Gong; Shubho Sengupta; Zhuoyuan Chen; C. Lawrence Zitnick
Initial release	v0 (May 2, 2018)
Repository	github.com/pytorch/elf
Written in	C++, Python, C
Type	Computer Go
License	BSD license
Website	facebook.ai/developers/tools/elf

ELF OpenGo is a computer Go program, together with its accompanying released data, developed by the Facebook AI Research (FAIR) team^[2].


Introduction

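ELF OpenGo is an open-source computer Go program^[5] that the Facebook AI Research (FAIR) team implemented by following two DeepMind papers: the AlphaGo Zero paper published in the scientific journal Nature, "Mastering the game of Go without human knowledge"^[3], and the AlphaZero paper, "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm"^[4]. It uses no human game records or accumulated Go knowledge. Only the rules of Go are implemented, and a single artificial neural network learns from self-play (unlike AlphaGo, which was designed around a human way of thinking, with a separate Policy Network and Value Network).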

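The trained network uses 20 blocks x 224 filters and was trained for two weeks on 2,000 GPUs^[5]. It is slightly smaller than the 20 blocks x 256 filters version used by AlphaGo Zero (AlphaGo Zero also has a 40 blocks x 256 filters version).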
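To give a rough sense of what "slightly smaller" means, the sketch below counts only the convolution weights in the residual tower. It assumes an AlphaGo Zero style block with two 3x3 convolutions that each map `filters` channels to `filters` channels, and it ignores biases, batch normalization, the input convolution, and the output heads. The function name `tower_conv_weights` and this block layout are illustrative assumptions, not details taken from the ELF OpenGo source.

```python
def tower_conv_weights(blocks: int, filters: int,
                       kernel: int = 3, convs_per_block: int = 2) -> int:
    """Count convolution weights in a residual tower (assumed layout).

    Assumption: every block holds `convs_per_block` square convolutions,
    each mapping `filters` input channels to `filters` output channels.
    Biases, batch norm, the input convolution and the heads are ignored.
    """
    return blocks * convs_per_block * kernel * kernel * filters * filters


elf = tower_conv_weights(20, 224)      # ELF OpenGo: 20 blocks x 224 filters
agz_20 = tower_conv_weights(20, 256)   # AlphaGo Zero: 20 blocks x 256 filters
agz_40 = tower_conv_weights(40, 256)   # AlphaGo Zero: 40 blocks x 256 filters

print(f"20 x 224: {elf:,} weights")
print(f"20 x 256: {agz_20:,} weights")
print(f"40 x 256: {agz_40:,} weights")
print(f"ratio 20x224 / 20x256: {elf / agz_20:.4f}")
```

Under these assumptions the weight count scales with the square of the filter count, so the 20 x 224 tower has (224/256)^2, or roughly 77%, of the tower weights of the 20 x 256 configuration.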

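Because of the computing resources available to Facebook, the project produced high-quality trained network data and game records^[6]^{[Note 1]}, and many Go programs based on the same algorithm or on the AlphaGo papers have been actively testing the training data that ELF OpenGo provides^[7]^[8].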

Results

Against computers

Leela Zero

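Leela Zero is currently one of the few Go programs with both public source code and publicly released trained network data. Volunteers continue to contribute computing resources to evolve it, so it is often used as a benchmark for other Go programs.

In Facebook's own testing, ELF OpenGo scored 198:2 against Leela Zero^[2]^{[Note 2]}. The Leela Zero progress site also regularly compares its current trained networks against ELF OpenGo^[9].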
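As a back-of-the-envelope illustration, a 198:2 score can be turned into an approximate rating gap with the standard logistic Elo formula. This is only a sketch: the helper names `elo_gap` and `expected_score` are hypothetical, draws are not modeled, and with only 2 losses in 200 games the estimate carries wide uncertainty. It is also not the same procedure as the BayesElo ratings reported on CGOS, so the numbers are not directly comparable.

```python
import math


def elo_gap(wins: int, losses: int) -> float:
    """Rating gap implied by a win/loss record under the logistic Elo model.

    Assumption: no draws, and both counts are positive (the formula
    diverges for a perfect score).
    """
    if wins <= 0 or losses <= 0:
        raise ValueError("wins and losses must both be positive")
    return 400.0 * math.log10(wins / losses)


def expected_score(gap: float) -> float:
    """Expected score of the stronger side for a given rating gap."""
    return 1.0 / (1.0 + 10.0 ** (-gap / 400.0))


gap = elo_gap(198, 2)
print(f"implied gap: {gap:.0f} Elo, expected score: {expected_score(gap):.2f}")
```

Under this model a 99% score corresponds to a gap of about 798 Elo points.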

CGOS

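Volunteers ran test games on CGOS using the Leela Zero engine together with ELF OpenGo's public trained network (v0) converted to Leela Zero format (the network with hash 62b5417b, playing under the account LZ_62b541_ELF_1600)^[10]. Because what is being tested on CGOS is not the ELF OpenGo engine itself, the effect of the conversion on ELF OpenGo's training data is unclear. As of May 30, 2018, more than 1,000 games had been played, with a BayesElo rating of about 3770^[11].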

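Against humans

In cooperation with the Korea Baduk Association, Facebook had ELF OpenGo play top-ranked Korean professional players under Chinese rules (komi of 7.5 points). The computer was limited to 50 seconds per move (running on a single machine with one NVIDIA Tesla V100), the human players had unlimited time, and each player played at least two games. ELF OpenGo achieved a 14:0 record^[6]. The opponents included Kim Ji-seok, Shin Jin-seo, Park Yeong-hun, and Choi Cheol-han^[2].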

References

[1]
pytorch/ELF: ELF: a platform for game research. [Retrieved 2018-05-07]. (Archived from the original on 2018-05-03).
[2]
Facebook Open Sources ELF OpenGo. 2018-05-02 [Retrieved 2018-05-08]. (Archived from the original on 2018-05-07).
[3]
Mastering the game of Go without human knowledge. Nature. 2017-10-18 [Retrieved 2018-05-06]. (Archived from the original on 2018-04-12).
[4]
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm. 2017-12-05 [Retrieved 2018-05-06]. (Archived from the original on 2017-12-08).
[5]
ELF | Game Research Platform | Facebook AI. [Retrieved 2018-05-06]. (Archived from the original on 2018-05-06).
[6]
Releases · pytorch/ELF. [Retrieved 2018-05-06]. (Archived from the original on 2019-02-15).
[7]
Releases · gcp/leela-zero. [Retrieved 2018-05-09]. (Archived from the original on 2019-10-16).
[8]
Releases · zakki/Ray. [Retrieved 2018-05-07]. (Archived from the original on 2019-02-15).
[9]
Leela Zero. [Retrieved 2018-05-06]. (Archived from the original on 2018-05-05).
[10]
Crosstable for LZ_62b541_ELF_1600. [Retrieved 2018-05-07]. (Archived from the original on 2018-05-07).
[11]
19x19 All Time Ranks. [Retrieved 2018-05-30]. (Archived from the original on 2018-01-17).

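Notes

[Note 1]
Includes all computer game records, plus the records of the 12 games that the human players agreed to release (2 of the 14 games are withheld out of respect for the players' wishes^[6]). The human players' names are not included in the game records^[6].
[Note 2]
The Leela Zero network used was the 128th-generation network, trained on April 25, 2018, with hash 158603eb^[9].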

External links

Official website
pytorch/ELF: ELF: a platform for game research (archived copy at the Internet Archive)
