ELF OpenGo

ELF OpenGo
原作者	田淵棟（Yuandong Tian）; Jerry Ma; Qucheng Gong; Shubho Sengupta; 陳卓遠（Zhuoyuan Chen）; C. Lawrence Zitnick;
首次發布	v0（2018年5月2日，6年前）
原始碼庫	github.com/pytorch/elf
程式語言	C++、Python、C語言
類型	電腦圍棋
許可協定	BSD許可證
網站	facebook.ai/developers/tools/elf

ELF OpenGo是Facebook AI Research團隊（FAIR）所開發的電腦圍棋軟體及所釋出的資料^[2]。

快速預覽 原作者, 首次發布 ...

關閉

簡介

ELF OpenGo是Facebook AI Research團隊（FAIR）依照DeepMind在科學期刊《自然》上對於AlphaGo Zero所發表的論文《Mastering the game of Go without human knowledge^[3]》與AlphaZero的論文《Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm^[4]》所實做出的開源電腦圍棋程式^[5]，也就是不使用人類棋譜與累積的圍棋知識，僅實做圍棋規則，使用單一類神經網路從自我對弈中學習（不像AlphaGo以人類角度思考，設計了Policy Network與Value Network）。

訓練網路使用20 blocks x 224 filters，在2000個GPU下訓練兩週^[5]，相較AlphaGo Zero使用的20 blocks x 256 filters版本略小一些（AlphaGo Zero另外還有40 blocks x 256 filters）。

由於Facebook所擁有的計算資源，產生出高品質的訓練網路資料與對局棋譜^[6]^{[註 1]}，許多基於相同演算法或是AlphaGo相關論文內容的圍棋軟體都積極測試ELF OpenGo所提供的訓練資料^[7]^[8]。

Remove ads

成績

對電腦

Leela Zero

Leela Zero是目前少數有公開程式碼並公開訓練網路資料的圍棋軟體，且仍然有志願者持續投入資源計算演化，故經常被當作其他圍棋軟體的基準。

由Facebook自行測試，ELF OpenGo與Leela Zero對戰的成績為198:2^[2]^{[註 2]}。在Leela Zero的進度網站上也經常會比較現有訓練網路與ELF OpenGo的比較^[9]。

CGOS

志願者在CGOS上使用Leela Zero的程式引擎以及由ELF OpenGo公開的訓練網路（v0）轉換成Leela Zero格式的訓練網路（即Hash值62b5417b的訓練網路，帳號LZ_62b541_ELF_1600）進行對弈測試^[10]，但由於目前在CGOS上測試的不是ELF Go的程式引擎，不清楚ELF OpenGo的訓練資料在轉換後的影響。截至2018年5月30日 (2018-05-30)^[update]，對弈已經超過1000盤，BayesElo分數約3770分^[11]。

對人

Facebook與韓國棋院合作，以中國規則與韓國的世界頂尖棋手對弈（貼目7.5目），在電腦每步限制50秒（使用單機單張NVIDIA Tesla V100），人類不限時間的前提下，每個人至少下兩局，達到14:0的成績^[6]，對弈對手包括金志錫、申真諝、朴永訓以及崔哲瀚^[2]。

參考資料

[1]
pytorch/ELF: ELF: a platform for game research. [2018-05-07]. （原始內容存檔於2018-05-03）.
[2]
Facebook Open Sources ELF OpenGo. 2018-05-02 [2018-05-08]. （原始內容存檔於2018-05-07）.
[3]
Mastering the game of Go without human knowledge. 自然 (期刊). 2017-10-18 [2018-05-06]. （原始內容存檔於2018-04-12）.
[4]
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm. 2017-12-05 [2018-05-06]. （原始內容存檔於2017-12-08）.
[5]
ELF | Game Research Platform | Facebook AI. [2018-05-06]. （原始內容存檔於2018-05-06）.
[6]
Releases · pytorch/ELF. [2018-05-06]. （原始內容存檔於2019-02-15）.
[7]
Releases · gcp/leela-zero. [2018-05-09]. （原始內容存檔於2019-10-16）.
[8]
Releases · zakki/Ray. [2018-05-07]. （原始內容存檔於2019-02-15）.
[9]
Leela Zero. [2018-05-06]. （原始內容存檔於2018-05-05）.
[10]
Crosstable for LZ_62b541_ELF_1600. [2018-05-07]. （原始內容存檔於2018-05-07）.
[11]
19x19 All Time Ranks. [2018-05-30]. （原始內容存檔於2018-01-17）.

註解

[註 1]
包含所有電腦對弈棋譜，以及12盤人類棋手願意公開的棋譜資訊（14盤中有2盤因尊重棋手意願不公開^[6]）。另外在棋譜中未納入人類棋手的名字^[6]。
[註 2]
Leela Zero的訓練網路為2018年4月25日所訓練出的第128代，Hash值為158603eb^[9]的版本。

外部連結

[elf-github-1] [1]
pytorch/ELF: ELF: a platform for game research. [2018-05-07]. （原始內容存檔於2018-05-03）.

[announcement-2] [2]
Facebook Open Sources ELF OpenGo. 2018-05-02 [2018-05-08]. （原始內容存檔於2018-05-07）.

[agz-nature-3] [3]
Mastering the game of Go without human knowledge. 自然 (期刊). 2017-10-18 [2018-05-06]. （原始內容存檔於2018-04-12）.

[az-arxiv-4] [4]
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm. 2017-12-05 [2018-05-06]. （原始內容存檔於2017-12-08）.

[elf-5] [5]
ELF | Game Research Platform | Facebook AI. [2018-05-06]. （原始內容存檔於2018-05-06）.

[release-6] [6]
Releases · pytorch/ELF. [2018-05-06]. （原始內容存檔於2019-02-15）.

[leelazero-release-8] [7]
Releases · gcp/leela-zero. [2018-05-09]. （原始內容存檔於2019-10-16）.

[ray-release-9] [8]
Releases · zakki/Ray. [2018-05-07]. （原始內容存檔於2019-02-15）.

[leelazero-project-10] [9]
Leela Zero. [2018-05-06]. （原始內容存檔於2018-05-05）.

[12] [10]
Crosstable for LZ_62b541_ELF_1600. [2018-05-07]. （原始內容存檔於2018-05-07）.

[bayes-19x19-13] [11]
19x19 All Time Ranks. [2018-05-30]. （原始內容存檔於2018-01-17）.

[7] [註 1]
包含所有電腦對弈棋譜，以及12盤人類棋手願意公開的棋譜資訊（14盤中有2盤因尊重棋手意願不公開^[6]）。另外在棋譜中未納入人類棋手的名字^[6]。

[11] [註 2]
Leela Zero的訓練網路為2018年4月25日所訓練出的第128代，Hash值為158603eb^[9]的版本。

[2]

[1]

[3]

[4]

[5]

[6]

[註 1]

[7]

[8]

[註 2]

[9]

[10]

[11]

ELF OpenGo

簡介

成績

對電腦

Leela Zero

CGOS

對人

相關連結

參考資料

註解

外部連結

Wikiwand in your browser!

ELF OpenGo