# alphago
Here are 76 public repositories matching this topic...
evg-tyurin commented Feb 10, 2019
During the self-play phase, we usually collect different training examples for the same board states. Should we preprocess such examples before optimizing the NNet? In the current implementation we don't, so we train the NNet while expecting different outputs for the same input values. I think this may be wrong.
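One common remedy is to average the targets of duplicate states before training, so the network sees a single consistent target per position. A minimal sketch, assuming each example is a `(state_key, pi, z)` triple with a hashable board encoding, a policy vector, and a scalar outcome (the function name and data layout are hypothetical, not taken from the repository under discussion):

```python
from collections import defaultdict
import numpy as np

def merge_duplicate_examples(examples):
    """Average policy targets and outcome values for identical board states.

    examples: iterable of (state_key, pi, z), where state_key is hashable,
    pi is a policy target vector, and z is the game outcome from that state.
    Returns one merged (state_key, mean_pi, mean_z) triple per unique state.
    """
    buckets = defaultdict(list)
    for state_key, pi, z in examples:
        buckets[state_key].append((np.asarray(pi, dtype=float), float(z)))
    merged = []
    for state_key, items in buckets.items():
        mean_pi = np.mean([p for p, _ in items], axis=0)
        mean_z = float(np.mean([z for _, z in items]))
        merged.append((state_key, mean_pi, mean_z))
    return merged
```

Averaging turns the conflicting targets into their empirical mean, which is also the value the usual MSE/cross-entropy losses would converge toward anyway, but with less gradient noise per batch.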
PyTorch implementations of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3, and more.
algorithm
deep-learning
deep-reinforcement-learning
pytorch
dqn
policy-gradient
sarsa
resnet
a3c
reinforce
sac
alphago
actor-critic
trpo
ppo
a2c
actor-critic-algorithm
td3
Updated Mar 18, 2020 - Python
BetaGo: AlphaGo for the masses, live on GitHub.
Updated Nov 18, 2019 - Python
Code and other material for the book "Deep Learning and the Game of Go"
python
data-science
machine-learning
games
deep-learning
neural-networks
game-of-go
alphago
alphago-zero
Updated Apr 9, 2020 - Python
A Go AI based on MCTS (the basic algorithm behind AlphaGo and DeepZenGo) WITHOUT deep learning
Updated Jan 15, 2019 - C++
A Go-playing program implemented in TensorFlow, roughly following the architecture of AlphaGo. Current strength is 3-4 amateur dan.
Updated Mar 18, 2019 - Python
A student implementation of AlphaGo Zero
Updated Aug 1, 2018 - Python
An AlphaZero program for Chinese chess (Xiangqi)
Updated Jan 2, 2019 - Jupyter Notebook
Datasets for computer Go
go
sgf
alphago
computer-go
tygem
computer-go-dataset
fineart
alphazero
minigo
phoenixgo
leelazero
muzero
golaxy
elf-opengo
Updated May 13, 2020 - C++
An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku
tree
algorithm
board
tensorflow
gpu
paper
parallel
deep-reinforcement-learning
mcts
gomoku
noise
tree-search
tensorlayer
alphago
mpi4py
dirichlet-distribution
alphazero
alphazero-gomoku
board-model
playout-times
add-noises
junxiaosong
Updated Jan 20, 2020 - Python
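The `dirichlet-distribution` and `add-noises` topics above refer to AlphaGo Zero's root-noise trick: before each self-play search, the root node's prior move probabilities are mixed with Dirichlet noise to force exploration. A minimal sketch, assuming a flat vector of priors (the function name is hypothetical; `epsilon=0.25` is the paper's mixing weight, while `alpha` varies by game and is often set around 0.3 for small boards like Gomoku):

```python
import numpy as np

def add_root_noise(priors, epsilon=0.25, alpha=0.3, rng=None):
    """Mix Dirichlet noise into the root's prior move probabilities,
    as in AlphaGo Zero: P'(a) = (1 - epsilon) * P(a) + epsilon * eta_a,
    with eta ~ Dir(alpha). Encourages exploration during self-play."""
    rng = rng or np.random.default_rng()
    priors = np.asarray(priors, dtype=float)
    noise = rng.dirichlet([alpha] * len(priors))
    return (1 - epsilon) * priors + epsilon * noise
```

Because both the priors and the noise sum to one, the mixture is still a valid probability distribution over the legal moves.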
A Deep Learning UCI-Chess Variant Engine written in C++ & Python 🐦
python
open-source
machine-learning
chess-engine
deep-learning
mxnet
artificial-intelligence
mcts
gluon
lichess
convolutional-neural-network
alphago
python-chess
alphazero
crazyhouse
Updated May 28, 2020 - Jupyter Notebook
A minimalist deep learning library in JavaScript using WebGL + asm.js. Run convolutional neural networks in your browser.
Updated Nov 21, 2017 - JavaScript
AlphaGo Zero implementation using Flux.jl
Updated Jan 16, 2019 - Julia
Reinforcing Your Learning of Reinforcement Learning
reinforcement-learning
tic-tac-toe
space-invaders
q-learning
doom
dqn
mcts
policy-gradient
cartpole
gomoku
ddpg
atari-2600
alphago
frozenlake
ppo
advantage-actor-critic
alphago-zero
Updated Jul 14, 2019 - Python
docker
kubernetes
board-game
flash
deep-neural-networks
ai
microservice
game-engine
wiki
actionscript
deep-reinforcement-learning
cnn
dnn
mcts
finite-state-machine
deeplearning
starling
fuzzy-logic-control
alphago
policytree
Updated Nov 22, 2019 - HTML
Updated Jan 12, 2019 - Python
An AlphaGo-style Gomoku (gobang, five-in-a-row) program
Updated Mar 17, 2020 - Python
9x9 AlphaGo
Updated Jul 27, 2016 - Python
Othello AI (AlphaGo's PV-MCTS algorithm)
Updated Nov 7, 2018 - Python
C++ implementation of AlphaGo Zero
Updated May 29, 2018 - C++
Updated May 17, 2020 - C++
CrazyAra - A Deep Learning UCI-Chess Variant Engine written in C++ 🐦
open-source
machine-learning
chess-engine
deep-learning
mxnet
cpp
artificial-intelligence
mcts
lichess
convolutional-neural-network
alphago
alphazero
crazyhouse
chess-variants
Updated Oct 8, 2019 - C++
A tutorial written for Caffe2 that mimics Google's AlphaGo Fan and AlphaGo Zero.
Updated Dec 20, 2018 - Jupyter Notebook
Tic-Tac-Toe with the AlphaZero method - my first project
Updated Aug 23, 2018 - Python
In memory of ZheLi
go
docker
kubernetes
golang
machine-learning
microservices
deep-neural-networks
ai
deep-learning
microservice
docker-compose
fabric
tensorflow
rxjava
blockchain
fintech
hyperledger
restfull-api
darknet
alphago
Updated Apr 4, 2018 - JavaScript
First of all, thanks for sharing the program.
During training on 6x6 four-in-a-row, did you tune learning_rate or any other parameters?
The program's presets are c_puct=5, temperature t=1, learning rate 0.002, batch_size 512, maximum deque length 10000, kl-targ=0.02, epochs=5.
Training 6x6 four-in-a-row with TensorFlow using your preset parameters, the loss drops to around 2 and then stops decreasing, and adjusting the learning rate didn't help either. Any help would be appreciated, thanks.
Also, I don't understand the meaning of the explain_var_old reference value.
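`explain_var_old` in that training script appears to be the explained variance of the value head, computed before the parameter update (with `explain_var_new` computed after it): 1 - Var(z - v)/Var(z), where z are the game outcomes and v the value-head predictions. Values near 1 mean the value head tracks the outcomes well; values near 0 or below mean it does no better than predicting the mean outcome. A sketch of that diagnostic (the function name is my own; the formula is the standard explained-variance definition, which I believe matches the script's intent):

```python
import numpy as np

def explained_variance(values_pred, outcomes):
    """Explained variance 1 - Var(z - v) / Var(z) of a value head.

    ~1.0: predictions account for almost all outcome variance (good).
    <=0.0: predictions are no better than the mean outcome (bad).
    """
    outcomes = np.asarray(outcomes, dtype=float)
    values_pred = np.asarray(values_pred, dtype=float)
    var_z = np.var(outcomes)
    if var_z == 0:  # all outcomes identical; the metric is undefined
        return float("nan")
    return 1.0 - np.var(outcomes - values_pred) / var_z
```

Watching this number during training is a quick sanity check on the value head independently of the combined loss: a stuck total loss with rising explained variance suggests the plateau is in the policy term, not the value term.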