# mcts

Here are 143 public repositories matching this topic...
evg-tyurin commented on Feb 10, 2019:

During the self-play phase we usually collect different examples for the same board states. Should we preprocess such examples before optimizing the NNet? In the current implementation we don't preprocess them, so we train the NNet while expecting different outputs for the same input values. I think this may be wrong.
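One possible preprocessing step is to merge duplicate states before training. A minimal sketch, assuming (as in alpha-zero-general-style implementations) that each example is a (board, pi, v) tuple; the function name and tuple layout are illustrative, not taken from the project in question:

```python
from collections import defaultdict

import numpy as np

def average_duplicate_examples(examples):
    """Merge self-play examples that share the same board state by
    averaging their policy and value targets, so the network is not
    trained toward conflicting outputs for one input.

    `examples` is a list of (board, pi, v) tuples where `board` is a
    numpy array, `pi` a policy vector, and `v` a scalar outcome.
    """
    buckets = defaultdict(list)
    for board, pi, v in examples:
        # raw bytes of the board array serve as a hashable key
        buckets[board.tobytes()].append((board, pi, v))

    merged = []
    for group in buckets.values():
        board = group[0][0]
        pi = np.mean([g[1] for g in group], axis=0)   # averaged policy target
        v = float(np.mean([g[2] for g in group]))     # averaged value target
        merged.append((board, pi, v))
    return merged
```

Averaging keeps the targets consistent with the empirical distribution of outcomes, which is equivalent in expectation to training on the raw duplicates but avoids noisy conflicting gradients within a batch.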
MCTS project for Tetris
Updated Jun 5, 2020 - Python
A student implementation of Alpha Go Zero
Updated Aug 1, 2018 - Python
Reinforcement learning models in ViZDoom environment
Tags: agent, learning, reinforcement-learning, pytorch, doom, behavior-tree, mcts, vizdoom, reinforcement, ppo, doomnet-track1
Updated Dec 29, 2019 - Python
A pytorch tutorial for DRL(Deep Reinforcement Learning)
Tags: deep-reinforcement-learning, pytorch, dqn, mcts, uct, c51, iqn, hedge, ppo, a2c, gail, counterfactual-regret-minimization, qr-dqn, random-network-distillation, soft-actor-critic, self-imitation-learning
Updated May 1, 2019 - Jupyter Notebook
An asynchronous/parallel implementation of the AlphaGo Zero algorithm applied to Gomoku
Tags: tree, algorithm, board, tensorflow, gpu, paper, parallel, deep-reinforcement-learning, mcts, gomoku, noise, tree-search, tensorlayer, alphago, mpi4py, dirichlet-distribution, alphazero, alphazero-gomoku, board-model, playout-times, add-noises, junxiaosong
Updated Jan 20, 2020 - Python
A Deep Learning UCI-Chess Variant Engine written in C++ & Python 🐦
Tags: python, open-source, machine-learning, chess-engine, deep-learning, mxnet, artificial-intelligence, mcts, gluon, lichess, convolutional-neural-network, alphago, python-chess, alphazero, crazyhouse
Updated May 28, 2020 - Jupyter Notebook
Allie: A UCI compliant chess engine
Updated Jun 15, 2020 - C++
Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"
Tags: reinforcement-learning, bibliography, end-to-end, decision-making, prediction, planning, intention, mdp, mcts, game-theory, behavioral-cloning, interaction, risk-assessment, imitation-learning, inverse-reinforcement-learning, pomdp, decision-making-under-uncertainty, carla, model-based-reinforcement-learning, belief-planning
Updated Jun 15, 2020
Reinforcing Your Learning of Reinforcement Learning
Tags: reinforcement-learning, tic-tac-toe, space-invaders, q-learning, doom, dqn, mcts, policy-gradient, cartpole, gomoku, ddpg, atari-2600, alphago, frozenlake, ppo, advantage-actor-critic, alphago-zero
Updated Jul 14, 2019 - Python
AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.
Tags: game, machine-learning, reinforcement-learning, deep-learning, tensorflow, tic-tac-toe, connect-four, reversi, mcts, othello, tictactoe, resnet, deepmind, connect4, alphago-zero, alpha-zero, alphazero, self-play
Updated Apr 14, 2018 - Python
Tags: docker, kubernetes, board-game, flash, deep-neural-networks, ai, microservice, game-engine, wiki, actionscript, deep-reinforcement-learning, cnn, dnn, mcts, finite-state-machine, deeplearning, starling, fuzzy-logic-control, alphago, policytree
Updated Nov 22, 2019 - HTML
UCThello - a board game demonstrator (Othello variant) with computer AI using Monte Carlo Tree Search (MCTS) with UCB (Upper Confidence Bounds) applied to trees (UCT in short)
Tags: game, board-game, mobile, ai, simulation, mobile-app, artificial-intelligence, mcts, othello, mobile-game, entertainment, ucb, uct, monte-carlo-tree-search, ai-players, upper-confidence-bounds, abstract-game, perfect-information, 2-player-strategy-game
Updated Mar 30, 2018 - JavaScript
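The UCB-applied-to-trees selection rule that the UCThello description refers to can be sketched as follows; function and field names here are illustrative, not taken from UCThello's source:

```python
import math

def uct_select(children, exploration=math.sqrt(2)):
    """Pick the child maximizing the UCB1 score used by UCT:
    mean reward + c * sqrt(ln(parent visits) / child visits).

    `children` is a list of dicts with 'visits' and 'wins' counters;
    unvisited children are always tried first.
    """
    parent_visits = sum(c['visits'] for c in children)
    best, best_score = None, float('-inf')
    for child in children:
        if child['visits'] == 0:
            return child  # expand unvisited children before exploiting
        score = (child['wins'] / child['visits']
                 + exploration * math.sqrt(math.log(parent_visits) / child['visits']))
        if score > best_score:
            best, best_score = child, score
    return best
```

The exploration constant trades off revisiting strong moves against sampling rarely tried ones; sqrt(2) is the textbook UCB1 value, but game engines commonly tune it.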
A deep learning course project from UCAS
Updated Jan 8, 2019 - Python
Implementation of DeepMind's AlphaZero algorithm with Caffe and C++
Updated Apr 14, 2018 - C++
Python implementations of Gomoku AIs, including MCTS, Minimax, and a genetic algorithm
Updated Dec 14, 2018 - Python
9x9 AlphaGo
Updated Jul 27, 2016 - Python
CrazyAra - A Deep Learning UCI-Chess Variant Engine written in C++ 🐦
Tags: open-source, machine-learning, chess-engine, deep-learning, mxnet, cpp, artificial-intelligence, mcts, lichess, convolutional-neural-network, alphago, alphazero, crazyhouse, chess-variants
Updated Oct 8, 2019 - C++
Tic-Tac-Toe with the AlphaZero method - my first project
Updated Aug 23, 2018 - Python
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
Updated Jun 20, 2016 - C++
Quoridor AI based on Monte Carlo tree search
Updated Apr 24, 2020 - JavaScript
HybridAlpha - a mix between AlphaGo Zero and AlphaZero for multiple games
Tags: python, machine-learning, deep-learning, tensorflow, keras, deep-reinforcement-learning, pytorch, extensible, mcts, neural-networks, othello, tictactoe, resnet, flexibility, alpha-beta-pruning, greedy-algorithms, gobang, connect4, alphago-zero, alpha-zero
Updated May 23, 2020 - Python
Visualisation of MCTS in Unity with C# for different games, being created for my third year university project at the University of York
Tags: visualization, game, ai, university, csharp, unity, tic-tac-toe, visualisation, mcts, othello, dissertation, connect4, mcts-visualisation
Updated Jun 12, 2018 - C#
Implementation of the AlphaGo Zero paper in a single C++ header file without any dependencies
Tags: machine-learning, deep-neural-networks, reinforcement-learning, deep-learning, cpp, mcts, convolutional-neural-networks, alphago, mnist-nn, alphago-zero, mcts-implementations, self-play
Updated Apr 18, 2018 - C++
First of all, thanks for sharing the program.
During training on 6x6 four-in-a-row, did you tune the learning_rate or any other parameters?
The program's defaults are c_puct=5, temperature t=1, learning rate 0.002, batch_size 512, maximum deque length 10000, kl_targ=0.02, epochs=5.
Using those preset parameters to train 6x6 four-in-a-row with TensorFlow, the loss drops to around 2 and then stops decreasing; adjusting the learning rate didn't help either. I'd appreciate some guidance, thanks.
Also, I don't understand the meaning of the explain_var_old reference value.
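On the last question: in AlphaZero-style training scripts, "explained variance" typically measures how well the value head predicts the observed game outcomes, and a name like explain_var_old presumably denotes this metric computed with the network weights from before the update (versus an "_new" value after it). A minimal sketch of the metric itself:

```python
import numpy as np

def explained_variance(predicted, actual):
    """Explained variance: 1 - Var(actual - predicted) / Var(actual).

    A value near 1 means the value head tracks the game outcomes well;
    near 0 means it does no better than predicting the mean outcome;
    negative means it is actively worse than that baseline.
    """
    actual = np.asarray(actual, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    var_actual = np.var(actual)
    if var_actual == 0:
        return 0.0  # degenerate case: all outcomes identical
    return 1.0 - np.var(actual - predicted) / var_actual
```

Watching this number rise over training is a quick sanity check that the value head is learning something beyond the average result, independent of the total loss.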