Hi,Yuri!how are you there? and I want to know about your recent progress in Muzero project, has your model converged?I built my Muzero to play renju,but after several hundred epochs of training, it still showed little enhancement too, which makes me kind of frustrated, would you share with me some intermediate output of the model such as hideen status, the predicated probability or evaluated value of one particular board configuration? we can communicate each other and analyse the problem
Hi,Yuri!how are you there? and I want to know about your recent progress in Muzero project, has your model converged?I built my Muzero to play renju,but after several hundred epochs of training, it still showed little enhancement too, which makes me kind of frustrated, would you share with me some intermediate output of the model such as hideen status, the predicated probability or evaluated value of one particular board configuration? we can communicate each other and analyse the problem