Questions on submodule implementation

Hi, I'm trying to implement MCTSnet recently, and your repo is very inspiring. I have several questions regarding the submodules.
1. The authors claim they use residual blocks in both the embedding network and the prior policy network. Each residual block contains two convolutional layers and a 'residual' step. Did you simplify it for the experiment? 
2. The authors use MLP with ONE hidden layer in the backup network as well as the readout network. I think your code maps the input to output directly.

P.S. Have you tried to limit the path depth at each state? 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Questions on submodule implementation #4

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Questions on submodule implementation #4

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions