Thank you very much for open-sourcing this code.
I am confused by this code segment in the put_last_trajectory method in selfplay_worker.py.
On line 69, why is it
pad_child_visits_lst = game_histories[i].child_visits[beg_index:end_index]
rather than
pad_child_visits_lst = game_histories[i].child_visits[:self.config.num_unroll_steps]?
In my understanding, game_histories[i].child_visits[0] holds the child visits for the stacked observation game_histories[i].obs_history[beg_index].
Is this a bug?
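To make the difference concrete, here is a minimal toy sketch of the two slices. It does not use the repository's code: the list contents, the value of beg_index, and the relation end_index = beg_index + num_unroll_steps are all assumptions made only for illustration.

```python
# Toy stand-in for game_histories[i].child_visits. Each entry is labelled with
# its own list position so the slices are easy to compare. Hypothetical data.
child_visits = [f"child_visits[{k}]" for k in range(10)]

num_unroll_steps = 3                       # stand-in for self.config.num_unroll_steps
beg_index = 4                              # hypothetical value, not taken from the repo
end_index = beg_index + num_unroll_steps   # assumption made only for this sketch

# Slice as written on line 69 of selfplay_worker.py.
slice_as_in_code = child_visits[beg_index:end_index]

# Slice I would have expected instead.
slice_as_expected = child_visits[:num_unroll_steps]

print(slice_as_in_code)
print(slice_as_expected)
```

Whenever beg_index is greater than 0 the two slices pick different entries, so only one of them can line up with obs_history[beg_index], depending on how child_visits is actually indexed.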
Looking forward to your reply!