Thanks for your open-sourced code very much.
I'm a little confused about the reason for the identity connection of state encoding in DynamicsNetwork in model.py:
Why do we add this state encoding identity connection, rather than using action encoding, and what is its empirical impact on atari results?
Looking forward to your reply!