can explain why do this (h + 1) // 2, (w + 1) // 2


https://github.com/openai/Video-Pre-Training/blob/077ba2b9885ff696051df8348dc760d9699139ca/lib/impala_cnn.py#L127

Can explain why this line we do this +1 for height and weight before divide?


https://github.com/openai/Video-Pre-Training/blob/077ba2b9885ff696051df8348dc760d9699139ca/lib/impala_cnn.py#L168

any paper i can read on why set init scaling using the number of channel ?