Hello, I understand that this is a quite old repo, but I would like to know if the GPU implementation has some bug, since it is commented out. Thanks