Hello, I was trying to independently reproduce the results in the paper. Specifically, the results on Drand and Ddet, I was able to reproduce the results for Drand but for Ddet I found that the networks overfit to the training set and have bad performance on the original test set. I saw on the paper that it says that no data augmentation was done and the parameters that I used were not enough to regularize the network. Any ideas why this is happening?