Hello, I encountered an issue while reproducing your work. I used your stage2_with_object365.sh file for training, but found that the batch size is very small, which leads to a very long training time. Moreover, the loss keeps increasing during the training process. Could you please tell me what might be causing this problem?

Hello, I encountered an issue while reproducing your work. I used your stage2_with_object365.sh file for training, but found that the batch size is very small, which leads to a very long training time. Moreover, the loss keeps increasing during the training process. Could you please tell me what might be causing this problem?