https://github.com/huggingface/pytorch-pretrained-BERT/pull/597
huggingface/transformers#597