The pre-trained models are incompatible with official GloVe mebddings. Re-training frameid ends up with -nan gradients at the early stage of training.