pytorch-wordemb

Load pretrained word embeddings (word2vec or GloVe format) into a torch.FloatTensor for PyTorch.

Install

PyTorch is required.

pip install torchwordemb

Usage

import torch
import torchwordemb

torchwordemb.load_word2vec_bin(path)

Read a word2vec binary-format model from path.

Returns (vocab, vec):

  • vocab is a dict mapping each word to its row index in vec.
  • vec is a torch.FloatTensor of size V x D, where V is the vocabulary size and D is the embedding dimension.

vocab, vec = torchwordemb.load_word2vec_bin("/path/to/word2vec/model.bin")
print(vec.size())
print(vec[vocab["apple"]])
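
The returned tensor can be used to initialize a PyTorch embedding layer. A minimal sketch (the copy into nn.Embedding is standard PyTorch, not part of this library; the model path is a placeholder):

import torch.nn as nn
import torchwordemb

vocab, vec = torchwordemb.load_word2vec_bin("/path/to/word2vec/model.bin")

# Build an embedding layer sized to the loaded vocabulary (V x D) and copy in
# the pretrained weights.
emb = nn.Embedding(vec.size(0), vec.size(1))
emb.weight.data.copy_(vec)

# The row for "apple" now holds its pretrained vector.
print(emb.weight.data[vocab["apple"]])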

torchwordemb.load_word2vec_text(path)

Read a word2vec text-format model from path.
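
A usage sketch, assuming it returns the same (vocab, vec) pair as the binary loader (the path is a placeholder):

vocab, vec = torchwordemb.load_word2vec_text("/path/to/word2vec/model.txt")
print(vec.size())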

torchwordemb.load_glove_text(path)

Read a GloVe text-format model from path.
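
A usage sketch under the same assumed (vocab, vec) return convention; the GloVe file path below is a placeholder:

vocab, vec = torchwordemb.load_glove_text("/path/to/glove/vectors.txt")
print(vec[vocab["apple"]])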
