How to use fastText for out of sample words? #26
Facebook distributes 2 types of files: `.vec` (word vectors in plain word2vec text format) and `.bin` (the full fastText model, including subword information).
Next time, please ask on the mailing list.
@menshikh-iv is this clear from our documentation? I see people confused about these formats, how to load them, and what can be done with them, all the time. A clear, authoritative docs section would help us with support too (we could just point to it with a hyperlink).
@piskvorky I agree, this situation happens sometimes; it would be worth making a tutorial.
A tutorial would be ideal, but a simple paragraph in the docs would go a long way. Can you add it?
This is not working for me with gensim 3.5, Python 3.6, and a FB model. I get:
@scottlittle please read the exception again: we really don't support supervised fastText models.
What is meant by supervised fastText models, and how do I train an unsupervised one?
Exactly what supervised learning means: the FB implementation supports a supervised mode (gensim supports only the unsupervised mode).
Just read the gensim FastText documentation.
When downloading fastText with this method, we get a folder with a file in standard word2vec format, which can be loaded with

```python
model = KeyedVectors.load_word2vec_format(path, binary=False)
```

But not with

```python
from gensim.models import FastText
model = FastText.load_fasttext_format(path, binary=False)
```
This disables the ability to get vectors for out-of-vocabulary words.
How can this be done correctly?