##The DIY Guide to Gensim


Please make Pull Requests for good resources, or create Issues for any feedback! Thanks!




Gensim is a performant Python library for NLP projects. It is arguably the most popular library for Word2Vec and Doc2Vec, and it also provides other NLP tools such as LDA, LSI, and Random Projection.

###Installation

Install with either of the following commands:

    pip install gensim
    easy_install gensim
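
To confirm that the install worked, a quick check along these lines may help. This is a minimal sketch using only the standard library; the helper name `gensim_is_installed` is made up for illustration and is not part of gensim.

```python
import importlib.util


def gensim_is_installed() -> bool:
    """Return True if the `gensim` package can be found on the import path."""
    return importlib.util.find_spec("gensim") is not None


if __name__ == "__main__":
    if gensim_is_installed():
        import gensim

        # Report which version was picked up by this interpreter.
        print("gensim version:", gensim.__version__)
    else:
        print("gensim not found; try `pip install gensim`")
```

Run it with the same Python interpreter you used for `pip`, since each environment has its own set of installed packages.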

###Hello World

Just a simple code-based intro; theory is covered in the next section.

#####Text to Vectors
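
As a rough illustration of what "text to vectors" means, the sketch below builds sparse bag-of-words vectors in plain Python. Gensim does this with `corpora.Dictionary` and its `doc2bow` method, which also yields lists of `(token_id, count)` pairs. The helpers `build_vocabulary` and `to_bow` here are hypothetical stand-ins, and the id assignment (first-seen order) is an assumption of this sketch, not a claim about gensim's internals.

```python
from collections import Counter


def build_vocabulary(tokenized_docs):
    """Map each distinct token to an integer id, in first-seen order."""
    vocab = {}
    for doc in tokenized_docs:
        for token in doc:
            if token not in vocab:
                vocab[token] = len(vocab)
    return vocab


def to_bow(doc, vocab):
    """Convert a token list to a sorted list of (token_id, count) pairs.

    Tokens missing from the vocabulary are silently dropped.
    """
    counts = Counter(vocab[t] for t in doc if t in vocab)
    return sorted(counts.items())


# Two tiny, already-tokenized documents (made-up sample data).
docs = [
    "human machine interface".split(),
    "machine learning for human users".split(),
]
vocab = build_vocabulary(docs)
corpus = [to_bow(doc, vocab) for doc in docs]
```

Each document becomes a short list of pairs rather than a full-length vector, which is what keeps large vocabularies manageable.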

#####Models and Transformation

#####TF-IDF (Model)

#####Phrases (Model)

#####LSI (Model)

#####LDA (Model)

#####Word2Vec (Model)

#####Doc2Vec (Model)


###Theory

#####TF-IDF
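
For intuition, here is a from-scratch sketch of one common TF-IDF formulation: raw term count times `log2(N / df)`, followed by L2 normalization. It is an illustrative assumption rather than a statement of what any particular gensim version computes, and the function name `tfidf` is hypothetical.

```python
import math
from collections import Counter


def tfidf(tokenized_docs):
    """Return one {token: weight} dict per document.

    weight = term_count * log2(n_docs / doc_freq), then L2-normalized.
    """
    n_docs = len(tokenized_docs)

    # Document frequency: in how many documents does each token appear?
    df = Counter()
    for doc in tokenized_docs:
        df.update(set(doc))

    weighted = []
    for doc in tokenized_docs:
        tf = Counter(doc)
        vec = {t: tf[t] * math.log2(n_docs / df[t]) for t in tf}
        norm = math.sqrt(sum(w * w for w in vec.values()))
        if norm > 0:
            # Scale to unit length so long documents do not dominate.
            vec = {t: w / norm for t, w in vec.items()}
        weighted.append(vec)
    return weighted


# Made-up sample corpus: "cat" is in two documents, "sat" in only one.
docs = [["cat", "sat", "mat"], ["cat", "ate", "fish"], ["dog", "ate", "fish"]]
weights = tfidf(docs)
```

In the first document, the rarer token "sat" ends up with a higher weight than "cat", and a token that appears in every document would get a weight of zero.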

#####LSI

#####LDA

#####Word2Vec

#####Doc2Vec

  • Paper
  • [contribution needed: good resources that explain Doc2Vec]

###Advanced Features

#####Query Similarities
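
The idea behind similarity queries can be sketched as ranking documents by cosine similarity to a query vector. Gensim provides this through its `similarities` module; the plain-Python version below is only a conceptual sketch, with hypothetical helper names `cosine` and `rank` and sparse vectors represented as dicts.

```python
import math


def cosine(u, v):
    """Cosine similarity between two sparse vectors given as {key: weight}."""
    dot = sum(w * v.get(k, 0.0) for k, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # Treat an empty vector as similar to nothing.
    return dot / (norm_u * norm_v)


def rank(query, doc_vectors):
    """Return (doc_index, score) pairs, most similar first."""
    scores = [(i, cosine(query, d)) for i, d in enumerate(doc_vectors)]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Made-up document vectors and query.
doc_vectors = [
    {"cat": 1.0, "fish": 1.0},
    {"dog": 1.0, "bone": 1.0},
    {"cat": 2.0},
]
ranking = rank({"cat": 1.0}, doc_vectors)
```

Because cosine similarity ignores vector length, the third document (only "cat") ranks above the first even though both mention the query term.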

#####Distributed Computing

#####Similarity Server

###Super Short Feedback Survey (Pretty please!)