Vectorize word2vec.predict_output_word for speed #3153

Merged: 9 commits, Jul 19, 2021

Commits on May 18, 2021

  1. [Fix] gensim/models/word2vec.py: in method predict_output_word, changed a call to sum to numpy.sum to gain performance.

    Mathis committed May 18, 2021
    6a94492
  2. [Feat] gensim.models.word2vec.Word2Vec.predict_output_word: added possibility for the user to input a list of word indices as parameter 'context' instead of a list of words.

    Mathis committed May 18, 2021
    91d5dca

Commits on May 25, 2021

  1. Word2Vec.predict_output_word: Changed handling of ints and strs, trying to make it more compact and versatile.

    M-Demay authored and Mathis committed May 25, 2021
    84258b4

Commits on May 26, 2021

  1. Fixed docstring of predict_output_word.

    Mathis committed May 26, 2021
    47a7fe5

Commits on Jun 15, 2021

  1. Simplified predict_output_word changes.

    * Retained the suggested `sum`->`np.sum`
      replacement, which has been tested to
      yield significant runtime gains.
    * Dropped unnecessary type/value checks
      that are already run when calling the
      `KeyedVectors.__isin__` dunder method.
    * Corrected the docstring to accurately
      document the supported inputs (which
      were already compatible prior to the
      PR this commit is a part of).
    pandrey-fr committed Jun 15, 2021
    ba00d98
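
The `sum`->`np.sum` replacement retained in this commit can be illustrated with a minimal sketch. This is not gensim's actual code: the `vectors` matrix and `context_indices` list below are hypothetical stand-ins for a model's word vectors and a context given as indices. The built-in `sum` adds the rows one at a time in a Python loop, while `numpy.sum` with `axis=0` performs a single vectorized reduction; both should give the same result up to floating-point rounding.

```python
import numpy as np

# Hypothetical stand-in for a word-vector matrix: 1000 words, 100 dimensions.
rng = np.random.default_rng(0)
vectors = rng.standard_normal((1000, 100)).astype(np.float32)

# Hypothetical context, already expressed as word indices.
context_indices = [3, 17, 256, 999]
context_vectors = vectors[context_indices]  # shape (4, 100)

# Built-in sum: iterates over the rows in Python, adding one array at a time.
slow = sum(context_vectors)

# numpy.sum: one vectorized reduction along the first axis.
fast = np.sum(context_vectors, axis=0)

same_shape = slow.shape == fast.shape
close = np.allclose(slow, fast, atol=1e-5)
```

The gap between the two approaches would be expected to grow with the number of context rows, since the built-in version pays Python-level overhead per row.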

Commits on Jul 1, 2021

  1. Added tests for gensim.Word2Vec.predict_output_word() when context contains ints.

    Mathis Demay committed Jul 1, 2021
    679a086
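
As a rough illustration of what a context containing ints means, the sketch below resolves a mixed list of words and indices to indices, skipping anything that is not in the vocabulary. `TinyVocab` and `context_to_indices` are hypothetical names invented for this example; they only approximate the kind of lookup involved and are not gensim's `KeyedVectors` implementation.

```python
class TinyVocab:
    """Hypothetical stand-in for a vocabulary; not gensim's KeyedVectors."""

    def __init__(self, words):
        self.index_to_key = list(words)
        self.key_to_index = {w: i for i, w in enumerate(self.index_to_key)}

    def get_index(self, key):
        """Return the index of a word, or the key itself if it is a valid index; -1 otherwise."""
        if isinstance(key, str):
            return self.key_to_index.get(key, -1)
        if isinstance(key, int) and 0 <= key < len(self.index_to_key):
            return key
        return -1

    def __contains__(self, key):
        # A key is "in" the vocabulary if it resolves to a valid index.
        return self.get_index(key) >= 0


def context_to_indices(vocab, context):
    """Map a mixed list of words and ints to indices, dropping unknown keys."""
    return [vocab.get_index(key) for key in context if key in vocab]


vocab = TinyVocab(["the", "quick", "brown", "fox"])
indices = context_to_indices(vocab, ["quick", 2, "unicorn", 10, "fox"])
print(indices)  # [1, 2, 3]
```

Here "unicorn" is dropped as an unknown word and 10 as an out-of-range index, so a test of int contexts could check that words and their corresponding indices lead to the same resolved context.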

Commits on Jul 19, 2021

  1. Update CHANGELOG.md

    mpenkov authored Jul 19, 2021
    ed8ec36
  2. bb9c2a6
  3. update sbt install step

    mpenkov authored Jul 19, 2021
    04cc137