Slides and Jupyter Notebook for PyData Vancouver talk (25-02-2020)

marioga/spaCy-talk-demo

spaCy-talk-demo

In this talk, we first review spaCy's basic usage:

  • Loading models for different languages.
  • Identifying special tokens: punctuation, stopwords, etc.
  • Computing part-of-speech (POS) tags, dependency parses, and named entities (NER).
  • Visualizing dependencies and named entities, along with tag explanations.
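
As a rough illustration of the first two items, here is a minimal sketch that is not taken from the talk's notebook. It assumes only that spaCy is installed and uses blank pipelines (`spacy.blank`), so no trained model download is needed. The example sentences are made up. POS tags, dependency parses and named entities require a trained pipeline, so they appear only in comments.

```python
import spacy

# Blank pipelines contain only the language-specific tokenizer and
# lexical attributes (stop words, punctuation flags, and so on).
nlp_en = spacy.blank("en")
nlp_de = spacy.blank("de")

doc_en = nlp_en("Hello, world! This is spaCy.")
doc_de = nlp_de("Das ist ein Satz.")

# Special tokens are exposed as boolean attributes on each token.
punct = [t.text for t in doc_en if t.is_punct]
stops = [t.text for t in doc_en if t.is_stop]
print("punctuation:", punct)
print("stop words:", stops)
print("German tokens:", [t.text for t in doc_de])

# spacy.explain maps a tag or label to a short description, or None
# if the term is unknown. It works without any trained model.
nn_description = spacy.explain("NN")
print("NN:", nn_description)

# With a trained pipeline installed (assumption: en_core_web_sm),
# token.pos_, token.dep_ and doc.ents are filled in as well:
#   nlp = spacy.load("en_core_web_sm")
#   doc = nlp("Apple is looking at buying a U.K. startup.")
#   [(t.text, t.pos_, t.dep_) for t in doc]
#   [(ent.text, ent.label_) for ent in doc.ents]
```

In a notebook, `spacy.displacy.render(doc, style="dep")` or `style="ent"` draws the dependency tree or the entity highlights for a document processed by a trained pipeline.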

In the second half, we take a stab at customizing spaCy pipelines to solve two tasks:

  • Create a tokenizer that does not split hyphenated words.
  • Create a simple noun chunker that identifies only consecutive nouns, optionally preceded by a single adjective.
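
One possible way to approach both tasks is sketched below. It is not necessarily the implementation shown in the talk. The tokenizer part follows the infix-customization recipe from the spaCy documentation: the default infix rules are rebuilt without the rule that splits on hyphens between letters. The chunker `simple_noun_chunks` is a hypothetical helper name, and the POS tags in the demo are assigned by hand so that the example runs without a trained model.

```python
import spacy
from spacy.lang.char_classes import (
    ALPHA, ALPHA_LOWER, ALPHA_UPPER, CONCAT_QUOTES, LIST_ELLIPSES, LIST_ICONS,
)
from spacy.tokens import Doc
from spacy.util import compile_infix_regex

nlp = spacy.blank("en")

# Task 1: keep hyphenated words together.
# Infix rules modelled on spaCy's defaults, minus the hyphen rule.
infixes = (
    LIST_ELLIPSES
    + LIST_ICONS
    + [
        r"(?<=[0-9])[+\-\*^](?=[0-9-])",
        r"(?<=[{al}{q}])\.(?=[{au}{q}])".format(
            al=ALPHA_LOWER, au=ALPHA_UPPER, q=CONCAT_QUOTES
        ),
        r"(?<=[{a}]),(?=[{a}])".format(a=ALPHA),
        # The default rule splitting on hyphens between letters is omitted here.
        r"(?<=[{a}0-9])[:<>=/](?=[{a}])".format(a=ALPHA),
    ]
)
nlp.tokenizer.infix_finditer = compile_infix_regex(infixes).finditer

hyphen_tokens = [t.text for t in nlp("a well-known mother-in-law")]
print(hyphen_tokens)


# Task 2: runs of consecutive nouns, optionally preceded by one adjective.
def simple_noun_chunks(doc):
    """Return spans of consecutive NOUN tokens, each extended to the left
    by at most one directly preceding ADJ token."""
    chunks = []
    i = 0
    while i < len(doc):
        if doc[i].pos_ != "NOUN":
            i += 1
            continue
        start = i
        while i < len(doc) and doc[i].pos_ == "NOUN":
            i += 1
        if start > 0 and doc[start - 1].pos_ == "ADJ":
            start -= 1  # include a single adjective only
        chunks.append(doc[start:i])
    return chunks


# Hand-tagged demo document (assumption: a trained tagger would normally
# supply these coarse-grained POS tags).
words = ["The", "big", "red", "coffee", "machine", "makes", "noise"]
tags = ["DET", "ADJ", "ADJ", "NOUN", "NOUN", "VERB", "NOUN"]
doc = Doc(nlp.vocab, words=words)
for token, tag in zip(doc, tags):
    token.pos_ = tag

chunk_texts = [span.text for span in simple_noun_chunks(doc)]
print(chunk_texts)
```

In the demo, only the adjective immediately before the noun run is kept, so "big" is left out of the first chunk. A plain loop is used here rather than spaCy's `Matcher` to keep the sketch independent of the spaCy version.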
