You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
can't and won't get tokenized as ca n't and wo n't respectively.
In previous versions of spacy, the lemma of ca and wo was ca and wo.
Sicne the recent update, ca now is correctly lemmatized as can, but wo is still wo when it should be will.
Edit:
Also just noticed the lemma of sha in sha n't is not converted to shall.
Just fixed this on master and it should work now – the tokenizer exceptions for the contractions were missing a TAG, so the lemma was overwritten by the lemmatizer.
can't
andwon't
get tokenized asca n't
andwo n't
respectively.In previous versions of spacy, the lemma of
ca
andwo
wasca
andwo
.Sicne the recent update,
ca
now is correctly lemmatized ascan
, butwo
is stillwo
when it should bewill
.Edit:
Also just noticed the lemma of
sha
insha n't
is not converted toshall
.Your Environment
The text was updated successfully, but these errors were encountered: