Skip to content

Releases: amir-zeldes/RFTokenizer

V2.2.0

01 Feb 21:26
d17d79f
Compare
Choose a tag to compare

Compatibility update for torch 2.1. Now supports newer versions (for neural models for Hebrew):

scikit-learn==1.3.2
joblib==1.3.2
pandas==2.1.2
xgboost==2.0.3
flair==0.13.0

V2.0.0 - add transformer based features

19 Aug 20:27
2830897
Compare
Choose a tag to compare
Pre-release
  • add transformer based features

V1.1.0 - sklearn 0.23 and joblib

23 May 18:57
2a2a934
Compare
Choose a tag to compare

Compatibility update for sklearn 0.23 + joblib

V1.0.1 - Arabic and setup for PyPI

01 Aug 15:35
5d3aa87
Compare
Choose a tag to compare
  • Added Arabic model
  • Re-arranged folder setup for PyPI deployment

V1.0.0 - new and improved classifier

17 Jul 18:47
8e4a602
Compare
Choose a tag to compare
  • Single file implementation, no classes for multilabel encoder etc.
  • Switch classifier to xgboost
  • Add interface for getting probabilities instead of predictions
  • Add hyperparameter optimization code
  • Various minor improvements

V0.9.0 - first release of RFTokenizer

17 Sep 21:01
1b1215b
Compare
Choose a tag to compare

For more details see:

Zeldes, Amir (2018) A Characterwise Windowed Approach to Hebrew Morphological Segmentation. In: Proceedings of the 15th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology. Brussels, Belgium.

https://arxiv.org/abs/1808.07214