This repository contains the POS-tagged (CTSized) texts of the Ancient Greek Literature contained in the following releases:
- https://github.com/PerseusDL/canonical-greekLit/releases/tag/0.0.236
- https://github.com/OpenGreekAndLatin/First1KGreek/releases/tag/1.1.1802
The POS-tagging has been generated (completely) automatically by using the MATE tagger, which has been trained on the Perseus treebank data:
The tagger achieved an accuracy of 88%. More details can be found in the article:
- Celano, Giuseppe G. A, Gregory Crane, Saeed Majidi. 2016. Part of Speech Tagging for Ancient Greek. Open Linguistics 2:393–399. https://doi.org/10.1515/opli-2016-0020
Release 1.2.0:
- New Texts have been added. Tokenization now detects capital letter abbreviations such as ΑΠΟΛ.
Release 1.1.0:
- Correction to the cts-urn structure by considering the elements seg and p (currently div, seg, p, and l are considered)
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.