Skip to content

Latest commit

 

History

History
23 lines (15 loc) · 1.4 KB

README.md

File metadata and controls

23 lines (15 loc) · 1.4 KB

Middle High German Scansion

This repo contains the most recent efforts in a model for MHG epic meter scansion. It builds off of the repo CLFL 2016, in which a CRF model was employed. Current improvements include:

  • Restriction to 4 stresses per line, 1 stress per foot
  • Secondary stress is not longer predicted, but algorithmically determined
  • A GUI has been created in tkinter

NAACL-CLFL 2016

Title: Supervised Machine Learning for Hybrid Meter
Authors: Alex Estes and Christopher Hench

Abstract:
Following classical antiquity, European poetic meter was complicated by traditions negotiating between the prosodic stress of vernacular dialects and a classical system based on syllable length. Middle High German (MHG) epic poetry found a solution in a hybrid qualitative and quantitative meter. We develop a CRF model to predict the metrical values of syllables in MHG epic verse, achieving an F-score of .894 on 10-fold cross-validated development data (outperforming several baselines) and .904 on held-out testing data. The method used in this paper presents itself as a viable option for other literary traditions, and as a tool for subsequent genre or author analysis.

Data and source code for paper

Dependencies:

  • sklearn (pip install scikit-learn)
  • nltk (pip install nltk)
  • pycrfsuite (pip install python-crfsuite)