By: Michael O'Brien
Goal: To use computational methods to explore the composition of Russian textbook glossaries: tokenizing, part-of-speech tagging, and aggregating the results
Data source: Plain text versions of Kudyma textbooks
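
The three steps named in the goal (tokenizing, part-of-speech tagging, aggregating) can be sketched roughly as below. This is only an illustration, not the code in Processed_Data.ipynb: the sample glossary lines, the function names, and the suffix-based `guess_pos` heuristic are all assumptions made up for the example. The heuristic is deliberately crude (it would mislabel a noun like "мать" as a verb), and a real morphological tagger would take its place in practice.

```python
import re
from collections import Counter

# Hypothetical sample glossary lines (not taken from the textbook data).
SAMPLE_GLOSSARY = """\
читать - to read
новый - new
книга - book
говорить - to speak
большой - big
студент - student
"""

# Matches Cyrillic words, allowing internal hyphens (e.g. "по-русски").
CYRILLIC_WORD = re.compile(r"[а-яё]+(?:-[а-яё]+)*", re.IGNORECASE)


def tokenize(text):
    """Return lowercase Cyrillic word tokens, skipping the English glosses."""
    return [m.group(0).lower() for m in CYRILLIC_WORD.finditer(text)]


def guess_pos(token):
    """Toy suffix heuristic for dictionary forms; a stand-in for a real tagger."""
    if token.endswith(("ть", "ти", "чь", "ться", "тись")):
        return "VERB"
    if token.endswith(("ый", "ий", "ой")):
        return "ADJ"
    return "NOUN"


def pos_counts(text):
    """Aggregate the guessed part-of-speech tags into a frequency table."""
    return Counter(guess_pos(token) for token in tokenize(text))


counts = pos_counts(SAMPLE_GLOSSARY)
for tag, n in sorted(counts.items()):
    print(tag, n)
```

Keeping the tagger behind a single function means the toy heuristic can be swapped out without touching the tokenizing or aggregation steps.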
- final_report.md is my final report wrap-up.
- Processed_Data.ipynb is my code that I use to explore the data. The same is available here through Jupyter's nbviewer.
- textbook_vocab_data/ is the folder where you can find the data files.
- images/ is the folder containing plots as .png files.
- LICENSE.md contains licensing information.
- Project_Plan.md was my initial project plan.
- Progress_Reports.md contains progress logs throughout this semester.
My guestbook is available here. You're more than welcome to check it out and leave advice on how this project could be improved in the future!