Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HindiMonoCorp Extraction #2

Open
3 tasks
djokester opened this issue Jun 23, 2018 · 0 comments
Open
3 tasks

HindiMonoCorp Extraction #2

djokester opened this issue Jun 23, 2018 · 0 comments
Assignees
Labels
GSSoC Issues relating to GirlScript Summer of Code 2018 Intermediate The intermediate difficulty level issues for GirlScript Summer of Code. Pro The harder difficulty level issues for GirlScript Summer of Code.

Comments

@djokester
Copy link
Member

djokester commented Jun 23, 2018

Extraction of HindiMonoCorp as based on issue SangitaNLP/sangita#8
Tasks include:

  • Extraction of linguistic features in (word, tag) tuples

  • Storing them in usable and importable formats.

  • Identifying the POS tagset used and converting them to the Penn Treebank tagset.

@djokester djokester added Intermediate The intermediate difficulty level issues for GirlScript Summer of Code. Pro The harder difficulty level issues for GirlScript Summer of Code. GSSoC Issues relating to GirlScript Summer of Code 2018 labels Jun 23, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
GSSoC Issues relating to GirlScript Summer of Code 2018 Intermediate The intermediate difficulty level issues for GirlScript Summer of Code. Pro The harder difficulty level issues for GirlScript Summer of Code.
Projects
None yet
Development

No branches or pull requests

2 participants