Skip to content

Caroline Gish's final course project on Universal Decompositional Semantics for Data Science for Linguists 2022 | University of Pittsburgh

License

GPL-3.0, Unknown licenses found

Licenses found

GPL-3.0
LICENSE.md
Unknown
LICENSE-cc.md
Notifications You must be signed in to change notification settings

Data-Science-for-Linguists-2022/UDS-child-speech

Repository files navigation

License: GPL v3 License: CC BY-NC-SA 3.0

Universal Decompositional Semantics (UDS) and Child Speech

Hello and welcome!

Caroline Gish | [email protected]


Overview

This project was undertaken by Caroline Gish for the course project in the Data Science for Linguists 2022 course.

The overall goal of this project was to see how the UDS semantic annotation framework, a novel framework claimed to have better coverage for non-prototypical instances, was equipped to handle child speech that may contain nonprototypical instances dissimilar from the UDS training sentences. My personal goals in undeertaking this project were to gain experience with a massive dataset, that comes with its own library, designed specifically for semantic research.

To read what my classmates had to say about my project during the semester, be sure to visit my project guestbook!

Data were sourced from the both the Decomp repository of the Decompositional Semantics Initiative and the CHILDES child language component of the TalkBank system.

Directory

Main repository files

  • final_report.md is the final write-up for my project
  • README.md is what you are currently reading! It contains an overview of my project, links to all files, and information on the licensing and works cited.
  • presentation_gish.pdf is a PDF copy of my presentation slides for the presentation I gave in the 2022 Data Science for Linguists class. These slides do not contain any of my notes, so please feel free to contact me for more information about them!
  • progress_report.md contains three different progress reports each detailing my project progress over the course of the semester.
  • project_plan.md is my initital project plan that I proposed at the beginning of the semester.
  • LICENSE.md is the license for the code elements of the repository.
  • LICENSE-cc.md is the license for the non-code elements of the repository.

Subdirectories

Licenses

Citations

The Universal Decompositional Semantics Dataset and Decomp Toolkit (White et al., LREC 2020)

ACL version: Aaron Steven White, Elias Stengel-Eskin, Siddharth Vashishtha, Venkata Subrahmanyan Govindarajan, Dee Ann Reisinger, Tim Vieira, Keisuke Sakaguchi, Sheng Zhang, Francis Ferraro, Rachel Rudinger, Kyle Rawlins, and Benjamin Van Durme. 2020. The Universal Decompositional Semantics Dataset and Decomp Toolkit. In Proceedings of the 12th Language Resources and Evaluation Conference, pages 5698–5707, Marseille, France. European Language Resources Association.

Hicks, D. (1990). Kinds of texts: Narrative genre skills among children from two communities. In A. McCabe (Ed.), Developing narrative structure. Hillsdale, NJ: Erlbaum.

Additional references that go along with Hicks, D. (1990) include:

Berman, R. A. and D. I. Slobin (1994). Relating events in narrative: A crosslinguistic de-velopmental study. Hillsdale, NJ, Lawrence Erlbaum Associates.

Heath, S. (1983). Ways with words: Language, life and work in communities and classrooms. Cambridge, Cambridge University Press.

Quirk, R., S. Greenbaum, et al. (1972). A grammar of contemporary English. London, Longman.

About

Caroline Gish's final course project on Universal Decompositional Semantics for Data Science for Linguists 2022 | University of Pittsburgh

Resources

License

GPL-3.0, Unknown licenses found

Licenses found

GPL-3.0
LICENSE.md
Unknown
LICENSE-cc.md

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published