Skip to content

markPranav/WhoSaidIt

Repository files navigation

WhoSaidIt

Multi voiced audio generator with quote attribution

This project is under development, corerference resolution part has been implemented and can be executed by running

cd coref_resolution && python main.py --filepath ./data/{file_name} --output_dir ./data

quote attribution has also been implemeted, now the pipeline sequence of coreference resolution + quote attribution can be executed with

python main.py --filepath ./data/{filepath} --output_dir ./out

Voice synthesis can be executed standalone as

cd voice_synthesis && python main.py --filepath ../out/quote_attr/{quotes filepath}--coref_path ../out/coref/{coref filepath} --output_dir ./out/voice


References

The coreference resolution module has been heavily inspired from the paper "Incremental Neural Coreference Resolution in Constant Memory" by Xia, Patrick and his repo.

@inproceedings{xia-etal-2020-incremental,
    title = "Incremental Neural Coreference Resolution in Constant Memory",
    author = "Xia, Patrick  and
      Sedoc, Jo{\~a}o  and
      Van Durme, Benjamin",
    booktitle = "Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)",
    year = "2020",
    url = "https://aclanthology.org/2020.emnlp-main.695",
    doi = "10.18653/v1/2020.emnlp-main.695",
}

And the entire pipline for quote attribution has been inspired from fanfiction-nlp

Michael Miller Yoder, Sopan Khosla, Qinlan Shen, Aakanksha Naik, Huiming Jin, Hariharan Muralidharan, and Carolyn P Rosé. 2021. FanfictionNLP: A Text Processing Pipeline for Fanfiction. In Proceedings of the 3rd Workshop on Narrative Understanding, pages 13–23.

About

Multi voiced audio generator with quote attribution

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages