This repository contains human Pyramid annotations for the TAC 2014 scientific summarization dataset [1].
The annotations mark the main information units (nuggets) in each of the human-written summaries. Each file name consists of the TAC "topic_id" and the "annotator_id".
Each annotation line has the following format:
"nugget_id" <tab> "begin_span" <space> "end_span" <tab> "nugget_text"
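The format above can be read with a few lines of Python. The following is a minimal sketch, not part of the original repository: the names Nugget, parse_annotation_line and parse_annotations are hypothetical, and it assumes that the quotation marks in the format description are not literal, that "begin_span" and "end_span" are integers, and that each annotation occupies exactly one line.

```python
from typing import Iterable, List, NamedTuple


class Nugget(NamedTuple):
    """One annotated information unit (hypothetical container type)."""
    nugget_id: str   # kept as a string; the ID scheme is not specified
    begin_span: int  # assumed to be an integer offset
    end_span: int    # assumed to be an integer offset
    text: str


def parse_annotation_line(line: str) -> Nugget:
    """Parse: nugget_id <tab> begin_span <space> end_span <tab> nugget_text."""
    # Split on the first two tabs only, so any tab inside the text is preserved.
    nugget_id, span, text = line.rstrip("\r\n").split("\t", 2)
    # The two span values are separated by whitespace.
    begin, end = span.split()
    return Nugget(nugget_id, int(begin), int(end), text)


def parse_annotations(lines: Iterable[str]) -> List[Nugget]:
    """Parse every non-blank line of an annotation file."""
    return [parse_annotation_line(line) for line in lines if line.strip()]
```

Passing an open file object to parse_annotations yields one Nugget per annotation line. A line that does not match the format raises ValueError, which makes malformed files easy to spot.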
Main paper to be cited:
[1] A. Cohan and N. Goharian, "Revisiting Summarization Evaluation for Scientific Papers", In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), May 2016.