Scripts for XML -> Python object -> YAML conversion (re #129) #133
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR adds a script that converts the authoritative XML to YAML format, in preparation for using these files as a data source in the static rewrite of the site.
anthology.py
contains classes that read in the XML and represent it as simple Python objects. They might potentially be useful for further checks or conversions as well.xml_to_yaml.py
outputs one YAML file per XML file with the paper infos, and additionally compiles an author index in a separate YAML file. This is done to facilitate the generation of the static pages, and might be extended with other auxiliary files in the future. Conflation of name variants could, in principle, also be done at this point, depending on how exactly we decide to handle them ( Authors being stored under multiple spellings #86 ).I will continue to add to and/or modify these scripts during the static rewrite, according to the needs of the page generation.