Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add institution metadata #2128

Open
mjpost opened this issue Sep 9, 2022 · 3 comments · May be fixed by #2325
Open

Add institution metadata #2128

mjpost opened this issue Sep 9, 2022 · 3 comments · May be fixed by #2325
Assignees
Milestone

Comments

@mjpost
Copy link
Member

mjpost commented Sep 9, 2022

I had a discussion with someone from Google Scholar, who suggested that we add institution metadata to the HTML pages, e.g.,

<meta content="Post, Matt" name=citation_author>
<meta content="Microsoft" name="citation_author_institution">
<!-- ideally, we'd also add this -→
<meta content="0000-0002-1297-6794" name="citation_author_orcid">

This will help with resolving ambiguity.

We should do this by

  • Adding an <affiliation> tag to the author block in the XML
  • Exposing this information via the Hugo templates

Secondly, we should format author values as "Last name, first name", a format Google understands.

Finally: it would be nice to line-format the metadata in the HTML (it's currently all printed on a single line).

@bdgdevelopment
Copy link

For the time being, is there a way to get author's affiliation data without opening the pdf file?

@akoehn
Copy link
Member

akoehn commented Nov 8, 2022

No -- it is simply not in the metadata that we have.

@mjpost
Copy link
Member Author

mjpost commented Nov 8, 2022

That said, we could start adding this. We get this information from aclpub2 which is used since last year for ingestion of main *ACL conferences. We'll add this for EMNLP'22 and look into revisiting the old imports.

@mjpost mjpost added this to the 2022Q2 milestone Nov 8, 2022
@mjpost mjpost linked a pull request Dec 26, 2022 that will close this issue
@mjpost mjpost modified the milestones: 2022Q2, 2023Q3 Jul 13, 2023
@mjpost mjpost modified the milestones: 2023Q3, 2024Q1 Jan 23, 2024
@mjpost mjpost modified the milestones: 2024Q1, 2024Q2 May 11, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants