Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Investigate/build support for separate text file #1446

Open
5 tasks
mbakeryo opened this issue Dec 20, 2017 · 1 comment
Open
5 tasks

Investigate/build support for separate text file #1446

mbakeryo opened this issue Dec 20, 2017 · 1 comment

Comments

@mbakeryo
Copy link

mbakeryo commented Dec 20, 2017

As a publisher, I may provide a separate text file that is more accessible than the original file. E.g. I provide PDFs that are made up of page scans and bundled into a PDF that cannot be made accessible. The generated OCR derived from the PDF is not accessible.

I need to be able to upload a separate, cleaned up/rekeyed and accessible text file to the platform.

  • This file would be associated with the original PDF file (not replacing it).

  • This file would be downloadable.

Questions

  • Need to determine if the download button should say something different than Download OCR Text.

  • Need to determine if the disclaimer message would appear (see ticket Add disclaimer for OCR text/accessibility #1429).

  • Is it possible to include this text in the PDF that could then be part of the auto-generated OCR text?

For Turner, we need to get the rekeyed or better OCR-generated (through Prime OCR) file.

@mbakeryo
Copy link
Author

We are waiting to get the rekeyed text file. The ability to replace the Extracted File text file should be ready(ish).

@mbakeryo mbakeryo modified the milestones: Winter, Beyond Grant Feb 15, 2018
@mbakeryo mbakeryo removed the blocked label Apr 30, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant