Unicode detection error #10247
Comments
In order to help with this, we need an example PDF file. Could you create one that contains these symbols and attach it to this issue? Also, what do you mean by "detecting symbols"? Is the rendering not working correctly, are the symbols missing from the text layer, et cetera?
Please find attached a PDF containing some of these symbols. By "detecting symbols", I mean the following: if I render this PDF in the Firefox browser, where pdf.js is used for rendering, and inspect the elements, I do not find any of these symbols in the HTML. The visual rendering on the web is perfect, but when I try to access the symbols in the HTML, I do not find them as UTF-8 text. I also wanted to attach the HTML file, but GitHub does not allow me to do so. Let me know if there is a way I can do that. Please let me know whether this clarifies your doubt.
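A check like the one described above could be automated roughly as follows. This is only a minimal sketch: `checkSymbols` and `SymbolReport` are hypothetical names, not part of PDF.js, and the input is assumed to be the strings collected by hand from the text-layer elements. It reports which expected symbols are absent and which characters fall in the Private Use Area of the Basic Multilingual Plane (U+E000 to U+F8FF), which can indicate a glyph that has no usable Unicode mapping.

```typescript
// Hypothetical helper for inspecting text-layer strings; not a PDF.js API.
interface SymbolReport {
  missing: string[];    // expected symbols that never appear in the text
  privateUse: string[]; // distinct BMP Private Use Area characters found
}

function checkSymbols(textItems: string[], expected: string[]): SymbolReport {
  const text = textItems.join("");

  // An expected symbol counts as missing if it occurs nowhere in the text.
  const missing = expected.filter((symbol) => text.indexOf(symbol) === -1);

  // Collect each distinct character in the range U+E000..U+F8FF once.
  const privateUse: string[] = [];
  for (let i = 0; i < text.length; i++) {
    const code = text.charCodeAt(i);
    const ch = text.charAt(i);
    if (code >= 0xe000 && code <= 0xf8ff && privateUse.indexOf(ch) === -1) {
      privateUse.push(ch);
    }
  }
  return { missing, privateUse };
}

// Made-up sample data: the text contains U+2208 but not U+2200.
const report = checkSymbols(["x \u2208 A", "\uE001"], ["\u2208", "\u2200"]);
console.log(report.missing.length, report.privateUse.length);
```

A non-empty `missing` list only says that the symbol is not in the extracted text; it does not say whether the cause lies in the PDF file or in the viewer.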
There isn't any This is a bug in the PDF file itself, and please note that the PDF.js library is already doing a better job here than some other PDF viewers (since the text is mostly copyable).
I completely agree with you. What I am interested in is how pdf.js extracts the information for the other symbols, and why this is not possible for these few symbols. We are interested in this because we want to make mathematical equations accessible to screen readers. We would be happy if you could help us understand how pdf.js is able to extract the other symbols' information.
For standard glyphs, these maps are used to provide a reasonable fallback for missing
@timvandermeij All in all, given #10247 (comment), this probably ought to be closed as INVALID/WONTFIX.
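The fallback idea mentioned above can be illustrated with a simplified sketch. It is not PDF.js's actual code: `FontInfo`, `toUnicodeFor`, and `GLYPH_NAME_TO_UNICODE` are hypothetical names, and the table is a tiny excerpt in the spirit of the Adobe Glyph List. The assumption is that an explicit per-font mapping is preferred when present, and that a lookup by standard glyph name is tried otherwise.

```typescript
// Tiny hypothetical excerpt of a glyph-name table (names as in the Adobe Glyph List).
const GLYPH_NAME_TO_UNICODE: { [name: string]: number } = {
  alpha: 0x03b1,
  summation: 0x2211,
  integral: 0x222b,
  element: 0x2208,
  universal: 0x2200,
};

// Hypothetical, simplified view of a font; not a PDF.js type.
interface FontInfo {
  toUnicode?: { [charCode: number]: string }; // explicit mapping, may be absent
  glyphNames: { [charCode: number]: string }; // encoding: char code -> glyph name
}

function toUnicodeFor(font: FontInfo, charCode: number): string | null {
  // 1. Prefer the explicit mapping supplied with the font, if any.
  const mapped = font.toUnicode ? font.toUnicode[charCode] : undefined;
  if (mapped !== undefined) {
    return mapped;
  }
  // 2. Otherwise fall back to a lookup by standard glyph name.
  const name = font.glyphNames[charCode];
  if (
    name !== undefined &&
    Object.prototype.hasOwnProperty.call(GLYPH_NAME_TO_UNICODE, name)
  ) {
    return String.fromCharCode(GLYPH_NAME_TO_UNICODE[name] as number);
  }
  // 3. A non-standard name (made up here as "g123") yields nothing usable.
  return null;
}

const exampleFont: FontInfo = { glyphNames: { 1: "summation", 2: "g123" } };
console.log(toUnicodeFor(exampleFont, 1), toUnicodeFor(exampleFont, 2));
```

Under this sketch, a symbol whose font provides neither an explicit mapping nor a standard glyph name cannot be resolved, which would be consistent with only a few symbols being affected.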
We are trying to detect symbols in a LaTeX-generated PDF file. PDF.js works very well for most symbols, but has problems with the following symbols