Skip to content
This repository has been archived by the owner on Aug 4, 2023. It is now read-only.

emoji unicode error #5

Open
cole-dda opened this issue May 3, 2023 · 0 comments
Open

emoji unicode error #5

cole-dda opened this issue May 3, 2023 · 0 comments

Comments

@cole-dda
Copy link

cole-dda commented May 3, 2023

when pdf include emoji,such as:😄
unicode=0x1f604

when use ms word to generate pdf
screenshot_5883

the unicode include space

https://pdfium.googlesource.com/pdfium/+/refs/heads/main/core/fpdfapi/font/cpdf_tounicodemap.cpp
screenshot_5884

when space is break,so get unicode=0xd83d

but right is =[d8,3d,de,04], then [d8,3d,de,04].decode('utf-16-be') => '😄'

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant