Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

improved ExtractText(3) #969

Merged
merged 45 commits into from
Jun 13, 2022
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
45 commits
Select commit Hold shift + click to select a range
48421df
Relative import
pubpub-zz Jun 10, 2022
c7829d8
improve TextExtraction
pubpub-zz Jun 10, 2022
7a9c22c
Extend testing
pubpub-zz Jun 10, 2022
b0a7736
improve readability of BooleanObjects
pubpub-zz Jun 10, 2022
d7f84d0
Apply Black
pubpub-zz Jun 10, 2022
59504ec
fix early mypy
pubpub-zz Jun 10, 2022
58bd0e5
fix mypy2
pubpub-zz Jun 10, 2022
941461a
attempt fix iss with test_utils
pubpub-zz Jun 10, 2022
e4c37cb
Merge branch 'main' into ExtractText
pubpub-zz Jun 10, 2022
39e94f9
Minor flake8 fix
MartinThoma Jun 10, 2022
9763868
Adjust mypy types
MartinThoma Jun 10, 2022
53294f2
Merge branch 'pubpub-zz-ExtractText' into origin/ExtractText
pubpub-zz Jun 10, 2022
744464f
revert in test_utils
pubpub-zz Jun 10, 2022
0ed4d9a
paste error
pubpub-zz Jun 10, 2022
5b96216
flake 8
pubpub-zz Jun 10, 2022
b2830e9
flake8
pubpub-zz Jun 10, 2022
1223d75
Add 'test_previous_line' back
MartinThoma Jun 11, 2022
f5c6406
Merge remote-tracking branch 'py-pdf/pubpub-zz-ExtractText' into Extr…
pubpub-zz Jun 11, 2022
284519b
fix iss with test_util
pubpub-zz Jun 11, 2022
6dd00e2
Update tests/test_workflows.py
pubpub-zz Jun 11, 2022
921d396
flake8
pubpub-zz Jun 11, 2022
ee06f4f
flake8
pubpub-zz Jun 11, 2022
d90109d
flake8
pubpub-zz Jun 11, 2022
8ea99ef
ROB : extract text of empty page
pubpub-zz Jun 11, 2022
e5948fc
ROB : cope with missing cr in cmap
pubpub-zz Jun 11, 2022
0bd3f04
Fix intrepretation for /Symbol and /Zapfdingbats
pubpub-zz Jun 11, 2022
e34b4a9
default encoding for missing Fonts
pubpub-zz Jun 11, 2022
932e103
TD operator forgotten
pubpub-zz Jun 11, 2022
2b423d8
Remove Tm scaling for lf detection
pubpub-zz Jun 11, 2022
b7ebc55
mix betwen str and bytes
pubpub-zz Jun 11, 2022
249d812
default encodings fix
pubpub-zz Jun 11, 2022
7e3ce51
extend tests
pubpub-zz Jun 11, 2022
114fe68
fix Flake8 and mypy
pubpub-zz Jun 11, 2022
ee8d4b6
replace test files with links
pubpub-zz Jun 11, 2022
9768d5f
Fix xform in xfoms inducing loop (#966)
pubpub-zz Jun 11, 2022
e9fd89c
ROB : cope with \r crlf
pubpub-zz Jun 12, 2022
e60a1fa
FIX : Incorrect decoding of Zapfdingbats
pubpub-zz Jun 12, 2022
2146056
ROB: cope with both /Encoding and /ToUnicode
pubpub-zz Jun 12, 2022
cd26abc
flake8
pubpub-zz Jun 12, 2022
534a8bb
fix Encoding / ToUnicode at the same time
pubpub-zz Jun 12, 2022
d92597a
Merge branch 'main' into ExtractText
MartinThoma Jun 13, 2022
0ba91aa
Apply suggestions from code review
MartinThoma Jun 13, 2022
88f1298
typo
pubpub-zz Jun 13, 2022
de7ddc0
typoUpdate PyPDF2/_cmap.py
pubpub-zz Jun 13, 2022
2aea3e9
fix 'utf-16-be' codec can't decode bytes in position 0-1: unexpected…
pubpub-zz Jun 13, 2022
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion PyPDF2/__init__.py
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
from ._merger import PdfFileMerger, PdfMerger
from ._page import Transformation, PageObject
from ._page import PageObject, Transformation
from ._reader import DocumentInformation, PdfFileReader, PdfReader
from ._version import __version__
from ._writer import PdfFileWriter, PdfWriter
Expand Down
Loading