PDFplumber extract_tables() recognizes two cells when 1 cell contains multiple lines #1219
bloktom
started this conversation in
Ask for help with specific PDFs
Replies: 1 comment 4 replies
-
I am also facing the same problem. In my case it not only splits the single row to multiple but the column with the problem itself gets splitted. And it's only occurring for the first row. I tried recreating the issue in other rows but this was not occuring This is the pdf i am using (Its just a random pdf i created for testing pdf plumber) : PATIENTRECORD12.pdf
And this is the output i got for the first table
@jsvine Can you please look into this. |
Beta Was this translation helpful? Give feedback.
4 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I use the extract_tables() function on pages that i extract with PDFplumber. However the tables I use sometimes contain multiple lines of text. I guess it recognizes an hard enter. I tried using different sizes for the rectangles of the cells but nothing worked. Is there an easy fix for this issue?
Beta Was this translation helpful? Give feedback.
All reactions