-
Hi there ahmed-bhs, this is similar to the recent discussion #1005. An approach is discussed there, but I haven't thought it through, so I'm not sure it's a general solution; it may need some tweaking. (#1009 is another recent question that asks for essentially the same thing.) Perhaps @jsvine / @samkit-jain have some thoughts on the issue? Is this particular type of extraction strategy something that can be done reliably? Is it something that could be supported by pdfplumber?
-
Following up on this question, I'm facing a similar problem: the extracted table contains a lot of "None" values. Is it possible to remove them and combine the result into a table in a format like:
Also, if a table spans multiple pages, how can the parts be combined?
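For what it's worth, here is a minimal sketch of one way to handle both points, assuming the rows come from pdfplumber's `page.extract_tables()` (a list of tables, each a list of rows whose cells are strings or `None`). The helper names `clean_table`, `merge_tables`, and `extract_merged_table` are hypothetical, not part of pdfplumber. It also assumes that every table in the PDF is a piece of the same logical table and that a continuation page may repeat the header row verbatim, which won't hold for every document.

```python
from typing import List, Optional

Row = List[Optional[str]]
Table = List[Row]


def clean_table(table: Table) -> List[List[str]]:
    """Replace None cells with "" and drop rows that end up entirely empty."""
    cleaned = []
    for row in table:
        cells = [(cell or "").strip() for cell in row]
        if any(cells):
            cleaned.append(cells)
    return cleaned


def merge_tables(tables: List[Table]) -> List[List[str]]:
    """Concatenate per-page tables, skipping a header repeated on later pages."""
    merged: List[List[str]] = []
    header = None
    for table in tables:
        rows = clean_table(table)
        if not rows:
            continue
        if header is None:
            # Assumption: the first non-empty row of the first table is the header.
            header = rows[0]
            merged.extend(rows)
        else:
            # Assumption: a continuation page may repeat the header verbatim.
            merged.extend(rows[1:] if rows[0] == header else rows)
    return merged


def extract_merged_table(pdf_path: str) -> List[List[str]]:
    """Collect every table in the PDF and merge them into one list of rows."""
    import pdfplumber  # imported here so the helpers above work without it

    with pdfplumber.open(pdf_path) as pdf:
        tables = [t for page in pdf.pages for t in page.extract_tables()]
    return merge_tables(tables)
```

If the `None` cells come from merged or spanning cells rather than genuinely empty ones, replacing them with empty strings may not be what you want; in that case tweaking the `table_settings` passed to `extract_tables()` is probably the better place to start.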
-
Is there any method you're aware of for converting a PDF document that contains both tables and text into a text format? Specifically, the tables should be transformed into comma-separated data.
For instance, the outcome should be:
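One possible approach, sketched under assumptions rather than offered as a supported pdfplumber feature: find the tables with `page.find_tables()`, take the text lines from `page.extract_text_lines()` (available in reasonably recent pdfplumber versions), drop the lines that sit vertically inside a table, and emit everything top to bottom with each table written as CSV. The helpers `table_to_csv`, `render_page`, and `pdf_to_text` are hypothetical names. The sketch only looks at vertical position, so text printed beside a table would be lost, and multi-column layouts would need more work.

```python
import csv
import io
from typing import List, Optional, Tuple

TextLine = Tuple[float, float, str]  # (top, bottom, text)
TableBlock = Tuple[float, float, List[List[Optional[str]]]]  # (top, bottom, rows)


def table_to_csv(rows: List[List[Optional[str]]]) -> str:
    """Serialise table rows as CSV text, writing None cells as empty fields."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    for row in rows:
        writer.writerow(["" if cell is None else cell for cell in row])
    return buf.getvalue().rstrip("\n")


def render_page(lines: List[TextLine], tables: List[TableBlock]) -> str:
    """Interleave plain text lines and CSV-rendered tables in top-to-bottom order."""
    blocks = []
    for top, bottom, text in lines:
        mid = (top + bottom) / 2
        # Skip lines whose vertical midpoint lies inside a table; the table
        # itself is emitted as CSV instead.
        if any(t_top <= mid <= t_bottom for t_top, t_bottom, _ in tables):
            continue
        blocks.append((top, text))
    for t_top, _, rows in tables:
        blocks.append((t_top, table_to_csv(rows)))
    blocks.sort(key=lambda block: block[0])
    return "\n".join(text for _, text in blocks)


def pdf_to_text(pdf_path: str) -> str:
    """Render every page of the PDF as text with tables converted to CSV."""
    import pdfplumber  # imported here so the helpers above work without it

    pages = []
    with pdfplumber.open(pdf_path) as pdf:
        for page in pdf.pages:
            # Table.bbox is (x0, top, x1, bottom).
            tables = [(t.bbox[1], t.bbox[3], t.extract()) for t in page.find_tables()]
            lines = [
                (line["top"], line["bottom"], line["text"])
                for line in page.extract_text_lines()
            ]
            pages.append(render_page(lines, tables))
    return "\n\n".join(pages)
```

Using the `csv` module rather than a plain `",".join(...)` means cells that themselves contain commas or quotes are escaped properly.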