Skip to content

No-dependency Python code that extracts text from a PDF file

License

Notifications You must be signed in to change notification settings

jankais3r/PDF-Text-Extract

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PDF Text Extract

No-dependency Python code that extracts text from a PDF file

Screenshot of script in action

By default, the script extracts text from file called sample.pdf. There is also a commented code showing you how to extract text from an online PDF file (uncomment rows 7 & 11).

In my testing the script hasn't always been succesfull at extracting text from complex PDFs, so some more work might be required to support those.

About

No-dependency Python code that extracts text from a PDF file

Topics

Resources

License

Stars

Watchers

Forks

Languages