A simple pdftotext conversion tool for Windows 8.1/10/11 and FEDORA/UBUNTU/DEBIAN/ARCH based linux distros using poppler-utils and Google's tesseract-ocr.

Anish-M-code/pdftotext

PDF TO TEXT CONVERTER

A simple Python script that converts PDF documents to text files.

Primary Supported Platforms

  • Debian / Debian Based Linux Distros
  • Ubuntu / Ubuntu Based Linux Distros
  • Fedora / Fedora Based Linux Distros
  • Arch Linux / Arch Linux Based Distros
  • Void Linux / Void Linux Based Distros
  • Windows 10 and later

Quick Installation

To install from PyPI, run the following command in a Linux terminal, Windows PowerShell, or Command Prompt:

pip install pdftotext3

Then run the following command inside the folder containing the PDF files to start converting them to text:

pdftotext
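
Because the command works on the folder it is run from, it can help to preview which files it would find there. The following is a minimal sketch, not the tool's own code: the helper names `list_pdfs` and `text_path_for` are hypothetical, and writing each result next to its source as a `.txt` file is an assumption, since this README does not state the output naming.

```python
from pathlib import Path


def list_pdfs(folder="."):
    """Return the PDF files directly inside `folder` (no recursion), sorted."""
    return sorted(
        p for p in Path(folder).iterdir()
        if p.is_file() and p.suffix.lower() == ".pdf"
    )


def text_path_for(pdf_path):
    """Assumed output name: the same path with a .txt suffix (hypothetical)."""
    return Path(pdf_path).with_suffix(".txt")


if __name__ == "__main__":
    # Preview the PDFs in the current folder and their assumed output names.
    for pdf in list_pdfs():
        print(f"{pdf.name} -> {text_path_for(pdf).name}")
```

Running this sketch in a folder before invoking the tool shows which PDF files are present without modifying anything.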

On Windows, additional software is required for this program to work properly; refer to the Windows Requirements here. To run the program directly from a GitHub download, refer to the instructions here.
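
Since the tool relies on poppler-utils and tesseract-ocr, a quick check that their executables are on the PATH can save a failed run. This is a rough sketch under stated assumptions: `pdftoppm` and `tesseract` are executables shipped by those two packages, but which ones this tool actually calls is not confirmed by this README, and `missing_tools` is a hypothetical helper name.

```python
import shutil

# Assumed executables: pdftoppm (poppler-utils) and tesseract (tesseract-ocr).
REQUIRED_TOOLS = ("pdftoppm", "tesseract")


def missing_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return the names in `tools` that cannot be found on the PATH."""
    return [name for name in tools if which(name) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("All assumed external tools were found.")
```

The `which` parameter exists only so the lookup can be swapped out when testing; in normal use the default `shutil.which` is enough.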

NOTE: THIS TOOL IS MEANT TO CONVERT PDF DOCUMENTS THAT ARE NOT EASILY CONVERTIBLE TO OTHER FORMATS. CURRENTLY, THIS TOOL SUPPORTS ENGLISH ONLY.
