Python script to convert multiple scanned images into a single word document using Google Vision API.

teraflik/VisionScript

VisionScript

A Python 3 script that converts multiple images of scanned text into a single Word document using the Google Vision API and python-docx.

Follow the steps below to download, install, and run this project.

Dependencies

You need Python 3 with pip and a Google Cloud account with access to the Vision API.

Step 1. Clone the project or download ZIP

git clone https://github.com/teraflik/VisionScript.git

Step 2. Install dependencies

Open PowerShell or Bash and type:

$ cd VisionScript
$ pip install -r requirements.txt
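To confirm that the install worked, a small check like the one below can help. It is only a sketch: the contents of requirements.txt are not shown here, so the module names `google.cloud.vision` and `docx` are assumptions based on the libraries named above, and `is_installed` is a hypothetical helper.

```python
import importlib.util


def is_installed(module_name):
    """Return True if `module_name` can be located without importing it fully."""
    try:
        return importlib.util.find_spec(module_name) is not None
    except ModuleNotFoundError:
        # Raised when a parent package (e.g. "google") is itself missing.
        return False


def report_missing(module_names):
    """Return the subset of `module_names` that cannot be found."""
    return [name for name in module_names if not is_installed(name)]


# Assumed import names for google-cloud-vision and python-docx.
ASSUMED_MODULES = ["google.cloud.vision", "docx"]
```

Calling `report_missing(ASSUMED_MODULES)` returns an empty list when both packages are importable.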

Step 3. Set up your Google Cloud API Key:

On Windows, open Environment Variables and add a new variable. Set the variable name to GOOGLE_APPLICATION_CREDENTIALS and the variable value to the path where your access key is stored.
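Before running the script, it can be worth checking that the variable is actually visible to Python and points at a real file. The following is a minimal sketch, not part of this project; `check_credentials` is a hypothetical helper.

```python
import os
from pathlib import Path

ENV_VAR = "GOOGLE_APPLICATION_CREDENTIALS"


def check_credentials(environ=os.environ):
    """Return the key file path if the variable is set and the file exists."""
    value = environ.get(ENV_VAR)
    if not value:
        raise RuntimeError(f"{ENV_VAR} is not set")
    path = Path(value)
    if not path.is_file():
        raise RuntimeError(f"{ENV_VAR} points to a missing file: {path}")
    return path
```

Running `print(check_credentials())` in a new console prints the key path, or raises an error that says what is wrong. On Windows, open a fresh console after editing Environment Variables so the change is picked up.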

Step 4. Store your images and run the script

Copy your images to the images\ folder alongside main.py, then run the script by double-clicking it or by typing in a console:

python main.py

Step 5. Output is stored in Word.docx
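For readers curious how such a pipeline fits together, the steps above can be sketched roughly as follows. This is not the project's main.py. It assumes google-cloud-vision version 2 or later and python-docx are installed, that images are processed in filename order, and that each image becomes one paragraph followed by a page break. The names `list_images`, `ocr_image`, `images_to_docx`, and the extension list are hypothetical.

```python
from pathlib import Path

# Assumed set of accepted image types; the real script may differ.
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg", ".tif", ".tiff", ".bmp"}


def list_images(folder):
    """Return image files in `folder`, sorted by name so page order is stable."""
    return sorted(
        p for p in Path(folder).iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_EXTENSIONS
    )


def ocr_image(client, path):
    """Run Vision document text detection on one image and return its text."""
    from google.cloud import vision  # provided by google-cloud-vision

    image = vision.Image(content=path.read_bytes())
    response = client.document_text_detection(image=image)
    if response.error.message:
        raise RuntimeError(response.error.message)
    return response.full_text_annotation.text


def images_to_docx(folder="images", output="Word.docx"):
    """OCR every image in `folder` and write the text to one Word document."""
    from google.cloud import vision
    from docx import Document  # provided by python-docx

    # The client reads GOOGLE_APPLICATION_CREDENTIALS from the environment.
    client = vision.ImageAnnotatorClient()
    document = Document()
    for path in list_images(folder):
        document.add_paragraph(ocr_image(client, path))
        document.add_page_break()
    document.save(output)
```

The third-party imports sit inside the functions so that `list_images` can be used and tested without credentials or network access. Calling `images_to_docx()` sends each image to the Vision API, which may incur charges on your Google Cloud account.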
