Skip to content

raffaem/pdfextbook

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pdfextbook

Introduction

Extract pages from PDF files on bookmark boundaries.

PDF bookmarks point to a single page, but all the pages starting from the one pointed to by the bookmark until the one pointed to by the next bookmark at the same hierarchical level (excluded) will be extracted.

Therefore this tool is useful to extract chapters or sections from books or articles.

It is more powerful than the "Extract Bookmarked Pages" tool of Adobe Acrobat (and it's also free).

You will need the following software present in your path:

  • pdftk (needed to extract bookmark metadata and to extract pages)
  • fzf (needed to select which bookmark to extract)

Installation

This software is available on PyPI, so the best way to install it is by using pipx:

pipx install pdfextbook

Alternatively, you can clone the repo and install it with pip:

pip install .

About

Extract pages from PDF on bookmark boundaries.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages