pdfextbook

Introduction

Extract pages from PDF files on bookmark boundaries.

A PDF bookmark points to a single page. This tool extracts every page from the one the selected bookmark points to up to, but not including, the page pointed to by the next bookmark at the same hierarchical level.
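The range rule above can be sketched in a few lines of Python. This is an illustrative sketch, not the tool's actual code: the `Bookmark` type and the `page_range` function are hypothetical names, and two behaviours are assumptions, namely that a bookmark with no later same-level bookmark runs to the last page, and that a range never ends before the page it starts on.

```python
from typing import NamedTuple


class Bookmark(NamedTuple):
    title: str
    level: int  # 1 = top level, 2 = nested one level down, ...
    page: int   # 1-based page the bookmark points to


def page_range(bookmarks, index, total_pages):
    """Return the inclusive (first, last) pages covered by bookmarks[index]."""
    current = bookmarks[index]
    for later in bookmarks[index + 1:]:
        if later.level == current.level:
            # The page of the next same-level bookmark is excluded.
            # Assumption: if both bookmarks share a page, keep that one page.
            return current.page, max(current.page, later.page - 1)
    # Assumption: with no later same-level bookmark, run to the last page.
    return current.page, total_pages


outline = [
    Bookmark("Chapter 1", 1, 1),
    Bookmark("Section 1.1", 2, 3),
    Bookmark("Section 1.2", 2, 8),
    Bookmark("Chapter 2", 1, 15),
]

print(page_range(outline, 0, 30))  # (1, 14)
print(page_range(outline, 1, 30))  # (3, 7)
```

Selecting "Chapter 1" yields pages 1 to 14 because "Chapter 2", the next level-1 bookmark, starts on page 15; the nested level-2 bookmarks in between do not end the range.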

This makes the tool useful for extracting chapters or sections from books or articles.

It is more powerful than the "Extract Bookmarked Pages" tool of Adobe Acrobat (and it's also free).

The following software must be available on your PATH:

pdftk (needed to extract bookmark metadata and to extract pages)
fzf (needed to select which bookmark to extract)
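As a rough illustration of how these two dependencies could be used from Python, the sketch below checks that both programs are on the PATH and parses the bookmark records printed by `pdftk <file> dump_data`. The function names (`missing_tools`, `parse_bookmarks`, `dump_bookmarks`) are hypothetical and are not taken from pdfextbook's source; only the `pdftk ... dump_data` operation and its `Bookmark*` record keys are assumed from pdftk itself.

```python
import shutil
import subprocess

REQUIRED_TOOLS = ("pdftk", "fzf")


def missing_tools(tools=REQUIRED_TOOLS):
    """Return the names of required programs that are not on the PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


def parse_bookmarks(dump_text):
    """Parse the Bookmark* records found in `pdftk <file> dump_data` output."""
    bookmarks = []
    for line in dump_text.splitlines():
        if line.strip() == "BookmarkBegin":
            bookmarks.append({})  # start a new record
            continue
        key, sep, value = line.partition(":")
        if not sep or not bookmarks or not key.startswith("Bookmark"):
            continue  # unrelated metadata, or a field outside any record
        value = value.strip()
        if key == "BookmarkTitle":
            bookmarks[-1]["title"] = value
        elif key == "BookmarkLevel":
            bookmarks[-1]["level"] = int(value)
        elif key == "BookmarkPageNumber":
            bookmarks[-1]["page"] = int(value)
    return bookmarks


def dump_bookmarks(pdf_path):
    """Run pdftk on pdf_path and return its parsed bookmarks."""
    result = subprocess.run(
        ["pdftk", pdf_path, "dump_data"],
        capture_output=True, text=True, check=True,
    )
    return parse_bookmarks(result.stdout)
```

Calling `missing_tools()` before doing any work gives a clear error message up front, rather than a failure partway through when `pdftk` or `fzf` is first invoked.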

Installation

This software is available on PyPI, so the best way to install it is by using pipx:

pipx install pdfextbook

Alternatively, you can clone the repo and install it with pip:

pip install .

raffaem/pdfextbook