Repositories list
18 repositories
abx-spec-behaviors (Public)
Proposal for a shared user script specification between scraping, crawling, archiving, and AI tools. Allows user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, and many other contexts with minimal adjustments.

abx-pkg (Public)

abx-dl (Public)
⬇️ A CLI tool to download all discovered content from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git srcs, and more...

docker-archivebox (Public)
Home of the official docker image for ArchiveBox

Official Python package for ArchiveBox, the self-hosted internet archiving solution.

homebrew-archivebox (Public archive)
Homebrew formula for the ArchiveBox self-hosted internet archiving solution.

debian-archivebox (Public archive)
Home of the official apt/deb package for Ubuntu/Debian-based systems.

docs (Public)

readability-extractor (Public)
Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page's article text.

Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

archivebox-proxy (Public)
Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.

good-karma-kit (Public)
😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...

community (Public)
A wiki of the broader Web Archiving Community: important organizations, alternative projects, blog posts, and more.

DigestBox (Public)
DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.

electron-archivebox (Public)

internet-archiving-talk (Public)