Skip to content

Navigation Menu

apify

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

Apify

We're making the web more programmable.

Verified
We've verified that the organization apify controls the domain:
- apify.com
Learn more about verified organizations

697 followers
The Interweb
https://apify.com/
@apify
company/apifytech
https://youtube.com/apify
https://discord.com/invite/jyEM2PRvMU
hello@apify.com

Overview
Repositories
Projects
Packages
People
Sponsoring

More

Overview
Repositories
Projects
Packages
People
Sponsoring

Pinned Loading

crawlee-python crawlee-python Public

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…

Python 4.5k 313
crawlee crawlee Public

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…

TypeScript 15.5k 664
proxy-chain proxy-chain Public

Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.

JavaScript 850 143
apify-sdk-js apify-sdk-js Public

Apify SDK monorepo

TypeScript 123 35
got-scraping got-scraping Public

HTTP client made for scraping based on got.

TypeScript 551 44
fingerprint-suite fingerprint-suite Public

Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.

TypeScript 961 101

Repositories

Loading

Type

Select type

All Public Sources Forks Archived Mirrors Templates

Language

Select language

All API Blueprint C++ Dockerfile HTML JavaScript PHP Python Ruby Shell TypeScript

Sort

Select order

Last updated Name Stars

Showing 10 of 128 repositories

apify-cli Public
Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.

apify/apify-cli’s past year of commit activity

TypeScript 122 19 34 (1 issue needs help) 5 Updated Nov 10, 2024
apify-shared-js Public
Utilities and constants shared across Apify projects.

apify/apify-shared-js’s past year of commit activity

TypeScript 12 Apache-2.0 11 4 0 Updated Nov 9, 2024
openapi Public
An OpenAPI specification for the Apify API.

apify/openapi’s past year of commit activity

JavaScript 2 MIT 0 17 2 Updated Nov 10, 2024
crawlee Public
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

apify/crawlee’s past year of commit activity

TypeScript 15,521 Apache-2.0 664 116 (1 issue needs help) 14 Updated Nov 10, 2024
docusaurus-plugin-typedoc-api Public Forked from milesj/docusaurus-plugin-typedoc-api
Apify's fork of `docusaurus-plugin-typedoc-api`, customized for our Python documentation.

apify/docusaurus-plugin-typedoc-api’s past year of commit activity

TypeScript 0 29 0 1 Updated Nov 9, 2024
rag-web-browser Public
RAG Web Browser is a tool to provide your RAG pipelines with up-to-date information from the web.

apify/rag-web-browser’s past year of commit activity

TypeScript 2 0 0 1 Updated Nov 9, 2024
actor-whitepaper Public
This whitepaper describes a new concept for building serverless microapps called Actors, which are easy to develop, share, integrate, and build upon. Actors are a reincarnation of the UNIX philosophy for programs running in the cloud.

apify/actor-whitepaper’s past year of commit activity

2 Apache-2.0 0 7 5 Updated Nov 8, 2024
crawlee-python Public
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

apify/crawlee-python’s past year of commit activity

Python 4,512 Apache-2.0 313 73 11 Updated Nov 8, 2024
actor-beautifulsoup-scraper Public

apify/actor-beautifulsoup-scraper’s past year of commit activity

Python 3 Apache-2.0 0 2 0 Updated Nov 8, 2024
apify-client-python Public
Apify API client for Python

apify/apify-client-python’s past year of commit activity

Python 47 Apache-2.0 11 8 2 Updated Nov 8, 2024

View all repositories

People

Top languages

Loading…

Most used topics

apify web-scraping scraping headless-chrome playwright

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.