Apify
Pinned Loading
Repositories
- crawlee Public
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
apify/crawlee’s past year of commit activity - docusaurus-plugin-typedoc-api Public Forked from milesj/docusaurus-plugin-typedoc-api
Apify's fork of `docusaurus-plugin-typedoc-api`, customized for our Python documentation.
apify/docusaurus-plugin-typedoc-api’s past year of commit activity - rag-web-browser Public
RAG Web Browser is a tool to provide your RAG pipelines with up-to-date information from the web.
apify/rag-web-browser’s past year of commit activity - actor-whitepaper Public
This whitepaper describes a new concept for building serverless microapps called Actors, which are easy to develop, share, integrate, and build upon. Actors are a reincarnation of the UNIX philosophy for programs running in the cloud.
apify/actor-whitepaper’s past year of commit activity - crawlee-python Public
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
apify/crawlee-python’s past year of commit activity - actor-beautifulsoup-scraper Public
apify/actor-beautifulsoup-scraper’s past year of commit activity
Top languages
Loading…