Skip to content
@apify

Apify

We're making the web more programmable.

Pinned Loading

  1. crawlee-python crawlee-python Public

    Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…

    Python 4k 258

  2. crawlee crawlee Public

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…

    TypeScript 15.2k 641

  3. proxy-chain proxy-chain Public

    Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.

    JavaScript 839 139

  4. apify-sdk-js apify-sdk-js Public

    Apify SDK monorepo

    TypeScript 119 31

  5. got-scraping got-scraping Public

    HTTP client made for scraping based on got.

    TypeScript 527 40

  6. fingerprint-suite fingerprint-suite Public

    Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.

    TypeScript 914 97

Repositories

Showing 10 of 128 repositories
  • crawlee Public

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

    apify/crawlee’s past year of commit activity
    TypeScript 15,202 Apache-2.0 641 111 (1 issue needs help) 15 Updated Oct 4, 2024
  • apify-cli Public

    Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.

    apify/apify-cli’s past year of commit activity
    TypeScript 121 18 34 (1 issue needs help) 8 Updated Oct 4, 2024
  • apify-sdk-js Public

    Apify SDK monorepo

    apify/apify-sdk-js’s past year of commit activity
    TypeScript 119 Apache-2.0 31 9 7 Updated Oct 4, 2024
  • workflows Public

    Apify's reusable github workflows

    apify/workflows’s past year of commit activity
    6 3 2 3 Updated Oct 3, 2024
  • apify-shared-js Public

    Utilities and constants shared across Apify projects.

    apify/apify-shared-js’s past year of commit activity
    TypeScript 12 Apache-2.0 10 4 2 Updated Oct 3, 2024
  • apify-docs Public

    This project is the home of Apify's documentation.

    apify/apify-docs’s past year of commit activity
    API Blueprint 26 Apache-2.0 73 64 24 Updated Oct 3, 2024
  • fingerprint-suite Public

    Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.

    apify/fingerprint-suite’s past year of commit activity
    TypeScript 914 Apache-2.0 97 18 11 Updated Oct 3, 2024
  • crawlee-python Public

    Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

    apify/crawlee-python’s past year of commit activity
    Python 4,038 Apache-2.0 258 70 8 Updated Oct 3, 2024
  • airbyte Public Forked from airbytehq/airbyte

    Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.

    apify/airbyte’s past year of commit activity
    Python 0 4,086 0 0 Updated Oct 3, 2024
  • homebrew-tap Public

    A Homebrew tap for Apify tools

    apify/homebrew-tap’s past year of commit activity
    Ruby 8 1 0 4 Updated Oct 2, 2024