Scrappy

Scrappy is a desktop application built with Electron.js that provides an intuitive UI for web scraping. Designed to simplify the extraction of data from websites, Scrappy integrates Selenium for handling both static and dynamic content, including infinite scrolls and content loaders.

The core concept of Scrappy revolves around pipelines: sequences of actions that are executed step-by-step to automate data collection tasks.

✨ Features

Customizable Automation Scripts
Create and manage your own automation scripts to scrape any kind of data from websites.
Dynamic Content Handling
Supports dynamic loaders and infinite scrolling, ensuring complete data capture.
Pipeline Workflow
Easily define a sequence of actions such as clicks, scrolls, waits, and data extraction to build efficient scraping pipelines.
Modern UI
Built using React and Vite, Scrappy provides a fast and user-friendly interface for managing scraping tasks.
Cross-Platform
Compatible with Windows, macOS, and Linux, thanks to the power of Electron.js.

🚀 Tech Stack

Frontend: React, Vite
Backend Automation: Selenium
Desktop Framework: Electron.js

📖 Getting Started

Follow these steps to get started with Scrappy:

1. Clone the Repository

git clone https://github.com/k1tbyte/scrappy.git
cd scrappy
bun install
bun run dev

2. Define a pipeline

Create a sequence of actions (e.g., scroll, delay, retrieve, script, pipe)
Put website URL and execute your pipeline to automate the data collection process
View results in json format directly in the application or redirect to a file using pipe action

🚀 Contributing

We welcome contributions! To contribute to Scrappy

Fork the repository
Create a new branch (git checkout -b feature)
Make your changes
Commit your changes (git commit -am 'Add new feature')
Push to the branch (git push origin feature)
Create a new Pull Request

📜 License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
electron		electron
public		public
src		src
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
electron-builder.json5		electron-builder.json5
index.html		index.html
package.json		package.json
postcss.config.js		postcss.config.js
tailwind.config.js		tailwind.config.js
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scrappy

✨ Features

🚀 Tech Stack

📖 Getting Started

1. Clone the Repository

2. Define a pipeline

🚀 Contributing

📜 License

About

Releases

Packages

Languages

License

k1tbyte/scrappy

Folders and files

Latest commit

History

Repository files navigation

Scrappy

✨ Features

🚀 Tech Stack

📖 Getting Started

1. Clone the Repository

2. Define a pipeline

🚀 Contributing

📜 License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages