python-web-scrapper

This Python script scrapes company data from a website and saves it to a CSV file. It uses the BeautifulSoup library for HTML parsing and the requests library for making HTTP requests. It was first created and tested on an Argentinian website as part of a market research project, where it pulled data for 800+ companies across 30+ pages within 2 minutes.

Installation

Clone the repository or download the source code.
Ensure you have Python 3.x installed on your machine.
Install the required Python dependencies by running the following command:

pip install requests beautifulsoup4

Usage

Update the url_base variable in the code with the URL of the website's page containing the list of companies you want to scrape.
Run the Python script by executing the following command:

python scrapper.py

The script will start scraping the company data from each page and save it in a CSV file named company.csv in the same directory.
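
The flow described above (request each page, parse the company entries, write them to company.csv) can be sketched roughly as follows. This is a minimal illustration, not the contents of scrapper.py: the URL, the page limit, the CSS selectors (div.company, .name, .phone) and all function names are assumptions made for the example and need to be adapted to the target site.

```python
import csv

import requests
from bs4 import BeautifulSoup

# Hypothetical values: replace with the real listing URL and page count.
URL_BASE = "https://example.com/companies?page="
MAX_PAGES = 30


def fetch_page(url):
    """Download one page and return its HTML, raising on HTTP errors."""
    response = requests.get(url, timeout=10)
    response.raise_for_status()
    return response.text


def parse_companies(html):
    """Extract one dict per company entry (the selectors are assumptions)."""
    soup = BeautifulSoup(html, "html.parser")
    companies = []
    for card in soup.select("div.company"):
        name = card.select_one(".name")
        phone = card.select_one(".phone")
        companies.append({
            "name": name.get_text(strip=True) if name else "",
            "phone": phone.get_text(strip=True) if phone else "",
        })
    return companies


def scrape_all(url_base, max_pages, fetch=fetch_page):
    """Walk pages 1..max_pages, stopping early at the first empty page."""
    rows = []
    for page in range(1, max_pages + 1):
        found = parse_companies(fetch(f"{url_base}{page}"))
        if not found:
            break
        rows.extend(found)
    return rows


def save_csv(rows, path="company.csv"):
    """Write the collected rows to a CSV file with a header line."""
    with open(path, "w", newline="", encoding="utf-8") as csv_file:
        writer = csv.DictWriter(csv_file, fieldnames=["name", "phone"])
        writer.writeheader()
        writer.writerows(rows)
```

Calling save_csv(scrape_all(URL_BASE, MAX_PAGES)) would run the whole pipeline. Passing the fetch function as a parameter keeps the parsing logic testable without network access.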

Customization

You can modify the CSV file name and the fields being extracted by editing the csv_writer.writerow() statement in the code.
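
As a rough sketch of that kind of customization, the example below keeps the output file name and the list of fields in two constants, so that changing either one changes what csv_writer.writerow() emits. The field names, the constants and the write_companies helper are hypothetical and are not taken from scrapper.py.

```python
import csv

# Hypothetical settings: edit these to rename the file or change the columns.
OUTPUT_FILE = "company.csv"
FIELDS = ["name", "phone", "website"]


def write_companies(companies, path=OUTPUT_FILE, fields=FIELDS):
    """Write a header row, then one row per company dict, in field order."""
    with open(path, "w", newline="", encoding="utf-8") as csv_file:
        csv_writer = csv.writer(csv_file)
        csv_writer.writerow(fields)
        for company in companies:
            # Missing fields become empty cells instead of raising KeyError.
            csv_writer.writerow([company.get(field, "") for field in fields])
```

For instance, write_companies(rows, path="leads.csv", fields=["name", "website"]) would produce a two-column file with a different name.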

Contributing

Contributions are welcome! If you find any issues or have suggestions for improvements, please open an issue or submit a pull request.

License

This project is licensed under the MIT License.


VitorCoelho/web-scrapper-w-pagination
