Data Crawling Bootcamp

This repository collects the Python scripts written during a data crawling bootcamp. The scripts use selenium, webdriver_manager, pandas, and BeautifulSoup to scrape data from Naver Shopping, Instagram, Naver News, the Google Play Store, and YouTube.

Prerequisites

Before running any script, install the required libraries with the following commands:

pip install selenium
pip install webdriver_manager
pip install pandas
pip install beautifulsoup4
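
To confirm the installation worked, a quick check like the one below may help. It is a minimal sketch that uses only the standard library; the helper name `missing_packages` is hypothetical and not part of this repository. Note that beautifulsoup4 is imported under the name `bs4`.

```python
from importlib.util import find_spec

# Import names for the packages listed above.
# beautifulsoup4 installs under the import name "bs4".
REQUIRED = ["selenium", "webdriver_manager", "pandas", "bs4"]


def missing_packages(names):
    """Return the import names from `names` that cannot be found."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("All required libraries are installed.")
```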

Scripts Overview

1. chrome.py

This script scrapes product data from Naver Shopping.

URL: Naver Shopping
Data Scraped: Product names, prices, and categories.
Output: 애견용품.csv (the file name means "dog supplies")
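
The CSV output step can be sketched as follows. This is an illustration, not the script's actual code: the column names, the helper `save_products`, and the sample row in the usage note are assumptions, and the real script may write the file through pandas instead of the `csv` module.

```python
import csv


def save_products(rows, path):
    """Write product dicts to a CSV file with a header row."""
    # Assumed column names; the real script may label them differently.
    fieldnames = ["name", "price", "category"]
    # utf-8-sig adds a byte order mark so spreadsheet apps detect the
    # encoding and display Korean text correctly.
    with open(path, "w", newline="", encoding="utf-8-sig") as f:
        writer = csv.DictWriter(f, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)
```

For example, `save_products([{"name": "강아지 간식", "price": 12000, "category": "간식"}], "애견용품.csv")` would write one header row and one made-up product row.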

How to Run

python chrome.py

2. instagram.py

This script scrapes Instagram comments for a specified hashtag.

URL: Instagram
Data Scraped: Comments on posts with a specific hashtag.
Output: insta.csv
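
A hashtag has to be turned into a page address before the browser can open it. The sketch below shows one way to do that; the `/explore/tags/` URL pattern and the helper name `hashtag_url` are assumptions, not taken from the script.

```python
from urllib.parse import quote


def hashtag_url(tag):
    """Build an Instagram tag-page URL for a hashtag (assumed URL pattern)."""
    # Drop a leading '#' and percent-encode non-ASCII characters
    # so Korean hashtags produce a valid URL.
    clean = tag.lstrip("#")
    return "https://www.instagram.com/explore/tags/{}/".format(quote(clean))
```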

How to Run

python instagram.py

3. naver_news.py

This script scrapes news articles from Naver News based on a search query.

URL: Naver News
Data Scraped: Article titles, links, and descriptions.
Output: test.csv
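
Pulling titles and links out of a results page can be sketched as below. The scripts in this repository use BeautifulSoup; this example swaps in the standard library's `html.parser` so it runs without extra dependencies. The class and function names are hypothetical, and descriptions are left out because they depend on the page's actual markup.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect a title and link from every <a href="..."> element."""

    def __init__(self):
        super().__init__()
        self.results = []
        self._href = None   # href of the anchor currently being read
        self._text = []     # text fragments seen inside that anchor

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            title = "".join(self._text).strip()
            self.results.append({"title": title, "link": self._href})
            self._href = None


def extract_links(html):
    """Return a list of {'title', 'link'} dicts found in `html`."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.results
```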

How to Run

python naver_news.py

4. playstore.py

This script scrapes reviews and ratings from a specified app on the Google Play Store.

URL: Google Play Store
Data Scraped: Review text and star ratings.
Output: bae_review.csv
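
Once reviews and star ratings are collected, a short summary is a useful sanity check. The sketch below assumes each review is a dict with a `rating` key; that key and the helper name `rating_summary` are assumptions, not the script's actual column names.

```python
from collections import Counter


def rating_summary(reviews):
    """Return (average rating, count per star) for a list of review dicts."""
    # Ratings read back from a CSV arrive as strings, so convert them.
    ratings = [int(r["rating"]) for r in reviews]
    if not ratings:
        return 0.0, Counter()
    return sum(ratings) / len(ratings), Counter(ratings)
```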

How to Run

python playstore.py

5. youtube.py

This script scrapes video titles and view counts from YouTube based on a search query.

URL: YouTube
Data Scraped: Video titles and view counts.
Output: 연세대학교.csv (the file name means "Yonsei University")
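
View counts are usually shown as abbreviated labels rather than plain numbers, so they may need converting before analysis. The sketch below assumes labels such as "조회수 1.2만회" or "1.5M views"; those formats and the helper name `parse_view_count` are assumptions about the scraped text, not confirmed by the script.

```python
import re
from decimal import Decimal

# Multipliers for Korean and English count suffixes.
UNITS = {
    "천": 1_000, "만": 10_000, "억": 100_000_000,
    "K": 1_000, "M": 1_000_000, "B": 1_000_000_000,
}

# A number with an optional decimal part, directly followed by an optional unit.
_PATTERN = re.compile(r"(\d+(?:\.\d+)?)([천만억KMB]?)")


def parse_view_count(text):
    """Convert a view-count label to an integer, or None if no number is found."""
    match = _PATTERN.search(text.replace(",", ""))
    if match is None:
        return None
    number, unit = match.groups()
    # Decimal avoids floating-point rounding when scaling values like 1.2.
    return int(Decimal(number) * UNITS.get(unit, 1))
```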

How to Run

python youtube.py

j0ntendo/data-crawling-bootcamp