Skip to content

Latest commit

 

History

History
187 lines (148 loc) · 13.7 KB

README.md

File metadata and controls

187 lines (148 loc) · 13.7 KB

General Overview

See the releases page to see new additions/modifications for each release!

See this comparison page to see new additions/modifications that will be available in the NEXT release!

See sister YouTube-Channels repository for a list of interesting channels!
  • The YouTube-Channels sister repository is a separate repository that uses this package to create a list of videos uploaded by every channel supported by the repository.
  • The sister repository will update the lists of videos once a week.
  • NOTE: In order to minimize the size of the sister repo, the repo contains the list of videos in ONLY the csv format, and not in txt or md format.

GitHub stars GitHub forks

GitHub license PyPI version
GitHub repo size GitHub code size in bytes Lines of code
GitHub Open Issues GitHub closed issues GitHub pull requests GitHub closed pull requests

GitHub most recent (Pre-)Release Date
GitHub release (latest SemVer including pre-releases) version
GitHub commit activity/week GitHub commit activity/month GitHub commit activity/year
GitHub last commit GitHub last commit to main (branch) GitHub commits since latest release (by SemVer including pre-releases)

This package provides a quick, simple way to create a list of all videos posted to any YouTube channel by providing just the URL to that channel's videos. Follow the link to the API in the language you want to work in from the API guides section below.

This program relies on Selenium so it has built-in dependecy setup for all Selenium webdrivers, but you can also explicitly use submodules of this package to download Selenium webdriver dependencies for your own projects. This makes it much easier to work with Selenium even if you update your browser constantly! See the Explicitly downloading all Selenium dependencies for directions.

If you already scraped a channel and the channel uploaded a new video, simply rerun this program on that channel and this package updates your files to include the newer video(s)!

API guides

Other Information

Dependencies
  • The first time you run this package the automated downloader should install everything you need, but in case it doesn't, refer to the link below and/or file an issue here.
    • Manual Selenium downloads here
  • The Selenium drivers are all pretty similar but differ in subtle ways, so play around with them and see what's different :)

NOTE that you also need the corresponding browser installed to properly run the selenium driver.

Interesting information about YouTube

There are two types of YouTube channels: one type is a user channel and the other is a channel channel.

Future Features

Main Features

  • take url and scrape the video name and url for every video for that user
  • create txt, csv, md files

Additional Features

  • enable custom name for output file
  • update previously created file with new videos uploaded since file creation
    • put all urls from file in a (hash) set and check to see if any urls on page (next time program runs on same page) are missing from the (hash) set
  • create web interface
    • input box for channel url
    • generate static HTML page with video URLs for output
Technical Specifications

This python3 package supports multi-platform, mutli-driver use and is currently under active development.

Currently supported operating systems include MacOS, Linux32, Linux64, Windows32, and Windows64. Currently supported drivers include Firefox, Opera, Safari (MacOS), Chrome, Brave, and Edge (Windows).

This package provides built-in support for common errors and exceptions, along with helpful hints, including

  • checking to see if the file to be created already exists
  • checking to see if the user explicitly specified a driver to use, and in case they didn't specify a driver, running the program using Firefox as default and showing the user the available driver options in the terminal output
    • explicitly specifying driver using
      • ListCreator(driver='firefox')
      • ListCreator(driver='opera')
      • ListCreator(driver='safari')
      • ListCreator(driver='chrome')
      • ListCreator(driver='brave')
      • ListCreator(driver='edge')
  • running the program in headless mode if using the geckodriver (Firefox) or chromedriver
    • ListCreator(headless=True)
  • checking to see if the user has the correct Selenium dependency installed
    • installs and shows the user the commands they can run to install the correct dependency if the user has an incorrect dependency (or hasn't downloaded the dependency at all)
      • program checks the operating system of the user's machine and returns a curl command piped into a tar command to download the correct dependecy into a directory the program can access without having to add the executable to PATH manually
      • user still needs to pick the correct command to run by following the directions (if autoamted download doesn't complete)
  • checking to see if the result of the scraping returns anything, and prompts the user to verify the url argument if nothing is found

Overview of package structure

__init__.py

__main__.py

script.py

execute.py

program.py

file submodule

download submodule

notifications.py

Stargazers Over Time

Stargazers over time

Usage Statistics

PyPI - Daily Downloads PyPI - Weekly Downloads PyPI - Monthly Downloads
PePY Weekly Downloads PePY Monthly Downloads PePY Total Downloads
GitHub release (latest by SemVer including pre-releases) downloads

If you found this interesting or useful, please consider starring this repo so other people can more easily find and use this. Thanks!