uCourse-crawler

🎒 Scrape the courses info from the University of Nottingham's website. (Different campuses and academic years supported.)

Requirements

Nodejs
MongoDB (optional)

Usage

git clone https://github.com/Songkeys/uCourse-crawler.git
cd uCourse-crawler
npm i
npm start

Demo

Output Methods

There are two output methods provided:

MongoDB (Recommended)
Local JSON file

Output (MongoDB)

For mongoDB, you will need to input a mongo connection string URI. The output will be stored in a table called course_[campus]_[year]. E.g. course_china_2020.

The output example:

Output (JSON file)

For local JSON file, the output will be in a JSON format stored in /dist/[tablename].json.

The output example:

Size & Time

The estimated output size will be 2~3 MB per campus per year.

The estimated crawling time will be 30~50 mins per campus per year (depending on your network).

Todo

Concurency using pupeteer-cluster
Breakpoint resume

Resources

Resouce website: https://mynottingham.nottingham.ac.uk/psp/psprd/EMPLOYEE/HRMS/c/UN_PROG_AND_MOD_EXTRACT.UN_PAM_CRSE_EXTRCT.GBL
- There is also a short url for this: https://u.nu/course. (You may need to visit twice to open it for some authorization issue.)

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
course.js		course.js
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

uCourse-crawler

Requirements

Usage

Demo

Output Methods

Output (MongoDB)

Output (JSON file)

Size & Time

Todo

Resources

About

Releases

Packages

Languages

License

uFair-Tech/uCourse-crawler

Folders and files

Latest commit

History

Repository files navigation

uCourse-crawler

Requirements

Usage

Demo

Output Methods

Output (MongoDB)

Output (JSON file)

Size & Time

Todo

Resources

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages