Skip to content

Get infomations of all students who joined the Entrace HighSchool Examination in 2019 in Binh Duong.

Notifications You must be signed in to change notification settings

leduythuccs/GetEntranceExaminationData

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 

Repository files navigation

GetEntranceExaminationData

The purpose of this project is get some infomations of all students who had join the Entrace HighSchool Examination in 2019 in Binh Duong.

The infomations is got by crawling the html of this website.

Because the result of every students is store at a url like: prefix+ID+suffix (which "http://sgd.binhduong.gov.vn/Tracuudiem/Tuyensinh10/tabid/294/NamThi/2019/DotThiId/28/ThiSinhId/" is the prefix, "/Default.aspx" is the suffix), so I just brute force all the ID and crawl the html data.

I've used BeautifulSoup4 to crawl html data much more easier, and used unicodecsv instead of csv because the data was written in Vietnamese :(

About

Get infomations of all students who joined the Entrace HighSchool Examination in 2019 in Binh Duong.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages