Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

node.js製作一個爬蟲 #31

Open
BenhuangBot opened this issue Jan 7, 2018 · 2 comments
Open

node.js製作一個爬蟲 #31

BenhuangBot opened this issue Jan 7, 2018 · 2 comments

Comments

@BenhuangBot
Copy link

目的

我想要用node.js製作一個爬蟲來抓取特定網站上的資料,這些資料需要登錄網站後才看得到,我有該網站的帳號密碼,請問我應該怎樣下手比較好?

@poying
Copy link

poying commented Jan 8, 2018

剛好有個範例 https://github.com/taiwan/tocwc-data

如果需要讓網站執行 js 那讓程式開瀏覽器抓取資料比較簡單,如同上面那個範例。其他情況只需要確保每個 http request 都有帶 cookie,response 回來的 cookie 也都有存下來供發 request 使用即可。至於怎的處理 cookie 就要看你用那套 library。

CI build error 是因為懶的修,功能是對的

@BenhuangBot
Copy link
Author

BenhuangBot commented Mar 1, 2018 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants