yuriy-sorokin/crawler

Crawler

This package downloads the HTML of a web page, then follows the page's links and downloads the linked pages as well. All downloaded pages are saved to the ./storage directory.
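The behaviour described above can be sketched as a bounded breadth-first crawl. This is an illustration under stated assumptions, not the package's actual code: the function names `extractLinks` and `crawl`, the injected `$fetch` callable, and the decision to resolve only absolute and root-relative links are all hypothetical. The sketch runs sequentially and keeps pages in memory; writing to ./storage and parallel downloading are left out.

```php
<?php
declare(strict_types=1);

/**
 * Hypothetical helper: collect http(s) links from an HTML string.
 * Only absolute and root-relative hrefs are resolved in this sketch.
 */
function extractLinks(string $html, string $baseUrl): array
{
    $base = parse_url($baseUrl);
    if (trim($html) === '' || !isset($base['scheme'], $base['host'])) {
        return [];
    }
    $origin = $base['scheme'] . '://' . $base['host']
        . (isset($base['port']) ? ':' . $base['port'] : '');

    // Real-world HTML is rarely valid, so collect parser errors silently.
    $previous = libxml_use_internal_errors(true);
    $doc = new DOMDocument();
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $links = [];
    foreach ($doc->getElementsByTagName('a') as $anchor) {
        $href = trim($anchor->getAttribute('href'));
        if (preg_match('#^https?://#i', $href) === 1) {
            $links[] = $href;                 // already absolute
        } elseif ($href !== '' && $href[0] === '/' && substr($href, 0, 2) !== '//') {
            $links[] = $origin . $href;       // root-relative
        }
    }
    return array_values(array_unique($links));
}

/**
 * Hypothetical crawl loop: breadth-first, stops after $maxPages pages.
 * $fetch takes a URL and returns the HTML string, or null on failure.
 */
function crawl(string $startUrl, int $maxPages, callable $fetch): array
{
    $queue = [$startUrl];
    $seen = [$startUrl => true];
    $pages = [];

    while ($queue !== [] && count($pages) < $maxPages) {
        $url = array_shift($queue);
        $html = $fetch($url);
        if ($html === null) {
            continue; // download failed, move on to the next URL
        }
        $pages[$url] = $html;

        foreach (extractLinks($html, $url) as $link) {
            if (!isset($seen[$link])) {
                $seen[$link] = true; // never queue the same URL twice
                $queue[] = $link;
            }
        }
    }
    return $pages;
}
```

Passing the downloader in as `$fetch` keeps the sketch testable without network access. A real run could pass a wrapper around an HTTP client instead.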

Usage

php ./composer.phar install

php bin/app.php --url=https://spiegel.de

By default, it downloads up to 100 pages using 5 parallel processes.

To change that, provide additional CLI arguments:

php bin/app.php --url=https://spiegel.de --max-pages=500 --max-processes=30
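As a rough illustration of how the defaults and overrides above might be resolved, here is a minimal sketch. It is an assumption about one possible approach, not the code in bin/app.php: the function name `resolveOptions` and the two constants are hypothetical, and only the option names and default values (100 pages, 5 processes) come from this README.

```php
<?php
declare(strict_types=1);

// Defaults as stated in this README; the constant names are hypothetical.
const DEFAULT_MAX_PAGES = 100;
const DEFAULT_MAX_PROCESSES = 5;

/**
 * Hypothetical helper: validate raw CLI options and fill in defaults.
 * $raw has the shape returned by PHP's getopt() for long options.
 */
function resolveOptions(array $raw): array
{
    $url = $raw['url'] ?? null;
    if (!is_string($url) || filter_var($url, FILTER_VALIDATE_URL) === false) {
        throw new InvalidArgumentException('--url must be a valid URL');
    }

    $options = ['url' => $url];
    $defaults = [
        'max-pages' => DEFAULT_MAX_PAGES,
        'max-processes' => DEFAULT_MAX_PROCESSES,
    ];

    foreach ($defaults as $name => $default) {
        // Accept only whole numbers >= 1; anything else is rejected.
        $value = filter_var(
            $raw[$name] ?? $default,
            FILTER_VALIDATE_INT,
            ['options' => ['min_range' => 1]]
        );
        if ($value === false) {
            throw new InvalidArgumentException("--$name must be a positive integer");
        }
        $options[$name] = $value;
    }
    return $options;
}
```

In a script, the raw array could come from `getopt('', ['url:', 'max-pages:', 'max-processes:'])`, which accepts the `--name=value` form used in the commands above.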
