Skip to content

harlo/MinxScraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

harloharlo
harlo
and
harlo
Jan 7, 2014
c909628 · Jan 7, 2014

History

38 Commits
Jan 4, 2014
Jan 6, 2014
Jan 7, 2014
Jan 6, 2014
Jan 4, 2014
Jan 4, 2014
Jan 4, 2014
Jan 5, 2014
Jan 5, 2014
Jan 4, 2014
Jan 4, 2014
Jan 5, 2014

Repository files navigation

MinxScraper


Setup

After cloning, run

git submodule update --init --recursive

Install the submodules tornado, requests, and m2x

cd library/[package]
sudo python setup.py install

Install dependencies (not included in packaging):

sudo pip install beautifulsoup4

(or see http://www.crummy.com/software/BeautifulSoup/ if you have trouble)


Chrome Extension

Enable Developer mode in chrome://extensions. Click the "Load unpacked extension" button, navigate to

/browser 

and click OK.


Config

Please edit the config files at:

/conf.json.example
/conf.py.example

And copy them as:

/conf.json
/conf.py

Start

First:

cd MinxScraper
python api.py

Then use the chrome extension. See this video for help: http://www.youtube.com/watch?v=3zVlcJeAssk

Stop

[ctrl+c]

About

Frankenstein's scraper-to-structured-data toolkit

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published