A newspaper scrubber for http://prothom-alo.com that collects images and their corresponding captions.
A newspaper scrubber for http://www.thedailystar.net that collects news info.
- install the dependencies with `pip install -r requirements.txt`
- set `start_date` and `end_date`
- set `category` and the page range
- after installing the dependencies and setting the parameters, run `python dailystar-scrubber.py` or `python prothomalo-scrubber.py`
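
The date, category, and page-range parameters mentioned above might be set along these lines. This is only a sketch. The helper `iter_dates`, the variable `page_range`, the value formats, and all example values are assumptions for illustration; check the scripts themselves for how they actually name and read these settings.

```python
from datetime import date, timedelta

# Hypothetical parameter values; the real scripts may expect other formats.
start_date = date(2019, 1, 1)
end_date = date(2019, 1, 3)
category = "sports"          # assumed example category name
page_range = range(1, 4)     # pages 1, 2, 3 (upper bound is exclusive)


def iter_dates(start, end):
    """Yield every date from start to end, inclusive."""
    current = start
    while current <= end:
        yield current
        current += timedelta(days=1)


# Expand the settings into the individual days and pages to visit.
days = [d.isoformat() for d in iter_dates(start_date, end_date)]
pages = list(page_range)

print(category, days, pages)
```

Expanding the date and page ranges up front makes it easy to confirm what will be fetched before starting a long run.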