Name: scrape_c4j_tumblr
Owner: Code for Japan
Description: Scrapes the Code for Japan Tumblr site and outputs a CSV file for import into WordPress.
Created: 2017-12-11 03:58:13.0
Updated: 2017-12-11 04:17:25.0
Pushed: 2017-12-11 04:21:12.0
Homepage: null
Size: 11
Language: Python
Scrapes the archived Code for Japan website (http://archive.code4japan.org/) and writes a CSV file for import into WordPress. Images referenced by img src attributes in the posts are downloaded alongside the CSV.
out/export.csv: the generated CSV file, intended for a WordPress CSV import.
out/images/full: the downloaded images. File names are derived from the SHA1 of each image URL.
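The image naming can be illustrated with a small sketch. It assumes the crawler relies on Scrapy's stock ImagesPipeline, which stores each image as full/<SHA1 hex digest of the image URL>.jpg. The helper name image_path_for and the example URL are hypothetical, and a customized pipeline could name files differently.

```python
import hashlib
import os


def image_path_for(url, out_dir="out/images"):
    """Return the path where the image for `url` is expected to be stored.

    Assumption: Scrapy's default ImagesPipeline naming, full/<sha1(url)>.jpg.
    """
    digest = hashlib.sha1(url.encode("utf-8")).hexdigest()
    return os.path.join(out_dir, "full", digest + ".jpg")


if __name__ == "__main__":
    # Hypothetical image URL, for illustration only.
    print(image_path_for("http://example.com/photo.png"))
```

This makes it possible to map an img src value found in a post back to its file under out/images/full.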
Confirmed to work with Python 2.7.12.
Requires scrapy. If scrapy is not installed, install it first:
pip install scrapy
Clone this repository:
git clone https://github.com/codeforjapan/scrape_c4j_tumblr.git
Move into the cloned directory and run the crawler:
cd scrape_c4j_tumblr
scrapy crawl c4j
If the crawl finishes OK, an out directory is created. It contains export.csv (the generated CSV) and an images directory (the downloaded images):
ls out
export.csv images
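Before importing, it can help to check what ended up in the CSV. The following is a minimal sketch, assuming export.csv is UTF-8 encoded and starts with a header row. The column names are not documented here, so the code only reports whatever header it finds. The function name summarize_csv is hypothetical, and the snippet targets Python 3, whereas this README mentions Python 2.7.12 for the crawler itself.

```python
import csv
import io
import os


def summarize_csv(path):
    """Return (header, row_count) for the CSV file at `path`."""
    # newline="" lets the csv module handle line breaks inside quoted fields.
    with io.open(path, encoding="utf-8", newline="") as f:
        reader = csv.reader(f)
        header = next(reader, [])          # first row, or [] for an empty file
        row_count = sum(1 for _ in reader)  # remaining data rows
    return header, row_count


if __name__ == "__main__":
    csv_path = "out/export.csv"
    if os.path.exists(csv_path):
        header, rows = summarize_csv(csv_path)
        print("columns:", header)
        print("rows:", rows)
    else:
        print(csv_path + " not found; run the crawl first")
```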
This software is released under the MIT License, see LICENSE.txt.