Skip to content

Latest commit

 

History

History
48 lines (25 loc) · 1.04 KB

README.md

File metadata and controls

48 lines (25 loc) · 1.04 KB

Description

personal web album

crawl images from web(www.sina.com.cn for example)

put image urls into a database, url filtered by python bloomfilter

add images into your web album

extract the fingerprint of images for future image search

Environment & Tools

0. Ubuntu11.10

1. Apache2

2. MySQL5.0

3. Scrapy

4. Gallery2.0(galleryremote needed for adding images through gallery API)

5. OpenCV

6. pHash Open Library

Details

install and configure apache2, mysql5.0, scrapy, gallery2.0.

modify the directory name of image storage.

run the command 'scrapy crawl sina_image' in the crawl directory to start to crawl images.

you can set the number of crawling for one time and at the same time the url of images will be written into the database.

after crawling, execute the addimages.py to add the images into gallery2.

if all goes well, you can browse your images already.

of course, you can execute above steps repeatedly to update your album.