- You will need to:
    - Install requirements
    - Install Elasticsearch
    - Install Cassandra
    - Install harvesters
    - Install RabbitMQ
- Create and activate a virtual environment for scrapi, then go to the top-level project directory. From there, run
$ pip install -r requirements.txt
and the Python requirements for the project will be downloaded and installed.
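If you haven't set up a virtual environment before, one common approach uses the virtualenv tool (the environment name scrapi-env below is just an example):
$ pip install virtualenv
$ virtualenv scrapi-env
$ source scrapi-env/bin/activate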
Note: JDK 7 must be installed for Cassandra and Elasticsearch to run. On macOS, you can install both with Homebrew:
$ brew install cassandra
$ brew install elasticsearch
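To check that a suitable JDK is installed and on your path, you can run:
$ java -version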
Now, just run
$ cassandra
$ elasticsearch
Or, if you'd like Cassandra to run in the foreground, bound to your current terminal session, run:
$ cassandra -f
and you should be good to go.
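To confirm both services came up, you can do a quick smoke test; by default Elasticsearch answers HTTP requests on port 9200, and cqlsh (which ships with Cassandra) connects to the local instance:
$ curl http://localhost:9200
$ cqlsh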
- To start the website, just run
$ python server.py
from the scrapi/website/ directory, and the server should be up and running!
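You can check that the server is responding with curl; the port below is an assumption (5000, the Flask default), so substitute whatever port server.py reports when it starts:
$ curl http://localhost:5000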
- To set up harvesters for the first time, just run
$ invoke init_harvesters
and the harvesters specified in the worker_manager manifest files, along with their requirements, will be installed.
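To see which harvesters will be set up, you can list the manifest files; the path below is an assumption based on the description above, so adjust it to wherever the worker_manager manifests actually live:
$ ls worker_manager/manifests/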
On macOS:
$ brew install rabbitmq
On Ubuntu/Debian:
$ sudo apt-get install rabbitmq-server
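After installing, make sure the broker is actually running. rabbitmq-server starts it in the foreground (on macOS, Homebrew installs it to /usr/local/sbin, which may not be on your PATH); on Linux it is usually managed as a service:
$ rabbitmq-server
$ sudo service rabbitmq-server start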
- From the top-level project directory, run:
$ invoke celery_beat
to start the scheduler, and
$ invoke celery_worker
to start the worker.
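Both commands stay attached to their terminal (assuming the invoke tasks run Celery in the foreground, which is Celery's default), so you'll typically want two separate sessions:
$ invoke celery_beat    # terminal 1: the scheduler
$ invoke celery_worker  # terminal 2: the worker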
- To run the tests for the project, just type
$ invoke test
and all of the tests in the 'tests/' directory will be run.
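If you only need to run part of the suite, you can usually point the underlying test runner at a single file; the example below assumes the suite runs under nose, and tests/test_example.py is a hypothetical file name:
$ nosetests tests/test_example.py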