Hostview Processing

Hostview Processing contains the data processing scripts that store the raw files (sqlite dbs, pcaps, jsons) from Hostview clients (received with hostview-upload) to the backend database. The basic workflow is as follows:

a master process is watching new files in an incoming folder
when a new file is received, a processing task is queued on the task queue
a free worker process picks up the processing task from the queue and records the data to the db
the worker notifies the master process of the success/failure of the task processing
on failure, the master will requeue (up to some number of times) the task
on success, the file is moved to the permanent storage location
on repeated failures, the file is moved to a separate failed files folder for manual inspection

This workflow is implemented with a node.js cluster and a Redis backed job queue.

The app can be run without the file watcher to batch process a set of files. In this case it will just process the files in the configured folder, parse and store all data to the database, and exit.

The app depends on Redis (job queue) and Postgresql database (to store the data).

The app processes the following raw files:

xxx_stats.db - sqlite db
xxx.json - browser plugin data (TODO)
xxx.part|last.pcap - pcap

Deployment

The dev and prod deployments are managed as Docker containers.

Docker volume sharing makes storing the raw files on the host a bit tricky as the user ids of the container and the host do not match ... The current solution is to make sure the /data mount point on the host is writable by everyone. Check the configuration before running the containers.

Development

To (re)build the app image, use Docker Compose:

docker-compose -f dev.yml build

To run the containers, use Docker Compose (will start required Redis + Postgresql + App containers):

docker-compose -f dev.yml up

Note, the first boot is long + verbose as the Postgresql container initialzes the database.

To get a shell access to the app container (will not start the app) with data mounted on the host, do:

docker run --rm -it hostview/processing /bin/bash

To find out more about running containers, networks etc:

$ docker ps -a                       // list all containers
$ docker network ls                  // list networks
$ docker start/stop <container id>   // start/stop existing container

To connect to the development database container:

$ docker run -it --rm --net=<hostview back-tier network name> --link <postgres container name>:postgres postgres psql -h postgres -U hostview

The code uses 'debug' library, enable debug logs with environment variable DEBUG=hostview.

Testing

To run all the unit tests (in ./app/test), first make sure a postgres + redis are running somewhere (e.g. in a container from above). NOTE: the tests will clear all data in the DB so do not run the tests againts a production DB !!!

Then run the unit tests:

$ docker run --rm --net=<hostview back-tier> --link <postgres container name>:postgres -v $PWD/app:/app -v /app/node_modules -e DEBUG=hostview -e NODE_ENV=development -e TEST=1 -e PROCESS_DB=postgres://hostview:h0stvi3w@postgres/hostview hostview/processing

Production

To (re-)create the database tables, do:

$ psql -h localhost -U hostview -d hostview2016 -f docker-entrypoint-initdb.d/init-hostview.sql

To connect to the local database, do:

$ psql -W -h localhost -U hostview -d hostview2016

To run the app in production mode, use Docker Compose (will start Redis + App containers, with on-host Postgresql):

PROCESS_DB=postgres://<user>:<password>@ucn.inria.fr/hostview2016 docker-compose -f prod.yml up -d

Name		Name	Last commit message	Last commit date
Latest commit History 71 Commits
app		app
app2		app2
docker-entrypoint-initdb.d		docker-entrypoint-initdb.d
python-src		python-src
tcptrace-src		tcptrace-src
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
datarsync.sh		datarsync.sh
dbdrop.sh		dbdrop.sh
dev.yml		dev.yml
prod.yml		prod.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hostview Processing

Deployment

Development

Testing

Production

About

Releases

Packages

Contributors 4

Languages

License

inria-muse/hostview-processing

Folders and files

Latest commit

History

Repository files navigation

Hostview Processing

Deployment

Development

Testing

Production

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages