Random Data Maker

Overview

Random Data Maker is a REST webservice for generation random data sets from a defined model, including personal data, geojson features collections and medical metrics data for machine learning, tests, mocks etc..

Requirements

Software:

Python 3.7.5
MySQL 8.0 with Docker support provided

Frameworks & libraries:

See requirements.txt

Django 2.2.7
django-cors-headers 3.1.1
djangorestframework 3.10.3
enum34 1.1.6
jsonfield 2.0.2
mysqlclient 1.4.5
sqlparse 0.3.0

Run

Docker

Create external docker network named webproxy

docker network create webproxy

Run container stack

docker-compose up

Manual

To run this app you will need an external MySQL database, either your own, or one created using dockerfile from inside of the project

docker-compose -f docker-compose-db-only.yml up.

Install virtual environments

python3 -m venv venv
. venv/bin/activate
pip install -r requirements.txt

Make migrations

python manage.py makemigrations
python manage.py migrate --settings=peselgen.settings-dev

And run the app:

python manage.py runserver --settings=peselgen.settings-dev

Endpoints

/generate - method: POST - generates specified number of: persons, metrics, geolocations and attributes

/person

method: GET - returns generated persons
method: POST - generates specified number of persons
method: DELETE - deletes all persons

/metrics

method: GET - returns generated metrics
method: POST - generates specified number of metrics
method: DELETE - deletes all metrics

/attributes

method: GET - returns generated attributes
method: POST - generates specified number of attributes
method: DELETE - deletes all attributes

/geolocation

method: GET - returns generated geolocation collections
method: POST - generates specified number of geolocation collections, number of points to generate and conutry can be specified in body parameters
method: DELETE - deletes all geolocations

Number of records to generate can be specified in body parameter

Data model

Person model

Generated person entities contain:

id: int, private key
first_name: varchar(30)
last_name: varchar(30)
pesel: varchar(11) - unique person identification number based on birth date and sex
email: varchar(64)
phone: varchar(12)
password: varchar(16)
sex: varchar(1) - '1' - male; '2' - female
birth_date: datetime - generated basing on polish General Statistics Departments' informations about births count
geolocation_id: varchar - foreign key references id from geolocation

Metrics model

Patient metrics entities contain:

id: int, private key
patient_id: varchar, foreign key references id from person
doctor_id: varchar
created: datetime
attributes_id: varchar, foreign key references id from attributes
notes: varchar(1024)

Attributes model

Dynamic data model is defined in JSON format. It is used to specify what kind of data app should generate. It is generated in a form of the list of objects with predefined attributes:

key - string value representing object name
count: optional - integer value representing how many values should be generated, default = 1
Generating values from array
- array - array of values from whitch data will be generated
- weights: optional - array of numbers showing the probability of occurance of element from attribute array, must be the same length as array
Generating numbers between two values, type is being resolved dynamically (1 - discrete value, 1.0 - continuous value):
- minimum - smallest value that can be generated
- maximum - biggest value that can be generated
- floating_points: optional - number of digits after decimal point
Generating datetime between two values:
- min_date - smallest datetime value that can be generated
- max_date - biggest datetime value that can be generated
- date_format: optional - default format is: '%m/%d/%Y %I:%M %p'
Geolocation model
id: varchar, foreign key references id from person
geojson: json, includes location based on geojson model - default is random point within Poland polygon

Anonimization

Data from any endpoint can be returned in anonimized form. To do so sensitive fileds should be specified in array as anonymize_array parameter e.g. /api/person/?anonymize_array=pesel,first_name in GET person method. To get full raw data parameter should be equal to 'none'

Swagger

API can be tried out with Swagger interface under default aplication address

How it works?

Random Data Maker is a rest api app and has defined endpoints to communicate with it. It has three kinds of endpoints: generating number of random records (POST), getting them from database in the form of JSON (GET) and flushing tables (DELETE). When post method is used it generates random data e.g personal data, metrics data using apps own random data generator and saves it in database. This random data generator takes JSON file with defined data model and uses it to generate new data. When data is in a database it can be accessed with get methods. Project was originally made as support for medical machine learning system OCULUS Project

About us

We're Computer Science students at Poznan University of Technology. We like programming and had a lot of fun creating this project.
Artur Bałczyński
Mateusz Ostrowski
Adam Przywuski
Maciej Stosik

Name		Name	Last commit message	Last commit date
Latest commit History 122 Commits
.circleci		.circleci
anonymization		anonymization
attributes		attributes
generator		generator
geolocation		geolocation
metrics		metrics
person		person
peselgen		peselgen
resources		resources
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
docker-compose-db-only.yml		docker-compose-db-only.yml
docker-compose.yml		docker-compose.yml
init.sh		init.sh
manage.py		manage.py
requirements.txt		requirements.txt
tests		tests

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Random Data Maker

Overview

Requirements

Software:

Frameworks & libraries:

Run

Docker

Manual

Endpoints

Data model

Person model

Metrics model

Attributes model

Geolocation model

Anonimization

Swagger

How it works?

About us

About

Releases

Packages

Contributors 4

Languages

RandomDataMaker/rdm-services

Folders and files

Latest commit

History

Repository files navigation

Random Data Maker

Overview

Requirements

Software:

Frameworks & libraries:

Run

Docker

Manual

Endpoints

Data model

Person model

Metrics model

Attributes model

Geolocation model

Anonimization

Swagger

How it works?

About us

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages