- Introduction
- Prerequisites
- Launch containers
- Bloom Filters in Redis with RedisBloom
- Timeseries in Redis with RedisTimeSeries
- RedisGears
- An interesting use case I learned about
This repository holds all the necessary source code and template files to run the following tutorials.
Following along with the tutorial on your own machine will require you to bring up the docker-compose stack.
MacOS or Linux operating systems are preferred: cAdvisor and Node Exporter are not compatible with Windows. You may still run through the demo on Windows, but you will not be able to see container-level metrics in Grafana.
Bring up the docker-compose stack:
> docker-compose up -d
A compiled version of Redis has been uploaded to DockerHub for this demo; however, if you want to build a custom version of Redis locally, you can run:
> docker-compose -f docker-compose.build.yml up -d
This may take a minute or two while the images are pulled and started.
Verify the containers are up and running either with:
> docker ps -a
or by visiting the services individually:
Service | URL |
---|---|
Grafana | http://localhost:3000 |
Prometheus | http://localhost:9090 |
cAdvisor | http://localhost:8080 |
Node Exporter | http://localhost:9100 |
You may find the following dashboard interesting: http://localhost:3000/d/redisdashboard/redis?orgId=1&refresh=5s
Redis:
> docker exec -it db-redis sh
# whoami
root
# redis-cli
127.0.0.1:6379> ping
PONG
If you want an in-depth explanation of Bloom Filters, I'll direct you to the Wikipedia entry: https://en.wikipedia.org/wiki/Bloom_filter
For the purposes of this demo, a Bloom Filter is a probabilistic data structure that can tell you with certainty that an entry is *not* in a set; if it reports an entry as present, that is only probably true. We will be using the RedisBloom module for this demo. Common use cases include (a toy sketch follows this list):
- URL Shorteners
- Cache Filtering
- Counting Filters
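To make the mechanics concrete, here is a toy from-scratch sketch in Python. This is purely illustrative and is not how RedisBloom implements it; the bit-array size and hash count below are arbitrary.

```python
import hashlib

class ToyBloomFilter:
    """k salted hashes set bits in an m-bit array; lookups can only say
    'definitely absent' or 'probably present'."""

    def __init__(self, m=1024, k=3):
        self.m = m       # number of bits
        self.k = k       # number of hash functions
        self.bits = 0    # a Python int doubles as the bit array

    def _positions(self, item):
        # Derive k bit positions by salting a sha256 digest with the hash index.
        for i in range(self.k):
            digest = hashlib.sha256(f"{i}:{item}".encode()).hexdigest()
            yield int(digest, 16) % self.m

    def add(self, item):
        for pos in self._positions(item):
            self.bits |= 1 << pos

    def might_contain(self, item):
        # False means definitely absent; True only means probably present.
        return all(self.bits & (1 << pos) for pos in self._positions(item))

bf = ToyBloomFilter()
bf.add("visitor:42")
print(bf.might_contain("visitor:42"))  # True  (probably present)
print(bf.might_contain("visitor:7"))   # False (definitely absent), barring a false positive
```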
It is important that we size our bloom filters appropriately for our use cases.
Further in the tutorial, we will use the following website to calculate the parameters of our bloom filters: https://hur.st/bloomfilter/
Without going into too much detail about how bloom filters work internally, this calculator finds the most efficient parameters: the smallest bit array and the number of hash functions needed to hold a given number of items at a desired error rate.
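For reference, the calculator is applying the standard formulas: for n items at false-positive rate p, the optimal bit count is m = -n·ln(p)/(ln 2)² and the optimal hash count is k = (m/n)·ln 2. A quick sketch that reproduces its numbers:

```python
import math

def bloom_parameters(n, p):
    """Optimal bit count m and hash-function count k for n items at error rate p."""
    m = math.ceil(-n * math.log(p) / (math.log(2) ** 2))
    k = round((m / n) * math.log(2))
    return m, k

# The demo below reserves a filter for 10 items at a 0.0001 error rate.
m, k = bloom_parameters(n=10, p=0.0001)
print(f"{m} bits, {k} hash functions")  # 192 bits, 13 hash functions
```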
Note: HyperLogLog is a probabilistic data structure that is included in Redis by default. It could be used in place of Bloom Filters on OSS-only Redis offerings such as AWS ElastiCache and GCP Memorystore, which cannot load modules like RedisBloom.
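As a rough sketch of that alternative (assuming redis-py and a local Redis): PFADD returns 1 when the element changed the HyperLogLog's internal registers, i.e. when it was likely new, which is what lets it approximate the bloom filter's role here.

```python
import redis

r = redis.Redis(host="localhost", port=6379)

# PFADD returns 1 when the HyperLogLog's registers changed (element likely new).
print(r.pfadd("visitors:hll", "visitor:42"))  # 1 the first time
print(r.pfadd("visitors:hll", "visitor:42"))  # 0 on a repeat
print(r.pfcount("visitors:hll"))              # approximate unique count
```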
This demo will be an extension of a Counting Filter.
Let's say we want to count all the unique visitors to a website, where each visitor has a unique ID, and we want the ability to query whether a certain visitor has already visited the site so as to avoid double counting.
The most obvious solution would be to create an enormous set containing every ID. In Redis, that data structure would be a Set. However, this solution does not scale to millions or even billions of unique visitors, since every ID has to be stored in full.
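A quick sketch of both approaches with redis-py (the key names are invented for illustration, and the bloom variant assumes the RedisBloom module from this demo's stack): the exact Set answers membership perfectly but stores every ID in full, while the bloom filter uses a small fixed-size structure at the cost of rare false positives.

```python
import redis

r = redis.Redis(host="localhost", port=6379)

visitor = "visitor:42"

# Exact approach: a Redis Set stores every visitor ID verbatim.
if r.sadd("visitors:set", visitor):  # returns 1 only when the ID is new
    r.incr("visitors:count:exact")

# Probabilistic approach: RedisBloom keeps a fixed-size filter instead.
if r.execute_command("BF.ADD", "visitors:bloom", visitor):  # 1 when (probably) new
    r.incr("visitors:count:approx")
```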
Let's start:
> docker exec -it db-redis sh
# whoami
root
# redis-cli
127.0.0.1:6379>
Let's assume we want to store 10 items with an error rate of 0.0001, so we'll reserve a filter with those parameters and start adding entries. Redis returns 1 if a new element was added and 0 if it appeared to already be in the filter.
127.0.0.1:6379> BF.RESERVE bloomTest 0.0001 10
OK
127.0.0.1:6379> BF.MADD bloomTest elem1 elem2 elem3 elem4 elem5 elem6 elem7 elem8
1) (integer) 1
2) (integer) 1
3) (integer) 1
4) (integer) 1
5) (integer) 1
6) (integer) 1
7) (integer) 1
8) (integer) 1
127.0.0.1:6379> BF.ADD bloomTest elem7
(integer) 0
127.0.0.1:6379> BF.MADD bloomTest elem9 elem10 elem11
1) (integer) 1
2) (integer) 1
3) (integer) 1
127.0.0.1:6379> BF.MADD bloomTest elem12 elem13 elem14 elem15
1) (integer) 1
2) (integer) 1
3) (integer) 1
4) (integer) 1
127.0.0.1:6379> BF.MADD bloomTest elem16 elem17 elem18
1) (integer) 1
2) (integer) 0
3) (integer) 1
You'll notice that we were able to add 16 elements before the bloom filter failed us: elem17 was reported as already present even though it was never added (a false positive). That's because the error rate climbs quite rapidly once you exceed the capacity you reserved.
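If you want to see the degradation for yourself, here is a small sketch (assuming redis-py; the key and element names are made up) that overfills a filter reserved for 10 items and then probes it with elements that were never added. The exact count you get back depends on your RedisBloom version and its scaling behavior.

```python
import redis

r = redis.Redis(host="localhost", port=6379)

r.delete("overfilled")
r.execute_command("BF.RESERVE", "overfilled", 0.0001, 10)

# Add ten times the reserved capacity.
for i in range(100):
    r.execute_command("BF.ADD", "overfilled", f"elem{i}")

# Probe with elements that were never inserted; every 1 is a false positive.
hits = sum(r.execute_command("BF.EXISTS", "overfilled", f"ghost{i}") for i in range(1000))
print(f"false positives: {hits} / 1000")
```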
Time series data is sequential data. Analysis of this data is often reduced to running aggregation queries, both to cut processing overhead and to extract intelligence in real time. We can use RedisTimeSeries to help us achieve this. Typical use cases include:
- Stock ticker data
- IoT sensor data
- Fleet management (vehicle id, timestamp, GPS coordinates, average speed)
- Lightweight storage where a full RDBMS would be too heavy
Note: in Redis, if you do not set a retention policy, old samples are never trimmed automatically; under memory pressure, Redis may fall back on its LRU eviction to clean up old keys.
Within the scripts folder, run:
> python ./populateTimeSeries.py --sensor-id=10
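The script itself isn't reproduced here, but here is a minimal sketch of what a populate script like this might do with redis-py, following the temperature:<sensor-id> key naming and the seconds-based timestamps the queries below use (the sample values are invented):

```python
import random
import time

import redis

r = redis.Redis(host="localhost", port=6379)

sensor_id = 10  # the real script takes this as --sensor-id
key = f"temperature:{sensor_id}"

# TS.ADD creates the series on first use and appends (timestamp, value) samples.
now = int(time.time())
for offset in range(120):
    r.execute_command("TS.ADD", key, now - 120 + offset, random.uniform(15.0, 30.0))
```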
Then exec into Redis:
> docker exec -it db-redis sh
# date +%s
1554591979
# redis-cli
127.0.0.1:6379> TS.RANGE temperature:10 1554591900 1554591979
...
...
...
127.0.0.1:6379> TS.RANGE temperature:10 1554591900 1554591979 AGGREGATION SUM 10
...
127.0.0.1:6379> TS.RANGE temperature:10 1554591900 1554591979 AGGREGATION AVG 10
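The AGGREGATION clause splits the requested range into fixed-size time buckets (10 time units here) and reduces each bucket with the given function, so the SUM query returns one total per bucket and the AVG query one mean per bucket. The same queries can be issued from Python; a sketch assuming redis-py:

```python
import time

import redis

r = redis.Redis(host="localhost", port=6379)

key = "temperature:10"
now = int(time.time())

# Raw samples, then one aggregate per 10-second bucket over the same range.
raw = r.execute_command("TS.RANGE", key, now - 80, now)
avgs = r.execute_command("TS.RANGE", key, now - 80, now, "AGGREGATION", "AVG", 10)
print(avgs)  # [[bucket_start_timestamp, value], ...]
```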
Officially, according to the documentation, RedisGears is a "dynamic execution framework" for your Redis data.
Unofficially, it's a neat way to add map/reduce/filter functionality with a built-in Python interpreter. Unfortunately, this means any map/reduce functions are single-threaded due to the Python Global Interpreter Lock, so processing may not be the fastest.
TBD. But let's go over the topics off the top of my head.
The following script can be used to populate sample data for trying out the RedisGears commands below:
> python ./populateRandomData.py
Within the redis-cli, you can run these following commands:
Use Case | Command |
---|---|
List all keys | RG.PYEXECUTE "GearsBuilder().map(lambda x: x['key']).run()" |
Delete all keys | RG.PYEXECUTE "GearsBuilder().map(lambda x: x['key']).foreach(lambda x: execute('del', x)).count().run()" |
Count number of keys | RG.PYEXECUTE "GearsBuilder().count().run()" |
Count keys with values below vs. at least 50 | RG.PYEXECUTE "GearsBuilder().map(lambda x: {'key':x['key'], 'value': 0 if int(x['value']) < 50 else 100}).countby(lambda x: x['value']).collect().run()" |
Get the average value across all keys | RG.PYEXECUTE "GearsBuilder().map(lambda x:int(x['value'])).avg().run()" |
Aggregate over the keys to get the total sum | RG.PYEXECUTE "GearsBuilder().map(lambda x:int(x['value'])).aggregate(0, lambda r, x: x + r, lambda r, x: x + r).run()" |
Delete all keys prefixed with city | RG.PYEXECUTE "GearsBuilder().map(lambda x: x['key']).foreach(lambda x: execute('del', x)).count().run('city:*')" |
Delete all keys added in the future | RG.PYEXECUTE "GearsBuilder().foreach(lambda x: execute('del', x['key'])).register()" |
Unfortunately, we currently cannot unregister an existing gear, so that last command effectively makes Redis unusable: anything you write is immediately deleted.
This feature is expected soon though: RedisGears/RedisGears#44
To revert any gears you have created, restart all your services:
docker-compose down && docker system prune -f && docker volume prune -f && docker-compose up -d
In traditional machine learning pipelines, data sits on a disk/database and is pulled into memory and used on demand while training. In the case of LSTM Neural Networks, you often take a linear sequence of data and turn it into batches of sequential sliding windows. These batches are often stored in variables in memory and rebuilt every time a new model is created.
But what if you wanted to train multiple models at the same time on separate machines, to see their performance with the most up-to-date data as soon as possible?
The simple solution would be to share this data across all servers and have them build the training batches on demand. This solution, while correct, repeats the redundant step of building the batches of sequential sliding windows on every machine. This is where Redis fits into our picture.
Instead of building the batches multiple times and storing them in variables within each machine learning pipeline, we can build the batches once, store them in Redis, and have each of the machine learning models pull from Redis as each batch is needed during both the training and evaluation phases. This way, reading off disk is only done once; data can be discarded from memory as soon as it is consumed and fetched from Redis again the next time it is needed.
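A sketch of the idea (the window size, key names, and JSON serialization are illustrative choices, not anything prescribed above): one producer materializes the sliding windows once and writes them to Redis; each training worker then fetches batches by key as it needs them.

```python
import json

import redis

r = redis.Redis(host="localhost", port=6379)

def build_windows(sequence, window=3):
    """Turn a linear sequence into overlapping sliding windows."""
    return [sequence[i:i + window] for i in range(len(sequence) - window + 1)]

# Producer: build the batches once and store them in Redis.
series = [0.1, 0.4, 0.35, 0.8, 0.6, 0.9, 0.7]
for i, win in enumerate(build_windows(series)):
    r.set(f"batch:{i}", json.dumps(win))

# Consumer (one of several training workers): fetch a batch when needed,
# consume it, and let it fall out of local memory afterwards.
batch = json.loads(r.get("batch:0"))
print(batch)  # [0.1, 0.4, 0.35]
```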