A simple project that gets information from the Ethereum blockchain and forwards it to Hive via Kafka and Flink.
Pipeline:
Ethereum node (Infura) -> Kafka -> Flink (Table API) -> HDFS FileSystem -> HiveQL (planned)
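To make the first hop of the pipeline concrete, here is a minimal sketch of how a block could be requested from Infura and prepared as a Kafka message. It is not taken from this project's code: the function names are hypothetical, the endpoint is assumed to be Infura's mainnet JSON-RPC URL, and JSON is assumed as the message format. It uses only the standard library and does not send anything over the network.

```python
import json

# Assumption: Infura's mainnet JSON-RPC endpoint; the key is the INFURA_API_KEY setting.
INFURA_URL_TEMPLATE = "https://mainnet.infura.io/v3/{api_key}"


def build_block_request(block_number: int, request_id: int = 1) -> dict:
    """Build an eth_getBlockByNumber JSON-RPC request for one block (hypothetical helper)."""
    return {
        "jsonrpc": "2.0",
        "method": "eth_getBlockByNumber",
        # The block number is passed as a hex string; True asks for full transaction objects.
        "params": [hex(block_number), True],
        "id": request_id,
    }


def to_kafka_value(block: dict) -> bytes:
    """Serialize a block dict to compact UTF-8 JSON bytes (assumed message format)."""
    return json.dumps(block, separators=(",", ":")).encode("utf-8")


request = build_block_request(17_000_000)
print(json.dumps(request))
```

The request body would be POSTed to the URL built from `INFURA_URL_TEMPLATE`, and the bytes returned by `to_kafka_value` could be passed as the `value` of a `KafkaProducer.send` call from kafka-python.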
pip3 install pyyaml
pip3 install kafka-python
pip3 install apache-flink
sudo apt install python-is-python3
See details in kafka-start.sh
CURRRENT_BLOCK_NUBER: XXXXXXX
INFURA_API_KEY: XXXXXXXXXXXXXXXXXXXXXXXXXX
KAFKA_BOOTSTRAP: localhost:9092
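The settings above look like YAML, so a loader along the following lines may be useful. This is a sketch under assumptions, not the project's actual code: `load_config` and `REQUIRED_KEYS` are hypothetical names, the key spellings are copied exactly from the listing above, and the example values are placeholders.

```python
import yaml  # provided by the pyyaml package

# Key names copied verbatim from the settings listed above.
REQUIRED_KEYS = ("CURRRENT_BLOCK_NUBER", "INFURA_API_KEY", "KAFKA_BOOTSTRAP")


def load_config(text: str) -> dict:
    """Parse YAML config text and fail early if a required key is missing."""
    config = yaml.safe_load(text) or {}
    missing = [key for key in REQUIRED_KEYS if key not in config]
    if missing:
        raise KeyError(f"missing config keys: {', '.join(missing)}")
    return config


# Placeholder values for illustration only.
example = """
CURRRENT_BLOCK_NUBER: 17000000
INFURA_API_KEY: your-infura-key
KAFKA_BOOTSTRAP: localhost:9092
"""
config = load_config(example)
print(config["KAFKA_BOOTSTRAP"])
```

Checking for required keys at load time means a missing API key or bootstrap address is reported before any connection to Infura or Kafka is attempted.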
Yes, I'm planning to make a Docker image for this project.