Debezium-Kafka-PySpark pipeline with a Transformer-based predictive model for processing real-time data

Aktharnvdv/Realtime_dataAnalysis

Debezium-Kafka-PySpark pipeline with Transformer-based predictive model

Prerequisites

  • Docker
  • Docker Compose

Setup & Running Instructions

  1. Start the Docker containers (add -d to run them in the background; otherwise run the remaining steps in a second terminal):

    sudo docker-compose up
  2. Enter the Kafka container (the container name depends on the Compose project name; check it with docker ps):

    docker exec -it oman_ai_project-kafka-1 /bin/bash
  3. Run the Kafka console consumer from inside the Kafka container:

    bin/kafka-console-consumer.sh --bootstrap-server kafka:9092 --topic server1.Naveed.dbo.Table1 --from-beginning
  4. Validate the SqlServerConnector configuration (run from the host, in the directory that contains payload.json):

    sudo curl -i -X PUT -H "Accept:application/json" -H "Content-Type:application/json" localhost:8083/connector-plugins/io.debezium.connector.sqlserver.SqlServerConnector/config/validate -d @payload.json
  5. Register the connector with a POST request (run from the directory that contains payload1.json):

    sudo curl -H "Accept:application/json" -H "Content-Type:application/json" -X POST -d @payload1.json http://localhost:8083/connectors
  6. List the registered connectors with a GET request:

    curl -X GET http://localhost:8083/connectors
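
The validate and POST steps use two different payload files because the Kafka Connect REST API expects two different shapes: the validate endpoint takes a flat map of connector properties, while POST /connectors takes a connector name plus a nested config object. The sketch below shows one way the two files could be generated from a single config. It is an illustration only, not the repository's actual payloads: the connector name, host, port, user, and password are hypothetical placeholders, the database, table, and topic prefix are guessed from the topic name server1.Naveed.dbo.Table1, and the property names assume Debezium 2.x.

```python
import json

CONNECTOR_CLASS = "io.debezium.connector.sqlserver.SqlServerConnector"


def build_connector_config():
    """Return a flat connector config (assumes Debezium 2.x property names)."""
    return {
        "connector.class": CONNECTOR_CLASS,
        # Placeholders: replace with the values from your docker-compose setup.
        "database.hostname": "sqlserver",
        "database.port": "1433",
        "database.user": "sa",
        "database.password": "<password>",
        # Guessed from the topic name server1.Naveed.dbo.Table1.
        "database.names": "Naveed",
        "topic.prefix": "server1",
        "table.include.list": "dbo.Table1",
        # Debezium stores schema history in its own Kafka topic.
        "schema.history.internal.kafka.bootstrap.servers": "kafka:9092",
        "schema.history.internal.kafka.topic": "schema-changes.Naveed",
    }


def validation_payload(name, config):
    """Body for PUT /connector-plugins/<class>/config/validate: a flat map."""
    return {"name": name, **config}


def registration_payload(name, config):
    """Body for POST /connectors: a name plus a nested config object."""
    return {"name": name, "config": dict(config)}


if __name__ == "__main__":
    config = build_connector_config()
    # "sqlserver-connector" is a hypothetical connector name.
    print(json.dumps(validation_payload("sqlserver-connector", config), indent=2))
    print(json.dumps(registration_payload("sqlserver-connector", config), indent=2))
```

Printing the payloads rather than writing them to disk avoids overwriting an existing payload.json or payload1.json; redirect the output to a file once the values match your environment.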
