Skip to content

amod26/DataEngineeringWithPython

Repository files navigation

Data Engineering With Python

  • This repo contains different projects using Python language and different SQL & NoSQL databases like PostgreSQL, Apache Cassandra, SQlite3.
  • Prime focus of this projects are to get data from different sources and load into some database based on the nature and usecase of the project.
  • We will use some API to connect to the data and do transformation of the data using Python libraries and functions.
  • Usage of AWS services like S3, Redshift, IAM, Glue,EMR.
  • Transforming schemas from 3NF to star schema for simplification of query and to increase optimization.