Junior Data Engineer | AWS Certified | Python, SQL, ETL, Cloud Data Platforms | Building Scalable Data Pipelines | 2+ Years of Experience in Data
Hi, I'm Mariam, a passionate Junior Data Engineer with over 2 years of experience in data engineering, ETL processes, and cloud platforms. I have built data pipelines, analysed large datasets, and implemented cloud-based solutions that support data-driven decision-making and business outcomes.
A fully automated data pipeline that collects, analyses, and stores daily news articles. It fetches relevant articles from the Newsdata.io API, runs sentiment analysis on them, and stores the processed data in a PostgreSQL database hosted on Amazon RDS.
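The fetch, score, and store flow described above could be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the `fetch` callable stands in for the Newsdata.io request, the word lists are a toy replacement for a real sentiment model, and an SQLite connection stands in for the PostgreSQL database on Amazon RDS. The names `score_sentiment`, `run_pipeline`, and the `articles` table are all hypothetical.

```python
import sqlite3
from typing import Callable, Iterable

# Hypothetical word lists: a toy stand-in for a real sentiment model.
POSITIVE = {"gain", "growth", "win", "good", "record"}
NEGATIVE = {"loss", "crash", "fail", "bad", "crisis"}


def score_sentiment(text: str) -> str:
    """Label text by counting positive and negative words."""
    words = [w.strip(".,!?;:\"'").lower() for w in text.split()]
    balance = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if balance > 0:
        return "positive"
    if balance < 0:
        return "negative"
    return "neutral"


def run_pipeline(fetch: Callable[[], Iterable[dict]], conn: sqlite3.Connection) -> int:
    """Fetch articles, score each title, and upsert the rows.

    `fetch` is assumed to return dicts with "link" and "title" keys.
    SQLite is used here only so the sketch runs locally; with PostgreSQL
    the upsert would be written with ON CONFLICT instead.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS articles ("
        "link TEXT PRIMARY KEY, title TEXT, sentiment TEXT)"
    )
    rows = [(a["link"], a["title"], score_sentiment(a["title"])) for a in fetch()]
    conn.executemany("INSERT OR REPLACE INTO articles VALUES (?, ?, ?)", rows)
    conn.commit()
    return len(rows)
```

Passing the fetch step in as a function keeps the scoring and storage logic testable without network access or an API key; a scheduled job would supply a function that calls the real API.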
This project is a scalable ETL pipeline for Airbnb data, built across AWS and GCP and designed for performance and reliability. It extracts, transforms, and loads data from multiple sources. On AWS it uses S3, Lambda, and Redshift for storage and processing; on GCP it uses BigQuery and Dataflow for analytics and real-time data processing.
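As an illustration of the transform step in a pipeline like this, the sketch below normalises a single raw listing record before loading. The field names (`id`, `city`, `price`) and the `"$1,250.00"` price format are assumptions about the input data, and `clean_listing` is a hypothetical helper rather than code from the project.

```python
from typing import Optional


def clean_listing(raw: dict) -> Optional[dict]:
    """Normalise one raw listing row; return None if it has no id."""
    listing_id = raw.get("id")
    if listing_id in (None, ""):
        return None  # rows without an id cannot be loaded reliably

    # Assumed price format: "$1,250.00" -> 1250.0
    price_text = str(raw.get("price", "")).replace("$", "").replace(",", "").strip()
    try:
        price = float(price_text)
    except ValueError:
        price = None  # keep the row but mark the price as missing

    return {
        "id": str(listing_id),
        "city": str(raw.get("city", "")).strip().title(),
        "price": price,
    }
```

A function of this shape has no cloud dependencies, so it could be called from a Lambda handler or a Dataflow step and unit-tested on its own.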