Skip to content
View SA01's full-sized avatar

Block or report SA01

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
SA01/README.md

About Me - Suffyan Asad

Hi there πŸ‘‹, I'm Suffyan Asad, a Senior Data Engineer with over 11 years of experience in data engineering and building data platforms on the cloud. My GitHub mostly contains code related to my articles on Medium. Follow me on Medium to learn about Apache Spark, SQL, and other data engineering topics: @suffyan.asad1.


🌟 Career Highlights

  • Senior Data Engineer at Yotascale
    Building a cloud cost management and optimization product that helps companies like Hulu, Zoom, ClickUp, and Okta reduce cloud infrastructure costs.
  • Designed and optimized data pipelines to process billions of rows efficiently.
  • Developed anomaly detection systems to detect and alert on cost spikes and runaway costs.
  • Built time-series forecasting solutions using ARIMA and Facebook Prophet.
  • Published multiple articles on Medium covering topics including Data Engineering, Apache Spark, Databases, and Data Warehousing.

πŸ’¬ Let's Connect


πŸ“š Education

  • MS in Business Analytics | George Washington University, Washington DC, USA
  • BS in Computer Science | FAST-NU Lahore, Pakistan

Popular repositories Loading

  1. spark-read-jdbc-tutorial spark-read-jdbc-tutorial Public

    This repository contains the code and examples for my article on Medium, which explains how to parallelize reading data from JDBC sources in Apache Spark.

    Python 5 3

  2. dbt-tutorial dbt-tutorial Public

    This repository contains the code and project files for my article on Medium, which serves as an introduction to DBT (Data Build Tool) and how to build data transformations with it.

    Dockerfile 4 1

  3. docker-spark-cluster docker-spark-cluster Public

    Forked from mvillarrealb/docker-spark-cluster

    A simple spark standalone cluster for your testing environment purposses

    Dockerfile 3 5

  4. spark-window-functions spark-window-functions Public

    This repository contains the code and examples for my article on Medium, which provides an introduction to Window Functions in Apache Spark.

    Python 2

  5. spark-custom-datasource-tutorial spark-custom-datasource-tutorial Public

    Contains the code and examples for my article on Medium, which explains how to create a custom JDBC read-only data source in Apache Spark 3

    Scala 2 1

  6. ProgrmmingForAnalyticsGroupProject ProgrmmingForAnalyticsGroupProject Public

    This is the class project of Programming for Analytics class

    TeX 1