Mortgage Approval Prediction System

It is a home loan approval prediction platform. It predicts the decision of home loan application based on borrower's financial information, property information, geographic context, loan application details, etc.

Below is the diagram of ML pipeline used to train the model for the platform. The pipeline was trained on large-scale (10GB) official 2020 U.S Home Mortgage Disclosure act(HMDA) dataset. And the trained ML model was integrated in web-application to make predictions.

ML pipeline

Stack

Pyspark framework was used to make the entire pipeline
AWS EMR / GCP Dataproc cluster of (1 Master machine and 3 Worker machines) was used as compute to train the pipeline
AWS S3 was used as datasource and warehouse
Streamlit was used to make web-app

Files Overview

2020_lar.txt - dataset file (~10GB and ~25M rows)

Final-GCP.py - contains Pyspark code to build ML pipeline that contains stage like Ingestion, cleaning, Preprocessing, Feature Engineering, Model training and Model evaluation which was run on GCP cluster

models - folder contains trained tranformer & ML models (trained on GCP cluster)

app.py - contains Web UI to make predicts from input fields

Download and run

To run the application run the following commands

streamlit run app.py

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.ipynb_checkpoints		.ipynb_checkpoints
cleaned_dataset		cleaned_dataset
mlartifacts		mlartifacts
mlruns		mlruns
models		models
output		output
.DS_Store		.DS_Store
.gitignore		.gitignore
Data Loading.ipynb		Data Loading.ipynb
Final 2.ipynb		Final 2.ipynb
Final-GCP.py		Final-GCP.py
Final.ipynb		Final.ipynb
Final_submission.zip		Final_submission.zip
Img_1.png		Img_1.png
Img_2.png		Img_2.png
METCS777-Project-Assignment.pdf		METCS777-Project-Assignment.pdf
Mortgage Approval Prediction System_Report.pdf		Mortgage Approval Prediction System_Report.pdf
Mortgage Approval System_PPT.pdf		Mortgage Approval System_PPT.pdf
Mortgage Approval System_PPT.pptx		Mortgage Approval System_PPT.pptx
README.md		README.md
Results.xlsx		Results.xlsx
TermProject_777_proposal_Template(2).docx		TermProject_777_proposal_Template(2).docx
app.py		app.py
image.png		image.png
~$Results.xlsx		~$Results.xlsx

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mortgage Approval Prediction System

ML pipeline

Stack

Files Overview

Download and run

About

Releases

Packages

Languages

Dhruv-praju/Mortgage-approval-system

Folders and files

Latest commit

History

Repository files navigation

Mortgage Approval Prediction System

ML pipeline

Stack

Files Overview

Download and run

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages