This project focuses on exploratory data analysis (EDA) and financial insights using Lending Club loan data from Kaggle. I performed data cleaning, visualization, and trend analysis to uncover key financial patterns, loan defaults, and risk factors.
β
Data Source: Kaggle (Lending Club Loan Data)
β
Visualization Tool: Tableau
β
Key Analysis Areas: Loan status, interest rates, borrower risk profiling, and lending trends
β
Interactive Dashboard: Published on Tableau Public
π― View my Tableau Dashboard: Northeastern RAF Lender Data
π Kaggle Dataset: Lending Club Insightful Financial EDA
- Extracted Lending Club loan dataset from Kaggle.
- Removed missing values, duplicates, and irrelevant columns.
- Standardized data formats (e.g.,
loan_status,interest_rate).
- Loan Default Analysis: Identified patterns in loan defaults based on credit grades.
- Interest Rate Distribution: Analyzed interest rate variations by loan amount and term.
- Borrower Segmentation: Grouped borrowers based on income, loan purpose, and credit history.
- Risk Profiling: Assessed risk factors affecting loan approval and repayment success.
β
Loan Status Distribution (Charged Off, Fully Paid, Current)
β
Interest Rate Trends by loan amount and term
β
Top Reasons for Loan Defaults
β
Credit Score Impact on Loan Approval
β
Debt-to-Income Ratio Analysis
π Higher credit scores correlated with lower interest rates and higher approval rates.
π Short-term loans had significantly higher interest rates compared to long-term loans.
π Debt-to-income ratio was a critical factor in determining borrower risk.
π Top reasons for loan defaults included debt consolidation and small business loans.
πΉ Feature Engineering: Add more derived features for better insights.
πΉ Predictive Modeling: Implement ML models to forecast loan defaults.
πΉ More Interactive Visuals: Enhance Tableau dashboards with drill-down analytics.
πΉ Real-time Data Updates: Automate dataset updates for ongoing analysis.