2018_CarListings

About

This is a project from the Career Foundry Data Analytics Program centered around using scikit-learn with Python for supervised/unsupervised machine learning techniques.

Objective

This data was scraped from TrueCar.com (and uploaded to Kaggle, where the dataset was collected here for use). Information included in the scraping were car price, mileage, year, make, and model. The purpose of this project is to explore relationships and patterns among the data.

Data

Below is the link the dataset collected from Kaggle as "true_car_listings.csv":

https://www.kaggle.com/datasets/jpayne/852k-used-car-listings

The csv files in the "02 Data" folder were uploaded using Git LFS.

Code

The scripts walk through:

Data quality checks and exploratory analysis
Predicting price using linear regression, random forest, and gradient boosting
Using folium and geospatial data to create a choropleth map of price residuals
Performing K-means clustering

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
01 Project Management		01 Project Management
02 Data/Original Data		02 Data/Original Data
03 Scripts		03 Scripts
.gitattributes		.gitattributes
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

2018_CarListings

About

Objective

Data

Code

About

Releases

Packages

Languages

kimballwightman/2018_CarListings

Folders and files

Latest commit

History

Repository files navigation

2018_CarListings

About

Objective

Data

Code

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages