A project from the Career Foundry Data Analytics Bootcamp centered around using scikit-learn with Python for supervised/unsupervised machine learning techniques.

kimballwightman/2018_CarListings

2018_CarListings

About

This project from the Career Foundry Data Analytics Program uses scikit-learn with Python to apply supervised and unsupervised machine learning techniques.

Objective

The data was scraped from TrueCar.com and uploaded to Kaggle, from which it was downloaded for this project. The scraped fields include car price, mileage, year, make, and model. The purpose of this project is to explore relationships and patterns in the data.

Data

Below is the link to the dataset, downloaded from Kaggle as "true_car_listings.csv":

The CSV files in the "02 Data" folder were uploaded using Git LFS.
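
As a rough illustration of the kind of data quality check that can be run on the listings, here is a minimal sketch using pandas. The column names (`Price`, `Year`, `Mileage`, `Make`, `Model`), the `quality_report` helper, the file path in the comment, and the sample rows are all assumptions made for this example. They are not taken from the project's scripts.

```python
import pandas as pd


def quality_report(df: pd.DataFrame) -> dict:
    """Summarize basic data quality issues in a listings table (hypothetical helper)."""
    return {
        "rows": len(df),
        # Count of missing values in each column.
        "missing_per_column": df.isna().sum().to_dict(),
        # Rows that exactly repeat an earlier row.
        "duplicate_rows": int(df.duplicated().sum()),
        # Prices of zero or below are treated as suspect here (an assumption).
        "non_positive_prices": int((df["Price"] <= 0).sum()),
    }


# Tiny made-up sample. The real data would be loaded with something like
# pd.read_csv("02 Data/true_car_listings.csv"), where the path is assumed.
sample = pd.DataFrame({
    "Price": [15995, 15995, 0, 23500],
    "Year": [2014, 2014, 2016, 2017],
    "Mileage": [35725, 35725, None, 12000],
    "Make": ["Acura", "Acura", "Ford", "Honda"],
    "Model": ["ILX", "ILX", "Focus", "Civic"],
})

report = quality_report(sample)
print(report)
```

The same function could be applied to the full CSV once it has been pulled through Git LFS.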

Code

The scripts walk through:

  • Data quality checks and exploratory analysis
  • Predicting price using linear regression, random forest, and gradient boosting
  • Using folium and geospatial data to create a choropleth map of price residuals
  • Performing K-means clustering
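
The modeling steps above can be sketched roughly as follows with scikit-learn. This is not the project's actual code: the data is synthetic, the column names (`Price`, `Year`, `Mileage`) and the price formula are invented for illustration, and the hyperparameters (three clusters, 100 trees) are arbitrary choices. The folium map is left out because it depends on geographic boundary data; the sketch only computes the price residuals that such a map would aggregate.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import KMeans
from sklearn.ensemble import GradientBoostingRegressor, RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.metrics import r2_score
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the listings; column names are assumptions.
rng = np.random.default_rng(0)
n = 500
listings = pd.DataFrame({
    "Year": rng.integers(2005, 2019, size=n),
    "Mileage": rng.integers(5_000, 150_000, size=n),
})
# Made-up price rule: newer cars and lower mileage cost more, plus noise.
listings["Price"] = (
    20_000
    + 1_000 * (listings["Year"] - 2005)
    - 0.08 * listings["Mileage"]
    + rng.normal(0, 1_500, size=n)
)

# Supervised step: predict price from year and mileage.
X = listings[["Year", "Mileage"]]
y = listings["Price"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

models = {
    "linear_regression": LinearRegression(),
    "random_forest": RandomForestRegressor(n_estimators=100, random_state=0),
    "gradient_boosting": GradientBoostingRegressor(random_state=0),
}

scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores[name] = r2_score(y_test, model.predict(X_test))

for name, score in scores.items():
    print(f"{name}: R^2 = {score:.3f}")

# Residuals (actual minus predicted price) on the held-out listings.
residuals = y_test - models["linear_regression"].predict(X_test)

# Unsupervised step: K-means on standardized features, so that mileage
# (a large-valued column) does not dominate the distance computation.
features = StandardScaler().fit_transform(listings[["Price", "Year", "Mileage"]])
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0)
listings["cluster"] = kmeans.fit_predict(features)
print(listings.groupby("cluster")[["Price", "Year", "Mileage"]].mean())
```

On real listings, categorical fields such as make and model would need to be encoded before being passed to the regressors, and the number of clusters would normally be chosen with a diagnostic such as an elbow plot rather than fixed in advance.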
