Table of Contents
A description of the data in 'UsArrests.csv' is given as: “This data set contains statistics, in arrests per 100,000 residents, for assault, murder, and rape in each of the 50 US states in 1973. Also given is the percent of the population living in urban areas.” This contains a .ipynb file that generates an in-depth PCA and data clustering report of the data using unsupervised machine learning. Dataset can be found: https://www.kaggle.com/datasets/halimedogan/usarrests
- Visual Studio Code
This is an example of how you may give instructions on setting up your project locally. To get a local copy up and running follow these simple example steps.
This is an example of how to list things you need to use the software and how to install them.
- npm
npm install npm@latest -g
- Clone the repo
git clone https://github.com/EstherSlabbert/finalCapstone.git
- Install python go to: https://www.python.org/downloads/ download and install
- Install sklearn packages
py -m pip install -U scikit-learn
- Install pandas packages
py -m pip install pandas
- Install numpy packages
py -m pip install numpy
- Download the .ipynb and .csv files from this repository
- Run the files Project was created and run with Visual Studio Code (to download: https://code.visualstudio.com/) and can be run from there or from Jupyter Notebook.
Once the files have been downloaded run them with Visual Studio Code or Jupyter Notebook or your preferred choice of program to run .ipynb files. Running with Visual Studio Code one can see the PCA and cluster analysis report generated by this project. Below are some examples of what the code should reproduce:
Esther Slabbert - [email protected]
Project Link: https://github.com/EstherSlabbert/finalCapstone
- HyperionDev Data Science Bootcamp
- Esther Slabbert