KMeans-Predictive-Modeling

Despite the repository name, the model here is a k-nearest-neighbours (k-NN) classifier rather than k-means clustering: it labels a new point by a majority vote among the k labelled points closest to it.

The model in this repository predicts the favourite sport in a state (in India).

Explanation

util_methods.py - Contains the linear algebra methods that are used in the k-means model.
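
The contents of util_methods.py are not reproduced in this README. As a rough sketch of the kind of linear algebra helpers a nearest-neighbour model needs, assuming points are plain lists of numbers (all function names below are illustrative assumptions, not necessarily those used in the file):

```python
import math

# Hypothetical helper names; the real util_methods.py may differ.

def vector_subtract(v, w):
    """Subtract w from v component by component."""
    return [v_i - w_i for v_i, w_i in zip(v, w)]

def dot(v, w):
    """Sum of the component-wise products of v and w."""
    return sum(v_i * w_i for v_i, w_i in zip(v, w))

def sum_of_squares(v):
    """Squared length of v."""
    return dot(v, v)

def distance(v, w):
    """Euclidean distance between the points v and w."""
    return math.sqrt(sum_of_squares(vector_subtract(v, w)))
```

The Euclidean distance is the piece the classifier relies on to decide which labelled points count as "nearest".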

fav_sport.py - Run this file to get the output of the model. It contains the following methods:

major_cluster(labels) : Takes a list of labels as its parameter and returns the label with the most occurrences.
knn_classify(k, labeled_points, new_point) : Driver algorithm which classifies new_point from its k nearest labeled points.
plot_cities() : Plots the cities by latitude and longitude, with a marker for each city's favourite sport.
classify_and_plot_grid() : Plots the model's predicted favourite sport over a grid of points.
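
The two core methods above can be sketched roughly as follows. This is a minimal illustration in the style of the book credited below, not a copy of fav_sport.py: the tie-breaking rule, the (point, label) pair layout, and the use of `math.dist` (Python 3.8+) are assumptions, and the toy points are invented for the example rather than taken from the repository's data.

```python
import math
from collections import Counter

def major_cluster(labels):
    """Return the most common label.

    Assumes labels are ordered nearest to farthest; on a tie,
    the farthest label is dropped and the vote is retried.
    """
    counts = Counter(labels)
    winner, winner_count = counts.most_common(1)[0]
    num_winners = sum(1 for c in counts.values() if c == winner_count)
    if num_winners == 1:
        return winner
    return major_cluster(labels[:-1])

def knn_classify(k, labeled_points, new_point):
    """Classify new_point by a vote among its k nearest labeled points.

    labeled_points is assumed to be a list of (point, label) pairs.
    """
    by_distance = sorted(
        labeled_points,
        key=lambda pair: math.dist(pair[0], new_point),
    )
    k_nearest_labels = [label for _, label in by_distance[:k]]
    return major_cluster(k_nearest_labels)

# Invented toy points, not the repository's data.
toy_points = [
    ((0, 0), "cricket"),
    ((0, 1), "cricket"),
    ((5, 5), "hockey"),
    ((5, 6), "hockey"),
    ((6, 5), "hockey"),
]
prediction = knn_classify(3, toy_points, (0.5, 0.5))
```

Sorting by distance before voting is what lets the tie-break discard the farthest neighbour first.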

Use

The code has been written to be as readable as possible.
You can modify it to build your own classifier.
For example, instead of states (a list used in the fav_sport.py file), you could use the languages that people speak throughout the world.
Feel free to reach out with any suggestions or for help!

Inspiration

Joel Grus and his book: Data Science from Scratch

JaspreetRFSingh/KMeans-Predictive-Modeling
