Skip to content

ML modeling and Data Science for Gaming Data. Counter-Strike GO and League of Legends analysis and modeling.

License

Notifications You must be signed in to change notification settings

jakorostami/gaming_ds

Repository files navigation

Gaming Data Science and Machine Learning

This repo is a demo of a classical Data Science and Machine Learning approach but for gaming data, which I've never worked with before. The gaming data is from the games of Counter-Strike Global Offensive (CSGO) and League of Legends (LoL).

For CSGO, the approach is analytical at first producing statistical and probabilistic analysis of the games played. Later on, the coordinate data (longitude, latitude) is used to do movement prediction with an LSTM, training on the GPU.

But for LoL, the approach is a bit different because of the composition of the dataset, first an analysis is done which then leads to a binary classification challenge where the predictions are which teams wins.

CS-GO

What we'll look at first is the equipment value after buy time for each match and split them by which side won.

- Counter-Terrorists tend to buy more expensive gear
- Terrorists might be saving depending on the round

Mapper



Of all the matches analyzed, we find that Counter-Terrorists have a higher propensity to be the first attacker.

Mapper


Terrorists have a tendency to spread around bomb site while Counter-Terrorists focus more on bomb site B and the center.

Mapper

When we test the difference of time between attacks between CT and T using a non-parametric statistical test 500 times to see if this happens by chance 1% of the time - the majority of our tests generate values that are compatible with our data. Meaning we don’t really find a statistical difference.

Mapper

League of Legends

For LoL, at a first glance we find the following:

- Higher team champion level comes with high team minions killed
- More wards placed by the team also comes with a higher team champion level

Mapper


If a team has an extremely high kill rate they tend be very different in their first 10 minutes of the game.
- lower deaths than the non-anomaly teams
- higher kill assists
- higher kills
- more gold per minute
- higher total experience
- higher champion level

Mapper


We've got all the features for modeling

Mapper


The best model is a logistic regression using our augmented dataset with the PCA embeddings. The second best model is the Random Forest which is a tree-based model.

Mapper

About

ML modeling and Data Science for Gaming Data. Counter-Strike GO and League of Legends analysis and modeling.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages