Diplodatos 2020
Diploma in Data Science, Machine Learning and its Applications (Diplomatura en Ciencia de Datos, Aprendizaje Automático y sus Aplicaciones)
- Data visualization
- Data cleaning
- Introduction to Machine Learning
- Supervised Learning
- Unsupervised Learning
- Distributed Computing
├── analisisvis
│ ├── data
│ └── notebooks
├── curacion
│ ├── input
│ ├── notebooks
│ ├── output
│ └── queries
├── distributed
│ ├── bin
│ ├── conf
│ ├── docs
│ ├── inputs
│ └── notebooks
├── doc
├── introml
│ └── notebooks
├── scripts
├── supervised
│ └── notebooks
└── unsupervised
    └── notebooks
Data analysis and visualization of the Human Freedom Index 2019
Linear Regression on Boston Housing Dataset
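As a rough illustration of the technique named above, the following is a minimal, dependency-free sketch of ordinary least squares with a single feature. It is not the notebook's code: the function names `fit_simple_linear_regression` and `predict` are hypothetical, and the toy numbers are made up rather than taken from the Boston Housing dataset.

```python
from statistics import mean


def fit_simple_linear_regression(xs, ys):
    """Return (slope, intercept) minimizing squared error for y ~ slope * x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    # Sum of squared deviations of x; zero means all x values are identical.
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("xs must contain at least two distinct values")
    # Sum of cross deviations between x and y.
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


def predict(x, slope, intercept):
    """Evaluate the fitted line at x."""
    return slope * x + intercept


# Made-up toy data (NOT Boston Housing values), generated from y = 2x + 1.
rooms = [1.0, 2.0, 3.0, 4.0]
prices = [3.0, 5.0, 7.0, 9.0]
slope, intercept = fit_simple_linear_regression(rooms, prices)
```

With several features, a library implementation would normally be used instead of this closed-form single-feature version.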
Cross-validated grid search on a loan dataset, where the TARGET label is 1 if the client defaulted on the loan and 0 if the loan was repaid
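The idea of a cross-validated grid search can be sketched without any library: each candidate hyperparameter value is scored by k-fold cross-validation and the best-scoring value is kept. The sketch below is an assumption-laden illustration, not the notebook's code. The model (a 1-D k-nearest-neighbours classifier), the helper names, and the `(feature, TARGET)` rows are all hypothetical and made up for the example.

```python
def knn_predict(train, x, k):
    """Majority vote among the k training rows whose feature is nearest to x."""
    nearest = sorted(train, key=lambda row: abs(row[0] - x))[:k]
    votes = sum(label for _, label in nearest)
    return 1 if 2 * votes > len(nearest) else 0


def cv_accuracy(data, k, n_folds):
    """Mean accuracy over n_folds folds; fold i holds rows i, i + n_folds, ..."""
    scores = []
    for fold in range(n_folds):
        test = data[fold::n_folds]
        train = [row for i, row in enumerate(data) if i % n_folds != fold]
        correct = sum(knn_predict(train, x, k) == y for x, y in test)
        scores.append(correct / len(test))
    return sum(scores) / n_folds


def grid_search(data, k_grid, n_folds=3):
    """Score every k in k_grid by cross-validation; return (best_k, all_scores)."""
    scores = {k: cv_accuracy(data, k, n_folds) for k in k_grid}
    best_k = max(scores, key=scores.get)
    return best_k, scores


# Made-up rows: (hypothetical risk feature, TARGET) with 1 = defaulted, 0 = repaid.
loans = [
    (0.10, 0), (0.90, 1), (0.20, 0), (0.80, 1),
    (0.30, 0), (0.70, 1), (0.15, 0), (0.95, 1),
    (0.25, 0), (0.85, 1), (0.35, 0), (0.75, 1),
]
best_k, scores = grid_search(loans, k_grid=[1, 3, 5])
```

Fixed, strided folds keep the example deterministic; real data would usually be shuffled (and often stratified by TARGET) before splitting.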
Prediction competition: Walmart Recruiting: Trip Type Classification (Diplodatos 2020)
- Team 26
- Score: 0.71833
- Notebook
FIFA 20 complete player dataset (Kaggle)
Clustering using:
- k-means
- DBSCAN
- mean shift
- Notebook
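Of the clustering methods listed above, k-means is the simplest to sketch from scratch. The following is a minimal illustration of Lloyd's algorithm, not the notebook's code: the helper names `assign` and `kmeans` are hypothetical, the points are made up rather than taken from the FIFA 20 dataset, and the initial centroids are simply the first `k` points (a simplifying assumption; random or k-means++ initialization is more common in practice).

```python
import math


def assign(points, centroids):
    """Label each point with the index of its nearest centroid."""
    return [
        min(range(len(centroids)), key=lambda c: math.dist(p, centroids[c]))
        for p in points
    ]


def kmeans(points, k, n_iter=20):
    """Lloyd's algorithm: alternate assignment and centroid update steps."""
    centroids = list(points[:k])  # naive deterministic initialization (assumption)
    for _ in range(n_iter):
        labels = assign(points, centroids)
        updated = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                # New centroid is the coordinate-wise mean of the cluster members.
                updated.append(tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                # Keep the old centroid if a cluster ends up empty.
                updated.append(centroids[c])
        if updated == centroids:
            break  # converged: centroids no longer move
        centroids = updated
    return centroids, assign(points, centroids)


# Made-up 2-D points forming two well-separated groups.
points = [(1.0, 1.0), (5.0, 5.0), (1.2, 0.8), (5.2, 4.8), (0.8, 1.2), (4.8, 5.2)]
centroids, labels = kmeans(points, k=2)
```

Unlike k-means, DBSCAN and mean shift do not take the number of clusters as an input, which is one reason to compare all three on the same data.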
Distributed computing for big data using PySpark and GraphFrames
Distributed under the MIT License. See LICENSE for more information.
- Diplodatos