The contents of my portfolio are the following:
Personal project where data in real-time is extracted from an API to create a data warehouse of football scores in the European Leagues and Competitions. After this, some basic Machine Learning and Deep Learning models are created for predictive analytics purposes.
- SQL
- Pendaho Data Integration
- Power BI
- Python
Power BI dashboard showing youtube geographical and video category insights from a public dataset.
- Python (Jupyter Notebooks)
- Power BI
Sponsorship company that wants to evaluate the best team to invest in based on previous team performance and expected revenews.
- Power BI
Coding of regression methods from scratch (Vanilla, Lasso, Ridge and M (Robust)). Coding of different clustering techniques such as K-means, Hierarchical, DBScan and HBDScan. Coding of SVM, logistic regression.
- Python (Jupyter Notebooks)
Creation of a decision tree from scratch for classification and regression using NumPy and Pandas libraries.
- Python
Banking and retail exercises to support decision making by creating ML Classification Models (Random Forest and Boost)
- Python (Jupyter Notebooks)
Creating a Skipgram model from scratch using NumPy and creating an Aspect-Based Sentiment Analysis using Hugging Face Models
- Python
Detect and classify workers on construction-site images to detect potential risks. The Image detection was performed using Detectron V2
- Python (Jupyter Notebooks)
Creation of a UNet Neural Network Architecture to segment images and classify the objects that were detected.
- Python (Jupyter Notebooks)
Graph NN Architecture to predict protein characteristics (PPI Dataset)
- Python
GAN, Auto-encoders and Normalizing flows Exercise
- Python (Jupyter Notebooks)
Neural Network tuning exercise
- Python (Jupyter Notebooks)
Regression and Classification NN and hyperparameter tuning
- Python (Jupyter Notebooks)
Document extraction, named entity recognition and to identify corrupt behaviours in Colombian Public Sector
- Python (Jupyter Notebooks)
- Microsoft Azure ML Services
Excercises that aim to understand the overall map-reduce paradigm and include
- Python (Jupyter Notebooks)
Excersise for marketing, use a Pareto-NBD Model to understand the behaviour of the clients and prioritize campaigns
- R
Multi-variate time series analysis focused on M5 Competition
- R
Business Intelligence trainings, delivered in a previous work-experience.
- Power BI