GitHub - andysingal/machine-learning

Books and Resources	Status of Completion
1. Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow
2. Hands-On Unsupervised Learning Using Python
3. Hands-On Gradient Boosting with XGBoost and scikit-learn
4. Machine Learning with PyTorch and Scikit-Learn
5. Feature Engineering Bookcamp
6. Deep Learning with Python
7. Python Machine Learning - Third Edition
8. Automated Machine Learning

Topics	Status of Completion
1. Regression and Regularization	✅
2. SVM	✅
3. Decision Tree	✅
4. XGBoost	✅
5. Ensemble Learning and Random Forests	✅
6. Dimension Reduction	✅
7. Feature Engineering

Projects	Status of Completion
1. California House Prices	✅
2. Mobile Price Classification	✅
3. Bank Churn	✅

Data Science Workflow:

Problem definition: A data science or machine learning project typically starts with problem definition. In this first step, you define the problem you are trying to solve with data science, the scope of the project, and the candidate approaches to solving it. When weighing approaches, consider which type of analysis (descriptive, explanatory, or predictive) and which type of learning algorithm (supervised, unsupervised, or reinforcement learning) suits the problem.
Data collection: Once the project is clearly defined, you move on to data collection, where you gather all the data the project needs. You may need to purchase data from third-party vendors, scrape and extract data from the web, use publicly available data, or collect data from your internal systems. Depending on the case, this step can be trivial or tedious.
Data preparation: Once you have gathered the data you need, the next step is data preparation. The goal of this step is to transform your data and prepare it for the steps that follow. If the data sources use different formats, you have to transform and unify them. If the data has no fixed structure, you have to structure it, typically in tabular format, so that you can easily conduct different analyses and build machine learning models.
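The unification described in the data preparation step can be sketched as follows. This is a minimal illustration that uses only the Python standard library; the two inline sources, the column names, and the helpers `to_row` and `load_unified` are hypothetical and are not taken from this repository. In practice a library such as pandas would usually handle this.

```python
import csv
import io
import json

# Hypothetical raw inputs: the same kind of record arriving in two formats.
CSV_SOURCE = "id,price,rooms\n1,250000,3\n2,410000,5\n"
JSON_SOURCE = '[{"id": 3, "price": "315000", "rooms": 4}]'

# Assumed target schema: every row has these columns, all stored as integers.
COLUMNS = ("id", "price", "rooms")


def to_row(record):
    """Coerce one raw record into a row with the fixed integer schema."""
    return {name: int(record[name]) for name in COLUMNS}


def load_unified(csv_text, json_text):
    """Parse both sources and return one list of uniformly typed rows."""
    csv_rows = csv.DictReader(io.StringIO(csv_text))
    json_rows = json.loads(json_text)
    return [to_row(r) for r in csv_rows] + [to_row(r) for r in json_rows]


table = load_unified(CSV_SOURCE, JSON_SOURCE)
for row in table:
    print(row)
```

The CSV reader yields every value as a string, while the JSON source mixes strings and numbers, so the coercion in `to_row` is what makes the combined table consistent.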
Data analysis: After data preparation, you start looking into the data. In the data analysis step, you typically compute descriptive summary statistics and build visual plots to better understand the data. Quite often, you can find recognizable patterns and draw insights from the data during this step. You may also find anomalies in the data, such as missing values, corrupted data, or duplicate records.
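A small sketch of the checks mentioned in the data analysis step: summary statistics, missing values, and duplicate records. The rows and the helper names (`summarize`, `count_missing`, `find_duplicates`) are assumptions made for illustration, and the code relies only on the standard library.

```python
import statistics

# Hypothetical prepared rows; None marks a missing value.
rows = [
    {"id": 1, "price": 250000, "rooms": 3},
    {"id": 2, "price": 410000, "rooms": 5},
    {"id": 3, "price": None, "rooms": 4},
    {"id": 2, "price": 410000, "rooms": 5},  # exact repeat of an earlier row
]


def summarize(rows, column):
    """Return count, mean, median, min, and max of the non-missing values."""
    values = [r[column] for r in rows if r[column] is not None]
    return {
        "count": len(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "min": min(values),
        "max": max(values),
    }


def count_missing(rows, column):
    """Count the rows whose value in the given column is missing."""
    return sum(1 for r in rows if r[column] is None)


def find_duplicates(rows):
    """Return the rows that repeat an earlier row exactly."""
    seen, duplicates = set(), []
    for r in rows:
        key = tuple(sorted(r.items()))
        if key in seen:
            duplicates.append(r)
        seen.add(key)
    return duplicates


price_summary = summarize(rows, "price")
missing_prices = count_missing(rows, "price")
duplicates = find_duplicates(rows)
print(price_summary, missing_prices, duplicates)
```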
Feature engineering: Feature engineering is one of the most important parts of data science and machine learning, as it directly affects the performance of predictive models. It requires expertise and good domain knowledge of the data, because you have to transform the raw data into more informative inputs for your algorithms to learn from. One good example of feature engineering is transforming text data into numerical data. Most machine learning algorithms operate on numerical inputs, so you need a strategy for translating textual data into numerical data. As we work through this book and as we build machine learning models, we will discuss and experiment with various feature engineering techniques.
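One simple way to turn text into numbers is a bag-of-words count vector, sketched below. This is only one of many possible encodings and is chosen here for illustration; the documents and the helpers `tokenize`, `build_vocabulary`, and `vectorize` are hypothetical. Libraries such as scikit-learn offer more complete versions of the same idea.

```python
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it on whitespace."""
    return text.lower().split()


def build_vocabulary(documents):
    """Map each distinct token to a column index, in sorted order."""
    tokens = sorted({tok for doc in documents for tok in tokenize(doc)})
    return {tok: index for index, tok in enumerate(tokens)}


def vectorize(text, vocabulary):
    """Return the count of each vocabulary token in the text."""
    counts = Counter(tokenize(text))
    vector = [0] * len(vocabulary)
    for tok, n in counts.items():
        if tok in vocabulary:  # tokens outside the vocabulary are ignored
            vector[vocabulary[tok]] = n
    return vector


# Hypothetical review snippets used as the training corpus.
documents = ["great phone great battery", "poor battery life"]
vocabulary = build_vocabulary(documents)
vectors = [vectorize(doc, vocabulary) for doc in documents]
print(vocabulary)
print(vectors)
```

Each document becomes a fixed-length list of integers, one position per vocabulary word, which a numerical learning algorithm can consume directly.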
Model building: Once you are done with the feature engineering step, you can start training and testing your machine learning models. In this step, you experiment with various learning algorithms to figure out which one works best for your use case. One thing to keep in mind in this step is the validation metric. It is important to have a good measure of your model's performance, because machine learning algorithms optimize for the performance measure they are given. As we start building machine learning models in the following chapters, we will discuss in more detail which metrics to use depending on the type of problem we are working on.
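The train, test, and measure loop from the model building step can be sketched with a hand-written least-squares line fit. The synthetic data, the choice of RMSE as the validation metric, and the helper names are assumptions for illustration only; the projects in this repository use their own models and metrics.

```python
import math
import random


def train_test_split(pairs, test_fraction=0.25, seed=0):
    """Shuffle a copy of the data and hold out a fraction for testing."""
    shuffled = pairs[:]
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def fit_line(pairs):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def rmse(pairs, slope, intercept):
    """Root mean squared error of the fitted line on the given pairs."""
    squared = [(y - (slope * x + intercept)) ** 2 for x, y in pairs]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical data: y is roughly 2x + 1 with a small alternating perturbation.
data = [(x, 2 * x + 1 + (0.5 if x % 2 else -0.5)) for x in range(20)]

train, test = train_test_split(data)
slope, intercept = fit_line(train)
test_rmse = rmse(test, slope, intercept)
print(f"slope={slope:.3f} intercept={intercept:.3f} test RMSE={test_rmse:.3f}")
```

Evaluating on the held-out `test` pairs rather than on `train` is what makes the metric an estimate of performance on unseen data.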

Folders and files (history: 100 commits):

.idea
Books
Dimension_Reduction
EICT-IITR-AI-DL-2023-Batch-1
EICT-IITR-AI-DL-2023-Batch-2
Ensemble_methods
Feature_Engineering
Hyperparameters
Projects
Unsupervised Learning
imbalance-learn
machine-learning-articles
misc_files
04_anomaly_detection.ipynb
3-variants-of-classification-problems-in-machine-learning.md
Advanced_Linear_Regression_.ipynb
Interactive-ML.md
Medical_Insurance_Cost_Prediction.ipynb
README.md
Telco-Customer-Churn-dataset.csv
USA_Real_Estate_Price_Prediction.ipynb
a-gentle-introduction-to-long-short-term-memory-networks-lstm.md
a-simple-conv3d-example-with-keras.md
about-loss-and-loss-functions.md
albert-explained-a-lite-bert.md
an-introduction-to-dcgans.md
an-introduction-to-tensorflow-keras-callbacks.md
automating-neural-network-configuration-with-keras-tuner.md
avoid-wasting-resources-with-earlystopping-and-modelcheckpoint-in-keras.md
batch-normalization-with-pytorch.md
best-machine-learning-artificial-intelligence-books.md
beyond-swish-the-lisht-activation-function.md
bidirectional-lstms-with-tensorflow-and-keras.md
binary-crossentropy-loss-with-pytorch-ignite-and-lightning.md
build-an-lstm-model-with-tensorflow-and-keras.md
building-a-decision-tree-for-classification-with-python-and-scikit-learn.md
building-a-simple-vanilla-gan-with-pytorch.md
building-an-image-denoiser-with-a-keras-autoencoder-neural-network.md
can-neural-networks-approximate-mathematical-functions.md
classifying-imdb-sentiment-with-keras-and-embeddings-dropout-conv1d.md
cnns-and-feature-extraction-the-curse-of-data-sparsity.md
commoditizing-ai-the-state-of-automated-machine-learning.md
conditional-gans-cgans-explained.md
conv2dtranspose-using-2d-transposed-convolutions-with-keras.md
convolutional-neural-networks-and-their-components-for-computer-vision.md
convolutional-neural-networks-with-pytorch.md
could-chaotic-neurons-reduce-machine-learning-data-hunger.md
creating-a-multilabel-neural-network-classifier-with-tensorflow-and-keras.md
creating-a-multilayer-perceptron-with-pytorch-and-lightning.md
creating-a-signal-noise-removal-autoencoder-with-keras.md
creating-a-simple-binary-svm-classifier-with-python-and-scikit-learn.md
creating-an-mlp-for-regression-with-keras.md
creating-dcgan-with-pytorch.md
creating-dcgan-with-tensorflow-2-and-keras.md
creating-depthwise-separable-convolutions-in-keras.md
creating-interactive-visualizations-of-tensorflow-keras-datasets.md
creating-one-vs-rest-and-one-vs-one-svm-classifiers-with-scikit-learn.md
cropping-layers-with-pytorch.md
customer churn prediction for telecom industry.ipynb
dall-e-openai-gpt-3-model-can-draw-pictures-based-on-text.md
data-poisoning.md
dialogpt-transformers-for-dialogues.md
differences-between-autoregressive-autoencoding-and-sequence-to-sequence-models-in-machine-learning.md
distributed-training-tensorflow-and-keras-models-with-apache-spark.md
easy-causal-language-modeling-with-machine-learning-and-huggingface-transformers.md
easy-chatbot-with-dialogpt-machine-learning-and-huggingface-transformers.md
easy-grammar-error-detection-correction-with-machine-learning.md
easy-install-of-jupyter-notebook-with-tensorflow-and-docker.md
easy-machine-translation-with-machine-learning-and-huggingface-transformers.md
easy-masked-language-modeling-with-machine-learning-and-huggingface-transformers.md
easy-text-summarization-with-huggingface-transformers-and-machine-learning.md
exploring-the-keras-datasets.md
extensions-to-gradient-descent-from-momentum-to-adabound.md
feature-scaling-with-python-and-sparse-data.md
finding-optimal-learning-rates-with-the-learning-rate-range-test.md
from-vanilla-rnns-to-transformers-a-history-of-seq2seq-learning.md
gans-an-introduction-to-frechet-inception-distance-fid.md
generative-adversarial-networks-a-gentle-introduction.md
getting-out-of-loss-plateaus-by-adjusting-learning-rates.md
getting-started-with-pytorch.md
gradient-descent-and-its-variants.md
greedy-layer-wise-training-of-deep-networks-a-pytorch-example.md
greedy-layer-wise-training-of-deep-networks-a-tensorflow-keras-example.md
grouped-convolutions-with-tensorflow-2-and-keras.md
he-xavier-initialization-activation-functions-choose-wisely.md
help-fight-covid-19-participate-in-the-cord-19-challenge.md
house_price_ml.pdf
how-does-the-softmax-activation-function-work.md
how-to-build-a-convnet-for-cifar-10-and-cifar-100-classification-with-keras.md
how-to-build-a-resnet-from-scratch-with-tensorflow-2-and-keras.md
how-to-build-a-u-net-for-image-segmentation-with-tensorflow-and-keras.md
how-to-check-if-your-deep-learning-model-is-underfitting-or-overfitting.md
how-to-create-a-basic-mlp-classifier-with-the-keras-sequential-api.md
how-to-create-a-cnn-classifier-with-keras.md
how-to-create-a-confusion-matrix-with-scikit-learn.md
how-to-create-a-multilabel-svm-classifier-with-scikit-learn.md
how-to-create-a-neural-network-for-regression-with-pytorch.md
how-to-create-a-variational-autoencoder-with-keras.md
how-to-easily-create-a-train-test-split-for-your-machine-learning-model.md
how-to-evaluate-a-keras-model-with-model-evaluate.md
