Neural Networks from Scratch

This repository contains a sequence of Jupyter notebooks, each representing a step in understanding and implementing neural networks from scratch. The primary focus is on understanding the underlying concepts and implementing them without relying on external libraries. The lecture series takes you from basic concepts to advanced architectures, demonstrating the conversion of traditional programming approaches to modern neural network techniques.

Lecture Series Overview

Neural Networks with Derivatives: In this introductory notebook, we build a complete neural network from scratch, focusing on understanding derivatives and their role in neural network operations.
NameWeave: Beginning with a single-layer bigram model, we gradually transition from traditional machine learning to neural network approaches for name generation.
NameWeave - Multi Layer Perceptron: We expand the NameWeave model into a multi-layer perceptron (MLP), increasing the complexity of the model.
NameWeave (MLP) - Activations, Gradients & Batch Normalization: Continuing from the previous notebook, we enhance the MLP model with activation functions, gradient handling, and batch normalization techniques.
NameWeave - Manual Back Propagation: This notebook breaks down the MLP model into atomic pieces of code, emphasizing the importance of understanding backpropagation.
NameWeave - WaveNet: Inspired by WaveNet architecture, we modify the previous model to resemble convolutional neural networks (CNNs).
GPT from Scratch: Implementing all concepts learned previously, we introduce self-attention and decoder-only architecture to generate text, demonstrating the capability of modern architectures like Transformers. The notebook generates infinite Harry Potter-like text based on the provided dataset.
GPT Tokenizer Notebook: A new addition to the repository, this notebook focuses on building a tokenizer for preprocessing text data, specifically designed for use with the GPT model.

Introduction

In this repository, I explore the implementation of neural networks from scratch. The primary goal is to deepen the understanding of neural network concepts and learn how to implement them without relying on external libraries.

Files

GPT from Scratch.ipynb: Jupyter notebook where a GPT model is built from scratch, generating text based on the Harry Potter dataset.
GPT Tokenizer Notebook.ipynb: New notebook focusing on building a tokenizer for preprocessing text data.
NameWeave (MLP) - Activations, Gradients & Batch Normalization.ipynb: Jupyter notebook enhancing the NameWeave model with activations, gradients, and batch normalization.
NameWeave - Manual Back Propagation.ipynb: Jupyter notebook demonstrating manual backpropagation through the NameWeave model.
NameWeave - Multi Layer Perceptron.ipynb: Jupyter notebook expanding the NameWeave model into a Multi Layer Perceptron with multiple layers.
NameWeave - WaveNet.ipynb: Jupyter notebook implementing a WaveNet-like architecture for name generation.
Neural Network with Derivatives.ipynb: Jupyter notebook containing the implementation of a neural network with derivatives.
NNFS-GitHub Banner.gif: Banner image for the repository.
README.md: This file, providing an overview of the repository.
LICENSE: The license information for this repository.
Datasets/: Directory containing datasets used in the notebooks.
- Harry_Potter_Books.txt: Dataset used in the GPT from Scratch.ipynb notebook.
- Indian_Names.txt: Dataset used for all other notebooks.
- Tokenizer/: Directory for tokenization related files.
  - tokenizer_train.txt: Dataset used for training the tokenizer.
ExplanationMedia/Images/: Directory containing images used for explaining the notebooks.
ExplanationMedia/Videos/: Directory containing videos used for explaining the notebooks.
GPT Scripts/: Directory containing raw Python scripts created for building the GPT model in GPT from Scratch.ipynb.
- Tokenizer/: Directory for tokenizer related scripts.
  - tokenizer_train.py: Script used for training the tokenizer.

Usage

To explore the content of the lecture series, simply open the respective Jupyter notebook files using a compatible environment.

Install Required Dependencies

You can use Google Colab to view and run these files on the cloud.

OR

To view and run these files locally you need to run to install Jupyter Notebook via PyPi of Python:

Install Jupyter Notebook:

pip install notebook

Run Jupyter Notebook:

jupyter notebook

Future Updates

Stay tuned for additional features, improvements, and possibly new lecture series exploring more advanced topics in neural networks and machine learning.

License

This project is licensed under MIT - see the LICENSE file for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Neural Networks from Scratch

Lecture Series Overview

Table of Contents

Introduction

Files

Usage

Install Required Dependencies

Future Updates

License

About

Releases

Sponsor this project

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 114 Commits
.github		.github
.vscode		.vscode
Datasets		Datasets
ExplanationMedia/Images		ExplanationMedia/Images
GPT Scripts		GPT Scripts
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
GPT Tokenizer.ipynb		GPT Tokenizer.ipynb
GPT from Scratch.ipynb		GPT from Scratch.ipynb
LICENSE		LICENSE
NNFS_GitHub_Banner.gif		NNFS_GitHub_Banner.gif
NameWeave (MLP) - Activations, Gradients & Batch Normalization.ipynb		NameWeave (MLP) - Activations, Gradients & Batch Normalization.ipynb
NameWeave - Manual Back Propagation.ipynb		NameWeave - Manual Back Propagation.ipynb
NameWeave - Multi Layer Perceptron.ipynb		NameWeave - Multi Layer Perceptron.ipynb
NameWeave - WaveNet.ipynb		NameWeave - WaveNet.ipynb
NameWeave.ipynb		NameWeave.ipynb
Neural Network with Derivatives.ipynb		Neural Network with Derivatives.ipynb
OpenAI_GPT_v2.5.ipynb		OpenAI_GPT_v2.5.ipynb
README.md		README.md

License

AvishakeAdhikary/Neural-Networks-From-Scratch

Folders and files

Latest commit

History

Repository files navigation

Neural Networks from Scratch

Lecture Series Overview

Table of Contents

Introduction

Files

Usage

Install Required Dependencies

Future Updates

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Sponsor this project

Packages 0

Languages

Packages