This repository consists of implementations of various algorithms used for processing big data.
The algorithms used are:
- Simple and Multiple Linear Regression using Gradient Descent (batch data)
- Simple and Multiple Linear Regression using Gradient Descent (stream data)
- Simple and Multiple Linear Regression using Normal Equation Methods (batch data)
- Incremental Mathematical Stream Regression and Approximate Stream Regression
- Collaborative Filtering (Stochastic Gradient Descent for Matrix Factorization)
- Collaborative Filtering (Distributed Stochastic Gradient Descent for Matrix Factorization)
- Collaborative Filtering (Streaming Distributed Stochastic Gradient Descent for Matrix Factorization)
- k-means and k-medoids (batch data)
- Stream Algorithm
- Clustream Algorithm
- ID3 and CART (batch data)
- Hoeffding Tree