Ignasimg/BigDataUtils

BigDataUtils

A few prototype utilities for dealing with big datasets, especially through online algorithms.

## Utilities

  • MassMean - Utility to efficiently compute the mean, variance, or standard deviation of a continuous stream of values, with the ability to remove values from the calculation at a later time.
  • NeuralNetwork - Utility for classifying.
  • KMeans - Utility for clustering.
  • InsertionSort - Utility for keeping a list always sorted.
  • TimedList - Utility to keep a list of items for a given time.
  • BuyOrRent - Utility to determine whether to 'buy' or to 'rent' (the ski rental problem).
  • WeightedMajority - Utility to choose the best prediction from a pool of predictions.
  • Winnow - Utility for classifying.
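
To illustrate the kind of online computation MassMean describes, here is a minimal Python sketch of a running mean and variance that also supports removing a previously added value. It uses Welford's update and its algebraic inverse. The class name `RunningStats` and its methods are hypothetical and chosen for illustration only; they are not this repository's API, and the actual MassMean implementation may work differently.

```python
import math


class RunningStats:
    """Hypothetical illustration: running mean/variance with removal."""

    def __init__(self):
        self.count = 0
        self.mean = 0.0
        self.m2 = 0.0  # sum of squared deviations from the current mean

    def add(self, x):
        """Fold a new value into the statistics (Welford's update)."""
        self.count += 1
        delta = x - self.mean
        self.mean += delta / self.count
        self.m2 += delta * (x - self.mean)

    def remove(self, x):
        """Undo a previous add(x); the caller must pass a value that was added."""
        if self.count == 0:
            raise ValueError("no values to remove")
        if self.count == 1:
            self.count, self.mean, self.m2 = 0, 0.0, 0.0
            return
        old_mean = self.mean
        self.count -= 1
        self.mean = old_mean - (x - old_mean) / self.count
        self.m2 -= (x - self.mean) * (x - old_mean)
        self.m2 = max(self.m2, 0.0)  # guard against tiny negative rounding error

    @property
    def variance(self):
        """Population variance of the values currently tracked."""
        if self.count == 0:
            raise ValueError("variance is undefined for an empty stream")
        return self.m2 / self.count

    @property
    def std_dev(self):
        return math.sqrt(self.variance)


if __name__ == "__main__":
    stats = RunningStats()
    for value in [2, 4, 4, 4, 5, 5, 7, 9]:
        stats.add(value)
    print(stats.mean, stats.variance, stats.std_dev)

    stats.remove(9)  # drop a value that was added earlier
    print(stats.mean, stats.variance)
```

Each `add` and `remove` runs in constant time and memory, so the stream never has to be stored. The sketch does not check that a removed value was actually added; tracking that would require keeping the values themselves.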

## TODOs

  • Improve performance of NeuralNetwork class.
