a-benkhaled/Incremental_text_clustering

Incremental_text_clustering

We are interested in developing a solution for clustering text documents in an incremental environment, where documents arrive as a stream rather than as a fixed collection. The current solution uses Cobweb [1][2] to cluster streaming documents.

This system uses several other open source projects:

  • Weka [3] for the implementation of Cobweb/Classit (with minor modifications).
  • Borgelt's FP-growth [4][5] implementation to cluster words (which are the attributes of an instance).
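
As a rough illustration of the criterion Cobweb [1] optimizes, the sketch below computes category utility for documents represented as sets of words (each word is a binary present/absent attribute) and greedily places a new document. It is a flat simplification written in Python for illustration only, not the modified Weka implementation used by this project: it omits Cobweb's hierarchy and its merge and split operators, and all function and variable names are hypothetical.

```python
from collections import Counter


def attribute_score(docs, vocabulary):
    """Sum over binary word attributes of P(present)^2 + P(absent)^2."""
    n = len(docs)
    # Each document is a set, so a word is counted at most once per document.
    counts = Counter(word for doc in docs for word in doc)
    score = 0.0
    for word in vocabulary:
        p = counts[word] / n
        score += p * p + (1.0 - p) * (1.0 - p)
    return score


def category_utility(clusters):
    """Category utility of a flat partition; each cluster is a list of word sets."""
    all_docs = [doc for cluster in clusters for doc in cluster]
    vocabulary = set().union(*all_docs)
    baseline = attribute_score(all_docs, vocabulary)
    total = 0.0
    for cluster in clusters:
        weight = len(cluster) / len(all_docs)
        total += weight * (attribute_score(cluster, vocabulary) - baseline)
    return total / len(clusters)


def add_document(clusters, doc):
    """Return the partition (doc added to one cluster, or to a new one) with the highest category utility."""
    doc = frozenset(doc)
    candidates = []
    for i in range(len(clusters)):
        # Candidate: doc joins existing cluster i, other clusters unchanged.
        candidates.append([c + [doc] if j == i else c for j, c in enumerate(clusters)])
    # Candidate: doc starts a cluster of its own.
    candidates.append(clusters + [[doc]])
    return max(candidates, key=category_utility)


# Hypothetical toy stream: two topical clusters, then one new document.
pets = [frozenset({"cat", "dog"}), frozenset({"cat", "dog", "pet"})]
finance = [frozenset({"stock", "market"}), frozenset({"stock", "market", "trade"})]
updated = add_document([pets, finance], {"dog", "pet"})
print(len(updated), [len(c) for c in updated])
```

In this toy run the new document joins the first cluster, because that placement yields a higher category utility than joining the second cluster or starting a new one. A greedy flat scheme like this is sensitive to arrival order, which is one reason the full Cobweb algorithm also considers merging and splitting nodes.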

[1] http://axon.cs.byu.edu/~martinez/classes/678/Papers/Fisher_Cobweb.pdf
[2] http://www.cs.cmu.edu/~callan/Papers/cikm06-nsahoo.pdf
[3] http://www.cs.waikato.ac.nz/ml/weka/
[4] http://www.borgelt.net//fpgrowth.html
[5] http://web.engr.illinois.edu/~hanj/pdf/dami04_fptree.pdf
