Trec_DD_NJU

We use Python to convert the CBOR data into HTML files. Then we use Python to filter the HTML, stem the text, and remove the stopwords.
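The filtering, stemming, and stopword steps above can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the code in this repository: the `STOPWORDS` set, the `naive_stem` suffix stripper (a crude stand-in for a real stemmer such as Porter), and the function names `html_to_text` and `preprocess` are all assumptions made for the example.

```python
import re
from html.parser import HTMLParser

# Hypothetical, deliberately tiny stopword list (an assumption for this sketch).
STOPWORDS = {"a", "an", "and", "the", "of", "in", "is", "are", "to", "it"}


class TextExtractor(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.parts.append(data)


def html_to_text(html):
    """Return the text content of an HTML string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


def naive_stem(word):
    """Strip one common English suffix; a crude stand-in for a real stemmer."""
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def preprocess(html):
    """Extract text, lowercase and tokenize it, drop stopwords, then stem."""
    tokens = re.findall(r"[a-z0-9]+", html_to_text(html).lower())
    return [naive_stem(t) for t in tokens if t not in STOPWORDS]
```

Calling `preprocess` on the HTML of one document returns a list of stemmed tokens that could then be stored or indexed. A real pipeline would likely swap in a fuller stopword list and a proper stemming library.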

Then we use Java to load the documents into MySQL.

The Java program then applies our algorithm to carry out the rest of the process and produce the run file.