We use python to transfer the CBOR data into HTML files.Then use python to filter the in the HTML. Then we stem the text
and delete the stopwords it has.
Then we use Java to build the docs into Mysql.
In the Java program we used some particular algorithm to continue the whole process and get the runfile.