Skip to content

Commit

Permalink
minor tweaks
Browse files Browse the repository at this point in the history
  • Loading branch information
Mike Holler committed Apr 22, 2014
1 parent ef7cd74 commit 9914876
Show file tree
Hide file tree
Showing 5 changed files with 17 additions and 13 deletions.
5 changes: 4 additions & 1 deletion document.tex
Original file line number Diff line number Diff line change
Expand Up @@ -72,7 +72,10 @@
\begin{document}
\maketitle
\newpage
%\epigraph{You want weapons? We're in a library! Books! The best weapons in the world! This room's the greatest arsenal we could have---arm yourselves!}{---The Doctor}
\epigraph{You want weapons? We're in a library! Books! The best weapons in the world! This room's the greatest arsenal we could have---arm yourselves!}{---The Doctor}
\epigraph{Automatic indexing of books has failed miserably, as will be discussed below.}{Nancy Mulvany, {\it Indexing Books}}
\epigraph{The idea behind digital computers may be explained by saying that these machines are intended to carry out any operations which could be done by a human computer.}{---Alan Turing}
\pagebreak
\tableofcontents
\input{tex/05-styles}
\input{tex/10-abstract}
Expand Down
4 changes: 0 additions & 4 deletions tex/15-introduction.tex
Original file line number Diff line number Diff line change
Expand Up @@ -144,7 +144,3 @@ \subsection{Indexing Methods}

The goal of this research is to further the knowledge of automatic indexing by applying techniques from Natural Language Processing (NLP), a subfield of Computer Science that concerns itself with teaching computers to understand language mechanics.
The next section takes a brief look at what Natural Language Processing is, and how it pertains to indexing.

% TODO: Explain/preview structure of paper.


2 changes: 1 addition & 1 deletion tex/35-conclusion.tex
Original file line number Diff line number Diff line change
Expand Up @@ -25,4 +25,4 @@ \section{Conclusion}
Although this experiment failed to achieve human results, it did get appreciably close enough to warrant additional research and improvement.
Nancy Mulvany, professional indexer and author of {\it Indexing Books} confidently writes, ``There is nothing automatic about the index-writing process.
There is no automatic indexing tool available that could produce the index in the back of this book,''\cite{mulvany}.
Mulvany is still correct in her statement, but with the continuing advancement of technology and the gradual improvement of Natural Language Processing techniques, the day is coming when her statement might be falsified.
Mulvany is still correct in her statement, but with the continuing advancement of technology and the gradual improvement of Natural Language Processing techniques, computers may one day be able to recreate the index in Mulvany's book.
12 changes: 6 additions & 6 deletions tex/40-bibliography.tex
Original file line number Diff line number Diff line change
Expand Up @@ -3,15 +3,9 @@
\bibitem{open-nlp}
Apache Software Foundation. 2010. Welcome to Apache OpenNLP. [Online]. Available from: \url{https://opennlp.apache.org/}. Accessed 2014 Apr 16.

\bibitem{beautifulsoup}
Richardson L. 2014. Beautiful Soup. [Online]. Available from: \url{http://www.crummy.com/software/BeautifulSoup/}. Accessed 2014 Feb 26.

\bibitem{biology}
Avissar Y, Choi J, DeSaix J, Jurukovski V, Wise R, Rye C. 2013. Biology [Online]. 1st. OpenStax College. Available from: \url{http://goo.gl/hKHVhZ}. Accessed 2014 Feb 20.

\bibitem{bit-of-news}
Xu X. 2013. Bit-of-News: About. [Online]. Available from: \url{http://bitofnews.com/about.html}. Accessed 2014 Mar 1.

\bibitem{nlpwp}
Bird S, Klein E, Loper E. Natural Language Processing with Python. 1st ed. Beijing: O'Reilly; 2009.

Expand Down Expand Up @@ -72,6 +66,9 @@
\bibitem{pypi-ranking}
PyPI Ranking. 2014. All Time Package Rankings. [Online]. Available from: \url{http://pypi-ranking.info/alltime}. Accessed 2014 Apr 16.

\bibitem{beautifulsoup}
Richardson L. 2014. Beautiful Soup. [Online]. Available from: \url{http://www.crummy.com/software/BeautifulSoup/}. Accessed 2014 Feb 26.

\bibitem{rish}
Rish, I. An empirical study of the naive Bayes classifier. IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence [Internet]. 2001 [cited 2014 Apr 14]; 3(22). Available from: \url{http://www.research.ibm.com/people/r/rish/papers/RC22230.pdf}

Expand All @@ -96,4 +93,7 @@
\bibitem{wiki-dumps}
Wikimedia Foundation. 2014. Wikimedia Downloads. [Online]. Available from: \url{http://dumps.wikimedia.org/}. Accessed 2014 Feb 20.

\bibitem{bit-of-news}
Xu X. 2013. Bit-of-News: About. [Online]. Available from: \url{http://bitofnews.com/about.html}. Accessed 2014 Mar 1.

\end{thebibliography}
7 changes: 6 additions & 1 deletion tex/45-appendices.tex
Original file line number Diff line number Diff line change
Expand Up @@ -5,4 +5,9 @@ \section{Database Schema Definition}

\section{\Naive Bayes Classifier (classifier.py)}
\label{appendix:b}
\lstinputlisting[language=Python]{code/tools/classifier.py}
\lstinputlisting[language=Python]{code/tools/classifier.py}

\section{Other Data and Source Code}

Throughout the course of this paper, this research mentions several tools and data sets whose source does not appear above in order to keep the number of printed pages to a reasonable number.
All of theses tools, and the source of this paper, are available online free of charge at \url{https://undergraduate-thesis.mikeholler.me}.

0 comments on commit 9914876

Please sign in to comment.