Name: William Zhen
Student ID: 20792351
Email: [email protected]
Ensure you have Java installed!
This program was wirtten with Java JDK 17.
Click here to install Java
Compile the program if needed. In the main directory of the program:
- $ cd src
- $ javac search_engine/main/SearchEngine.java
To run the program:
$ java search_engine.main.SearchEngine
The first prompt will prompt the user to locate it's latimes index folder.
After metadata, lexicon, and inverted index has been loaded, the user can now start using the program.
In the ranked retrieval query, choose an integer from ranked 1-10 to get the full raw document OR
N - new query (lowercase or uppercase)
Q - quit (lowercase or uppercase)
Instructions (Git repo instructions)
- Open Terminal
- Change the current working directory to the location where you want the cloned directory
- $ git clone https://github.com/UWaterloo-MSCI-541/msci-541-f21-hw5-RandumbWilliam.git
- Press Enter to create your local clone
- $ cd msci-541-f21-hw5-RandumbWilliam/
- $ cd SearchEngine/src
- $ javac search_engine/main/SearchEngine.java
- $ java search_engine.main.SearchEngine
- Enter your path to the latimes index folder
- Wait for metadata, lexicon, and inverted index to load in
- Once it prompts you to "Enter your query: ", you can enter your query and hit Enter!
- Once your results have been retrieved, it'll prompt you to get one of the ranked documents or new query or quit
"Enter the document you wish to view by entering the rank ["N" new query | "Q" quit]: " - Finish
- Open Terminal
- $ cd Desktop
- $ mkdir RepoClone
- $ cd RepoClone
- $ git clone https://github.com/UWaterloo-MSCI-541/msci-541-f21-hw5-RandumbWilliam.git
- Enter
- $ cd msci-541-f21-hw4-RandumbWilliam/
- $ cd SearchEngine/src
- $ javac search_engine/main/SearchEngine.java
- $ java search_engine.main.SearchEngine
- Enter your path to the latimes-index: /Users/williamzhen/Desktop/latimes-index
- Wait
- Enter your query: UV damage eyes
- Enter the document you wish to view by entering the rank ["N" new query | "Q" quit]:
- Finish
- Interactive command line input
- Execution timer
- Split into sentences
- Priority queue comparator
- Thank you to Anjali Mistry, 4th year biology (not in this course), helped designed longest contiguous k for query biased snippet
- Croft, Bruce, et al. Search Engines: Information Retrieval in Practice. Pearson Education, 2010