This project is a deliverable prduct of my capstone project that focuses on an Advanced Search Engine & Recommendation System. The system enhances entity search by retrieving detailed information from DBpedia and recommending related entities using clustering-based techniques.
This project involved extensive research, including:
- Manual Knowledge Graphs vs. DBpedia-based Knowledge Graphs
- Embedding techniques (BERT, Word2Vec, etc.)
- Clustering methodologies for entity recommendations
- Manual Knowledge Graph construction with predefined relationships.
- Entity embedding generation using BERT and Word2Vec.
- Clustering of entities based on similarity.
- DBpedia-based Knowledge Graph analysis.
- Extracts structured entity information using SPARQL queries.
- Generates embeddings and applies clustering techniques.
- Final integration of the Advanced Search Engine & Recommendation System.
- Processes and stores clustered entities in
clusters.jsonfor use in the application.
A Streamlit-based interactive application that:
- Searches for entities and retrieves Wikipedia data (summary & images).
- Finds related entities from the same cluster and displays them with images & descriptions.
- Comparative analysis of manual vs. DBpedia-based KG.
- Clustering evaluation for better recommendations.
- Future improvements: Expanding the KG, refining embeddings, and integrating real-time knowledge updates.
This project is a practical application of advanced search and recommendation techniques, demonstrating deep research in Knowledge Graphs, NLP, and AI-driven search. 🚀
This project is an Advanced Search Engine that integrates Clustering-based Recommendations to provide detailed information about specific entities and suggest related entities from the same cluster. It is designed to enhance information retrieval by grouping entities into clusters based on criteria such as profession, field of work, or other similarities.
- Entity Search: Users can search for specific entities (e.g., Albert Einstein, Lionel Messi, Chanchal Chowdhury) to retrieve detailed information about them.
- Clustering-based Recommendations: The system groups entities into clusters and recommends related entities from the same cluster when a user searches for a specific entity.
- User-Friendly Interface: The system prompts users to enter an entity name to initiate a search, making it easy to use.
- Example Clusters:
- Cluster 1: Bangladeshi actors (e.g., Chanchal Chowdhury, Zahid Hasan, K.M. Mosharaf Hossain).
- Cluster 2: Famous footballers (e.g., Lionel Messi, Pelé, Neymar, Cristiano Ronaldo).
- Cluster 3: Renowned scientists and thinkers (e.g., Albert Einstein, Isaac Newton, Charles Darwin, Nikola Tesla).
-
Search for an Entity:
- Enter the name of the entity you want to search for (e.g.,
Albert_Einstein,Lionel Messi, orChanchal Chowdhury). - The system will display detailed information about the entity.
- Enter the name of the entity you want to search for (e.g.,
-
View Recommendations:
- After displaying the details of the searched entity, the system will recommend related entities from the same cluster.
- For example, if you search for
Albert Einstein, the system may recommendIsaac Newton,Charles Darwin, andNikola Tesla.
-
Explore Clusters:
- You can explore different clusters by searching for entities within specific domains (e.g., actors, footballers, scientists).
- Search for
Lionel Messito get details about him and recommendations likePelé,Neymar, andCristiano Ronaldo. - Search for
Chanchal Chowdhuryto get details about him and recommendations likeZahid HasanandK.M. Mosharaf Hossain.
The project aims to enhance information retrieval by providing detailed information about specific entities and suggesting related entities. This can be useful for research, exploration, or discovering similar figures or topics.




