Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft for supporting DBLP KG #4

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open

Conversation

Reham-Osama
Copy link

I added an explanation for why I made my changes within the code.

I have a couple of questions related to these changes

  1. When I ran the system on DBPedia KG it took around 13 hours to finish, but with the code here DBLP took only 7 mins, is that variance in the processing time expected ?
  2. The output files contain a lot of generated boolean questions, is there a way to include boolean questions but limit their number?
  3. I understood that there are two offsets that controls the node types to use and the seed nodes, what is the range of these values, and What are their expected effect?
    1. RandomSeedGenerator.addBranchesFromList (Line 102): Is this offset for choosing which node types to include
    2. RandomSeedGenerator.addBranchs (Line 125): Is this offset for choosing nodes that belongs to one node type.

@@ -122,9 +122,9 @@ public static ArrayList<Predicate> getAvailablePredicates(boolean order) {
predicate.setPredicateURI(result.getString("PredicateURI"));
predicate.setPredicateContext(new PredicateContext(result.getString("Context_Subject"), result.getString("Context_Object"), 0));
predicate.setWeight(result.getLong("ContextWeight"));
if (!predicate.getPredicateURI().contains("#")) {
// if (!predicate.getPredicateURI().contains("#")) {
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I made this change because most of the dblp predicates contains #, Do you think that this will affect the overall performance?

@@ -53,7 +53,7 @@ public static void generateShapes() throws Exception {

int benchmarkNumber = 37;

for (int j = 480; j < 10000; j = j + 10) {
for (int j = 0; j < 10000; j = j + 10) {
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I changed this to be able to have some results on DBLP. Without it, I was receiving no output files
Will this affect the performance or the output ?

}
}

private ArrayList<VariableSet> filterResult( ArrayList<VariableSet> result) {
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I added this method because querying the RDF engine using the unwantedPropertiesString returned no results at all, so I returned all predicates and removed the unneeded using this method
I called it from getPredicateList_EntityObjects

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant