Provide framework for generic lazily evaluated operation results #1350

RobinTF · 2024-05-18T00:32:19Z

Still WIP. Currently missing:

* Discussion about remaining TODOs
* Lots of unit tests
* Most likely some functions will need to be broken up into smaller pieces once everything else has been found to work "correctly"
* Documentation of all newly introduced functions once they become somewhat "final"
* Cold Fusion & World domination?

RobinTF · 2024-05-18T00:36:16Z

src/engine/Operation.cpp

+          result._resultPointer->resultTable()->idTable().numColumns();
+      LOG(DEBUG) << "Computed result of size " << resultNumRows << " x "
+                 << resultNumCols << std::endl;
+    }


Does this debug message provide any real benefit to make it worth somehow incorporating it into lazily evaluated operations?

codecov · 2024-05-18T00:56:07Z

Codecov Report

Attention: Patch coverage is 73.56322% with 184 lines in your changes missing coverage. Please review.

Project coverage is 88.57%. Comparing base (797f325) to head (e5ceacc).

Files	Patch %	Lines
src/engine/Result.cpp	57.76%	98 Missing and 8 partials ⚠️
src/util/CacheableGenerator.h	85.80%	1 Missing and 22 partials ⚠️
src/engine/Operation.cpp	74.11%	21 Missing and 1 partial ⚠️
src/engine/IndexScan.cpp	5.88%	15 Missing and 1 partial ⚠️
src/util/Cache.h	82.08%	0 Missing and 12 partials ⚠️
src/engine/ExportQueryExecutionTrees.cpp	93.33%	0 Missing and 3 partials ⚠️
src/engine/QueryExecutionTree.cpp	83.33%	0 Missing and 1 partial ⚠️
src/engine/QueryPlanner.cpp	50.00%	0 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #1350      +/-   ##
==========================================
- Coverage   89.00%   88.57%   -0.43%     
==========================================
  Files         329      331       +2     
  Lines       29155    29773     +618     
  Branches     3236     3327      +91     
==========================================
+ Hits        25948    26370     +422     
- Misses       2055     2204     +149     
- Partials     1152     1199      +47


This PR contains all the changes from the infrastructure for lazy operation evaluation (#1350) that are simple and repetitive, but touch many files. In particular:

* Rename the `ResultTable` class to `Result` (a TODO suggested by @hannahbast some time ago).
* Add a new parameter `bool requestLaziness` to `Operation::computeResult`. This parameter is currently unused.
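To make the two changes above more concrete, here is a minimal sketch of how a `requestLaziness` flag could eventually be honoured by a `Result` that holds either a materialized table or a lazy source. Only the names `Result`, `computeResult`, and `requestLaziness` come from the description; `Row`, `Table`, `LazyChunks`, `DemoScan`, and the use of `std::function` as a stand-in for a generator are assumptions made for illustration and are not the actual QLever API.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <functional>
#include <optional>
#include <utility>
#include <variant>
#include <vector>

// Hypothetical stand-ins for illustration; not the real QLever types.
using Row = std::vector<int>;
using Table = std::vector<Row>;
// Pull-based stand-in for a generator: each call returns the next chunk,
// or std::nullopt once everything has been produced.
using LazyChunks = std::function<std::optional<Table>()>;

// A result that is either fully materialized or produced lazily.
class Result {
 public:
  explicit Result(Table table) : data_{std::move(table)} {}
  explicit Result(LazyChunks chunks) : data_{std::move(chunks)} {}

  bool isFullyMaterialized() const {
    return std::holds_alternative<Table>(data_);
  }
  const Table& table() const {
    assert(isFullyMaterialized());
    return std::get<Table>(data_);
  }
  LazyChunks& chunks() {
    assert(!isFullyMaterialized());
    return std::get<LazyChunks>(data_);
  }

 private:
  std::variant<Table, LazyChunks> data_;
};

// Hypothetical operation producing the rows {0}, {1}, ..., {numRows - 1}.
struct DemoScan {
  std::size_t numRows;
  std::size_t chunkSize;

  Result computeResult(bool requestLaziness) const {
    assert(chunkSize > 0);
    if (!requestLaziness) {
      // Eager path: build the complete table up front.
      Table all;
      for (std::size_t i = 0; i < numRows; ++i) {
        all.push_back({static_cast<int>(i)});
      }
      return Result{std::move(all)};
    }
    // Lazy path: the lambda remembers its position between calls.
    LazyChunks chunks = [next = std::size_t{0}, n = numRows,
                         c = chunkSize]() mutable -> std::optional<Table> {
      if (next >= n) return std::nullopt;
      Table chunk;
      for (std::size_t end = std::min(n, next + c); next < end; ++next) {
        chunk.push_back({static_cast<int>(next)});
      }
      return chunk;
    };
    return Result{std::move(chunks)};
  }
};
```

A caller would check `isFullyMaterialized()` and then either read `table()` or repeatedly invoke `chunks()` until it returns `std::nullopt`. The flag is only a request in this sketch: an operation is free to ignore it and return a materialized result.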

This makes the code much simpler and makes no difference for almost all queries. The expensive part (reading from disk and decompressing) is still done in parallel; only the writing to the `IdTable` is now serialized, and there is an additional copy compared to before. An example of a query that is slower because of this change: materialize a large index scan (for example, for the predicate `rdf:type`) and group by subject (there is a shortcut for grouping by object when there are few objects). But such queries will become lazy soon anyway (see #1350), and then this will be irrelevant.
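The trade-off described above can be sketched as follows: a materialized scan simply drains a lazy scan and appends each block to one table, so writes happen one block after another and every row is copied once more. This is an illustrative sketch only. `Block`, `IdTableSketch`, `LazyScan`, `makeLazyScan`, and `materialize` are hypothetical names, and the parallel reading and decompression mentioned above is deliberately not modelled here.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <optional>
#include <utility>
#include <vector>

// Hypothetical stand-ins for illustration; not the real QLever types.
using Row = std::vector<int>;
using Block = std::vector<Row>;          // one decompressed block of a scan
using IdTableSketch = std::vector<Row>;  // the materialized result
// Pull-based stand-in for a lazy scan: returns the next block, or
// std::nullopt once the scan is exhausted.
using LazyScan = std::function<std::optional<Block>()>;

// Build a lazy scan over blocks that already exist in memory. In a real
// system each call would hand out a block that was read and decompressed
// elsewhere (possibly in parallel).
inline LazyScan makeLazyScan(std::vector<Block> blocks) {
  return [blocks = std::move(blocks),
          next = std::size_t{0}]() mutable -> std::optional<Block> {
    if (next >= blocks.size()) return std::nullopt;
    return blocks[next++];
  };
}

// Materialize by draining the lazy scan. Blocks are appended strictly one
// after the other (serialized writing), and each row is copied into the
// table (the additional copy).
inline IdTableSketch materialize(LazyScan scan) {
  IdTableSketch table;
  while (std::optional<Block> block = scan()) {
    table.insert(table.end(), block->begin(), block->end());
  }
  return table;
}
```

Sharing one code path between the lazy and the materialized scan is what makes this kind of design simpler; the price is the sequential append loop shown in `materialize`.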

sonarcloud · 2024-06-30T19:40:25Z

Quality Gate passed

Issues
23 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

RobinTF mentioned this pull request May 21, 2024

Refactoring preliminaries for lazy operations (Part 1) #1352

Merged

RobinTF added 25 commits May 23, 2024 16:26

Rename ResultTable -> Result

80667bd

Wrap idTable in variant

31b2c11

Add ability to create Result from generator

4d0204c

Start fixing caching issues

515ed0c

Avoid another class of exceptions

ca1cbed

Optimize imports

9e7f3cb

Introduce ReusableGenerator class

4c75d42

Try to make caching work

892e4a5

Fiddle around with const a bit

586365c

Add more TODOs

80e2dbd

Fix TextLimit code after rebase

18ca5b1

Fix compilation issues for ReusableGenerator

86a9f4b

Remove offset calculations from exporter

7f0a5e7

Fix typo

aee20dd

Add comments

7576b2e

Make supportsLimit private to avoid misuse

7765a25

Properly use minimum limit if present

f815be8

Start adding code to manipulate code after cache extraction

90cca50

Implement fallback mechanism for failed cache share

694c21f

Fix accidental edit of Usage.md

ea8b81f

Consume result as master

50e4529

Add proper condition variables

16eedd8

Implement code that allows for proper recomputation of cache size

bf8f085

Refactor a bit

771eb5b

Aggregate tables at the end of lazy results

8aa9060

RobinTF added 8 commits May 28, 2024 00:18

Add back headers

ef17e67

Add back result limiter for subqueries

aabb81b

Try to fix subtle bug with runtime information detail

66a38b4

Merge branch 'max-send-changes' into refactor-result-table

999baee

Merge remote-tracking branch 'ad-freiburg/master' into refactor-result-table

c291ff7

Add back comment

9f17e07

Rename resultTable -> result

389f3f1

Merge remote-tracking branch 'ad-freiburg/master' into refactor-result-table

000af28

RobinTF force-pushed the refactor-result-table branch from c7ebab6 to 000af28 Compare June 6, 2024 16:31

RobinTF added 4 commits June 9, 2024 22:04

Add correctness check to prevent double move due to race condition

ba142a0

Start implementing tests for new cache feature and fixing bugs along the way

44562c7

Some Test cleanup

0f3a59a

Mark variable as maybe_unused

d226849

hannahbast mentioned this pull request Jun 13, 2024

Implement materialized index scans by materializing lazy scans #1323

Merged

RobinTF added 5 commits June 13, 2024 23:31

Merge remote-tracking branch 'ad-freiburg/master' into refactor-result-table

552a268

Restructure recomputeSize a bit to avoid unwanted behaviour

cde135a

Add remaining cache tests

cf6b4c9

Merge remote-tracking branch 'ad-freiburg/master' into refactor-result-table

b2138bf

Add tests for IteratorWrapper

0c589e3

RobinTF added 8 commits June 15, 2024 16:55

Fix line endings

c465685

Merge remote-tracking branch 'ad-freiburg/master' into refactor-result-table

d17fc7d

Add tests for CacheableGenerator

93c2360

Add Filter tests

15b435e

Clear Cache before running tests

633bf06

Add test to fix coverage

6d5a95e

Address some sonarcloud issues

55b4fec

Add tests for ExportQueryExecutionTrees

e5ceacc
