Skip to content

Conversation

@as51340
Copy link
Contributor

@as51340 as51340 commented Oct 20, 2025

A new query is added LOAD PARQUET FROM.... New library arrow is added which is built through libs folder functionality. Loading parquet files uses a prefetcher thread which operates in column batches and from such column-wise oriented data, assembles rows which the main execution thread at the end just pulls. That's how we hide the cost of assembling rows from column-oriented data. To avoid copying allocated data from the background thread to the main thread, we make use of thread-safe memory resource. A new data queue is added which is used as a shared resource between the prefetcher and the main thread.

The class utils::Date supports instantiation now using the number of days and the number of microseconds passed since UNIX epoch.

@as51340 as51340 self-assigned this Oct 20, 2025
@as51340 as51340 added CI -build=community -test=core Run community build and core tests on push CI -build=coverage -test=core Run coverage build and core tests on push CI -build=jepsen -test=core Run jepsen build and core tests on push CI -build=debug -test=core Run debug build and core tests on push CI -build=release -test=core Run release build and core tests on push CI -build=release -test=e2e Run release build and e2e tests on push CI -build=coverage -test=clang_tidy labels Oct 20, 2025
Base automatically changed from toolchain-v7 to master October 21, 2025 08:50
@as51340 as51340 changed the title Feat/queue based batching thread allocators v2 feat: Add LOAD PARQUET clause Oct 21, 2025
@as51340 as51340 added the feature feature label Oct 21, 2025
@as51340 as51340 force-pushed the feat/queue-based-batching-thread-allocators-v2 branch 2 times, most recently from ca2818e to 91490c4 Compare October 22, 2025 13:57
@as51340 as51340 requested a review from andrejtonev October 25, 2025 16:55
@as51340 as51340 changed the title feat: Add LOAD PARQUET clause feat: Support loading parquet files from local disk Oct 27, 2025
@as51340 as51340 enabled auto-merge October 27, 2025 07:19
@as51340 as51340 removed CI -build=community -test=core Run community build and core tests on push CI -build=coverage -test=core Run coverage build and core tests on push CI -build=jepsen -test=core Run jepsen build and core tests on push CI -build=debug -test=core Run debug build and core tests on push CI -build=release -test=core Run release build and core tests on push CI -build=release -test=e2e Run release build and e2e tests on push CI -build=release -test=benchmark Run release build and benchmark on push CI -build=coverage -test=clang_tidy CI -package=ubuntu-24.04 CI -package=debian-13 CI -package=fedora-42 CI -package=fedora-42-arm labels Oct 27, 2025
@as51340
Copy link
Contributor Author

as51340 commented Oct 27, 2025

Tracking

  • [Link to Epic/Issue]

Standard development

CI Testing Labels

  • Select the appropriate CI test labels (CI -build=build-name -test=test-suite)

Documentation checklist

  • Add the documentation label
  • Add the bug / feature label
  • Add the milestone for which this feature is intended
    • If not known, set for a later milestone
  • Write a release note, including added/changed clauses
    • Memgraph now supports loading Parquet files directly from the local filesystem. Users can now use LOAD PARQUET FROM /path/to/file.parquet in their Cypher queries for efficient bulk data import. This eliminates the need for manual CSV conversion and provides significantly faster import performance for large datasets. #3356
    • What has changed? What does it mean for a user? What should a user do with it? [#{{PR_number}}]({{link to the PR}})
  • Documentation PR link
    • Is back linked to this development PR

@as51340 as51340 added this pull request to the merge queue Oct 27, 2025
@sonarqubecloud
Copy link

Merged via the queue into master with commit 00ebb11 Oct 27, 2025
33 checks passed
@as51340 as51340 deleted the feat/queue-based-batching-thread-allocators-v2 branch October 27, 2025 08:58
@gitbuda gitbuda mentioned this pull request Nov 9, 2025
35 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Docs needed Docs needed feature feature

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants