GraphRAG on CPU Example #2002

edlee123 · 2025-05-27T17:54:46Z

Description

New Feature

GraphRAG/docker_compose/intel/cpu/xeon/compose.yaml: allows deployment of GraphRAG on CPU (Xeon or laptop). It uses OpenAI-like LLM endpoints for the dataprep and retriever graph components, and vllm-cpu for the final LLM.
GraphRAG/docker_compose/intel/cpu/xeon/README.md: instructions for deploying the above.
GraphRAG/example_data/README.md: a small example dataset of relations between programming languages.
GraphRAG/graphrag.py: improved formatting of logs, and handles the deprecation of the pydantic parse_obj method by switching to model_validate.

Bug Fix

GraphRAG/ui/svelte/src/lib/network/chat/Network.ts: previously hardcoded the model to "Intel/neural-chat-7b-v3-3" in the UI request. The model is now configured by an environment variable (LLM_MODEL_ID).
GraphRAG/docker_compose/intel/hpu/gaudi/compose.yaml: related to the above, configures the model for the UI from an environment variable in Gaudi deployments.
GraphRAG/graphrag.py: the align_generator method needed to emit strings in byte format for them to render properly in the UI. This method now aligns with ChatQnA.
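
As a rough illustration of the byte format mentioned in the last item, the sketch below formats one text chunk as a server-sent-event line whose payload is the repr of the UTF-8 bytes, which is the shape of the yield statement quoted in the Copilot review of this PR. The helper names sse_bytes_line and decode_sse_bytes_line are hypothetical and are not taken from graphrag.py or the UI code.

```python
import ast


def sse_bytes_line(text: str) -> str:
    """Format one chunk as an SSE data line carrying the repr of its UTF-8 bytes."""
    return f"data: {repr(text.encode('utf-8'))}\n\n"


def decode_sse_bytes_line(line: str) -> str:
    """Recover the original text from a line produced by sse_bytes_line (assumed client-side step)."""
    payload = line[len("data: "):].strip()
    # The payload is a Python bytes literal such as b'Hello'; parse it safely, then decode.
    return ast.literal_eval(payload).decode("utf-8")


if __name__ == "__main__":
    line = sse_bytes_line("Hello")
    print(repr(line))
    print(decode_sse_bytes_line(line))
```

Sending the repr of the bytes keeps non-ASCII characters and embedded newlines on a single data line, at the cost of requiring the consumer to undo the bytes-literal encoding.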

Issues

N/A.

Type of change

List the type of change like below. Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds new functionality)
Breaking change (fix or feature that would break existing design and interface)
Others (enhancement, documentation, validation, etc.)

Dependencies

Depends on the following:
• Feature: OpenAI-compatible endpoint for text generation by edlee123 · Pull Request #1395 · opea-project/GenAIComps
• Update GraphRAG dataprep and retriever components by edlee123 · Pull Request #1746 · opea-project/GenAIComps

Tests

Would want to be able to pull the updated components listed under Dependencies in order to create a test like the one for the Gaudi version.

github-actions · 2025-05-27T17:55:00Z

Dependency Review

✅ No vulnerabilities or license issues found.

Scanned Files

None

Copilot

Pull Request Overview

This pull request introduces the "GraphRAG on CPU Example" feature and a bug fix for configuring the model in the UI, along with improved logging and deprecation handling in the backend. Key changes include:

In GraphRAG/ui/svelte/src/lib/network/chat/Network.ts, replacing the hardcoded model with an environment variable (LLM_MODEL_ID) and adding a guard clause.
In GraphRAG/graphrag.py, migrating from print statements to structured logging and updating Pydantic parsing methods.
New and updated Docker Compose files and documentation for deploying GraphRAG on CPU (Xeon or laptop) configurations.

Reviewed Changes

Copilot reviewed 7 out of 9 changed files in this pull request and generated no comments.

File	Description
GraphRAG/ui/svelte/src/lib/network/chat/Network.ts	Uses LLM_MODEL_ID from the environment instead of a hardcoded model value
GraphRAG/graphrag.py	Improves logging output, replaces deprecated Pydantic parse calls with model_validate, and aligns generator output
GraphRAG/example_data/README.md	Provides example datasets to test GraphRAG functionality
GraphRAG/docker_compose/intel/hpu/gaudi/compose.yaml	Configures the UI to use the model from an environment variable
GraphRAG/docker_compose/intel/cpu/xeon/compose.yaml	New file defining the CPU deployment configuration
GraphRAG/docker_compose/intel/cpu/xeon/README.md	Offers deployment instructions and context for the CPU example
GraphRAG/docker_compose/intel/cpu/xeon/GraphRAG_LLM_notes.md	Documents the roles and configuration of the three LLMs used in the application

Files not reviewed (2)

GraphRAG/docker_compose/intel/cpu/xeon/set_env.sh: Language not supported
GraphRAG/example_data/programming_languages.txt: Language not supported

Comments suppressed due to low confidence (2)

GraphRAG/graphrag.py:159

In the exception block of the align_generator method, consider logging the exception details (e.g., using logger.debug or logger.error) before yielding the raw JSON string. This will improve troubleshooting during debugging of LLM output processing.

yield f"data: {repr(json_str.encode('utf-8'))}\n\n"
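
A minimal sketch of what this suggestion could look like, assuming the generator receives raw byte lines in an OpenAI-style streaming shape (choices[0].delta.content). The function name align_generator_sketch, the logger name, and the input shape are assumptions for illustration; this is not the actual method in graphrag.py.

```python
import json
import logging

logger = logging.getLogger("graphrag_sketch")  # hypothetical logger name


def align_generator_sketch(lines):
    """Re-emit streamed LLM chunks as SSE lines in bytes-repr format."""
    for line in lines:
        text = line.decode("utf-8")
        # Keep only the JSON object embedded in the line, if there is one.
        start = text.find("{")
        end = text.rfind("}") + 1
        json_str = text[start:end]
        try:
            data = json.loads(json_str)
            content = data["choices"][0]["delta"]["content"]
            yield f"data: {repr(content.encode('utf-8'))}\n\n"
        except Exception as exc:
            # Log the failure before falling back to the raw string,
            # so malformed chunks can be traced while debugging.
            logger.debug("Could not parse chunk %r: %s", json_str, exc)
            yield f"data: {repr(json_str.encode('utf-8'))}\n\n"
    yield "data: [DONE]\n\n"


if __name__ == "__main__":
    logging.basicConfig(level=logging.DEBUG)
    chunks = [
        b'data: {"choices":[{"delta":{"content":"Hi"}}]}\n\n',
        b"data: not json\n\n",
    ]
    for out in align_generator_sketch(chunks):
        print(repr(out))
```

Logging at debug level keeps normal runs quiet while still recording which chunk failed and why when verbose logging is enabled.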

GraphRAG/docker_compose/intel/cpu/xeon/compose.yaml:17

[nitpick] For consistency and clarity, consider using uppercase naming for environment variables, e.g., change to NEO4J_APOC_EXPORT_FILE_ENABLED.

- NEO4J_apoc_export_file_enabled=true

edlee123 · 2025-06-09T18:13:26Z

PR will require these updated components to work:

opea-project/GenAIComps#1395
opea-project/GenAIComps#1746

ashahba

The PR looks good but I have one question:
How are the datasets created? Manually or AI Generated?

edlee123 · 2025-06-13T17:47:49Z

> The PR looks good but I have one question: How are the datasets created? Manually or AI Generated?

I think I'll modify these a bit to quote directly from Wikipedia and cite the source.

ashahba

LGTM!

Signed-off-by: Ed Lee <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Rachel R <[email protected]> Co-authored-by: Abolfazl Shahbazi <[email protected]> Signed-off-by: alexsin368 <[email protected]>

Signed-off-by: Ed Lee <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Rachel R <[email protected]> Co-authored-by: Abolfazl Shahbazi <[email protected]> Signed-off-by: cogniware-devops <[email protected]>

edlee123 added 8 commits May 23, 2025 10:58

First version of GraphRAG Xeon for LLM, and open endpoint for graph indexing

770ee94

Signed-off-by: Ed Lee <[email protected]>

Updated align_generator to be like ChatQnA bytes format. Added logging improvements, and updated pydantic model validations from parse_obj to model_validate

ca90df5

Signed-off-by: Ed Lee <[email protected]>

Working GraphRAG on laptop cpu with help of three LLM endpoints for dataprep, receiver, and final output. The UI payload had to be updated since the model was hardcoded and not based on LLM_MODEL_ID

1658169

Signed-off-by: Ed Lee <[email protected]>

The GraphRAG gaudi compose.yaml had to be updated so the UI can pass LLM_MODEL_ID env variable. Before the UI request had model hardcoded.

42d9d41

Signed-off-by: Ed Lee <[email protected]>

Provided build instructions of GraphRAG images for Xeon

fa1bc52

Signed-off-by: Ed Lee <[email protected]>

Small edits to GraphRAG xeon readme

b6e7729

Signed-off-by: Ed Lee <[email protected]>

Merge branch 'main' into graphrag_workshop

70330c5

Add architecture container diagram

6606ab0

Signed-off-by: Ed Lee <[email protected]>

Copilot AI review requested due to automatic review settings May 27, 2025 17:54

edlee123 requested review from ashahba and rbrugaro as code owners May 27, 2025 17:54

Copilot AI reviewed May 27, 2025

joshuayao added the feature New feature or request label May 29, 2025

joshuayao added this to the v1.4 milestone May 29, 2025

joshuayao added this to OPEA May 29, 2025

joshuayao moved this to In review in OPEA May 29, 2025

Fix to compose.yaml dataprep-neorj-llamaindex port and dataprep component name.

d50bf14

Signed-off-by: Ed Lee <[email protected]>

edlee123 force-pushed the graphrag_workshop branch from 57e301b to d50bf14 Compare June 5, 2025 22:42

pre-commit-ci bot and others added 8 commits June 5, 2025 22:43

[pre-commit.ci] auto fixes from pre-commit.com hooks

a61e299

for more information, see https://pre-commit.ci

Recovering changes in README, compose.yaml, and example_data

0cf4c1f

Signed-off-by: Ed Lee <[email protected]>

Try to fix bash blocks for sphinx formatting

27be954

Signed-off-by: Ed Lee <[email protected]>

Merge branch 'main' into graphrag_workshop

9bd1689

Merge branch 'main' into graphrag_workshop

68e44de

Fix dataprep service port to be internal port

0a03771

Signed-off-by: Ed Lee <[email protected]>

Merge branch 'graphrag_workshop' of github.com:edlee123/GenAIExamples into graphrag_workshop

7738e2f

Clean up comments

9885a8f

Signed-off-by: Ed Lee <[email protected]>

Merge branch 'main' into graphrag_workshop

e85ed3d

chickenrae self-requested a review June 9, 2025 19:45

chickenrae approved these changes Jun 9, 2025

edlee123 and others added 2 commits June 11, 2025 17:27

Merge branch 'main' into graphrag_workshop

1ba7797

Merge branch 'main' into graphrag_workshop

8e55578

ashahba reviewed Jun 13, 2025

ashahba added 2 commits June 12, 2025 22:27

Merge branch 'main' into graphrag_workshop

19915b9

Merge branch 'main' into graphrag_workshop

324b92a

edlee123 added 2 commits June 13, 2025 15:08

Updated example data to refer to wikipedia source

2290cd7

Signed-off-by: Ed Lee <[email protected]>

Merge branch 'graphrag_workshop' of github.com:edlee123/GenAIExamples into graphrag_workshop

fe99232

edlee123 requested a review from ashahba June 13, 2025 20:13

ashahba approved these changes Jun 13, 2025

rbrugaro approved these changes Jun 13, 2025

ashahba merged commit c873c58 into opea-project:main Jun 13, 2025
15 checks passed

github-project-automation bot moved this from In review to Done in OPEA Jun 13, 2025
