
Conversation

Contributor

@edlee123 edlee123 commented Jun 10, 2025

Description

Allows ChatQnA to be used with thousands of OpenAI-compatible endpoints, e.g. OpenRouter.ai, Hugging Face, and Denvr, and improves the developer experience so OPEA can be tried quickly even in low-resource environments (no local inference service needs to be spun up).

Key Changes Made

  1. Created ChatQnA/docker_compose/intel/cpu/xeon/README_endpoint_openai.md: instructions for spinning up the example.
  2. Created ChatQnA/docker_compose/intel/cpu/xeon/compose_endpoint_openai.yaml: replaces vLLM with an OpenAI-compatible endpoint.

Also:

  • Fixed the align_generator function to properly detect and skip chunks whose content is null in responses from OpenAI-compatible endpoints. Previously the raw null JSON was shown in the UI.
  • Added better error handling and debug logging for easier troubleshooting of endpoint issues.
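The null-chunk fix described above can be sketched as follows. This is a hypothetical, simplified version of what an align_generator along these lines might do (the actual function in graphrag.py/chatqna.py differs); the SSE line format and the output framing are assumptions:

```python
import json

def align_generator(gen):
    """Sketch: forward streamed SSE chunks, skipping those whose delta
    content is null, as OpenAI-compatible endpoints may emit role-only
    or final chunks with "content": null."""
    for line in gen:
        line = line.strip()
        if not line.startswith("data:"):
            continue
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            yield "data: [DONE]\n\n"
            break
        try:
            chunk = json.loads(payload)
        except json.JSONDecodeError:
            continue  # drop malformed chunks instead of forwarding raw text
        choices = chunk.get("choices") or []
        content = choices[0].get("delta", {}).get("content") if choices else None
        if content is None:
            continue  # previously the raw null JSON leaked into the UI
        yield f"data: {content}\n\n"
```

Feeding it a role-only chunk, a content chunk, and a [DONE] marker would emit only the content and the terminator.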

Issues

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)
  • Others (enhancement, documentation, validation, etc.)

Dependencies

Tests

Tested on:
OpenRouter.ai: anthropic/claude-3.7-sonnet
Denvr: meta-llama/Llama-3.1-70B-Instruct

edlee123 and others added 30 commits March 17, 2025 15:55
…local inference service e.g., vllm

Signed-off-by: Ed Lee <[email protected]>
…g improvements, and updated pydantic model validations from parse_obj to model_validate

Signed-off-by: Ed Lee <[email protected]>
…ataprep, receiver, and final output. The UI payload had to be updated since the model was hardcoded and not based on LLM_MODEL_ID

Signed-off-by: Ed Lee <[email protected]>
…LLM_MODEL_ID env variable. Before the UI request had model hardcoded.

Signed-off-by: Ed Lee <[email protected]>
Signed-off-by: Ed Lee <[email protected]>
Copilot AI review requested due to automatic review settings June 10, 2025 00:07

github-actions bot commented Jun 10, 2025

Dependency Review

✅ No vulnerabilities or license issues found.

Scanned Files

None

Contributor

Copilot AI left a comment


Pull Request Overview

This PR extends ChatQnA to support OpenAI-compatible endpoints and improves robustness by enhancing environment variable handling, skipping null content chunks, and adding detailed logging for debugging.

  • Introduces LLM_MODEL_ID validation in frontend and dynamically sets the model in requests.
  • Refactors align_generator, align_inputs, and align_outputs to skip null or malformed content and use structured logging.
  • Adds new Docker Compose configurations and READMEs for spinning up ChatQnA and GraphRAG services with external endpoints.

Reviewed Changes

Copilot reviewed 13 out of 13 changed files in this pull request and generated 3 comments.

Show a summary per file
File Description
GraphRAG/ui/svelte/src/lib/network/chat/Network.ts Validates and injects LLM_MODEL_ID into text requests
GraphRAG/graphrag.py Replaced prints with logger, skipped null chunks
GraphRAG/docker_compose/intel/* Added LLM_MODEL_ID env and endpoint README/composes
ChatQnA/docker_compose/intel/cpu/xeon/compose_endpoint_openai.yaml Swapped vLLM for an OpenAI-compatible textgen service
ChatQnA/chatqna.py Added structured logging to alignment methods
GraphRAG/example_data/programming_languages{,2}.txt + README.md Expanded example datasets and documentation
Comments suppressed due to low confidence (3)

GraphRAG/graphrag.py:203

  • Avoid bare except: which catches all exceptions; specify the exception type to prevent masking unexpected errors.
except:
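To illustrate the reviewer's point, a minimal sketch of narrowing a bare except, assuming the guarded block is parsing a JSON chunk (the function name parse_chunk is hypothetical, not from the PR):

```python
import json

def parse_chunk(raw: str):
    # Catch only the failure we expect (malformed JSON), so unrelated
    # errors -- KeyboardInterrupt, genuine bugs -- still surface instead
    # of being silently swallowed by a bare "except:".
    try:
        return json.loads(raw)
    except json.JSONDecodeError:
        return None  # treat a malformed chunk as skippable
```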

GraphRAG/graphrag.py:131

  • The new JSON parsing logic in align_generator should be covered by unit tests for TGI, OpenAI, and malformed chunks to prevent regressions.
def align_generator(self, gen, **kwargs):

GraphRAG/graphrag.py:141

  • This if appears over-indented relative to its surrounding lines and may introduce a syntax error; verify the indentation block.
if start == -1 or end <= start:

Comment on lines +9 to +11
import time
import uuid
from typing import Dict, List, Union

Copilot AI Jun 10, 2025


The time, uuid, and typing imports are unused; consider removing them to reduce clutter.

Suggested change
import time
import uuid
from typing import Dict, List, Union
# Removed unused imports: time, uuid, typing

Comment on lines +8 to 16

    if (!env.LLM_MODEL_ID) {
      throw new Error("LLM_MODEL_ID environment variable must be set");
    }
    const LLM_MODEL_ID = env.LLM_MODEL_ID;

    export async function fetchTextStream(query: string) {
      let payload = {};
      let url = "";


Copilot AI Jun 10, 2025


[nitpick] Throwing an error at module import may break SSR or bundling; consider checking LLM_MODEL_ID inside the fetchTextStream function or providing a fallback.

Suggested change

Before:

    if (!env.LLM_MODEL_ID) {
      throw new Error("LLM_MODEL_ID environment variable must be set");
    }
    const LLM_MODEL_ID = env.LLM_MODEL_ID;

    export async function fetchTextStream(query: string) {
      let payload = {};
      let url = "";

After:

    export async function fetchTextStream(query: string) {
      let payload = {};
      let url = "";
      const LLM_MODEL_ID = env.LLM_MODEL_ID || "default-model-id"; // Fallback value
      if (!env.LLM_MODEL_ID) {
        console.warn("LLM_MODEL_ID environment variable is not set. Using fallback value.");
      }

        condition: service_started
      tei-reranking-service:
        condition: service_started
      textgen-service-endpoint-openai:

Copilot AI Jun 10, 2025


[nitpick] It would be helpful to add a healthcheck for the textgen-service-endpoint-openai to ensure the service is ready before backend startup.
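A healthcheck along the lines the reviewer suggests might look like the sketch below. This is an assumption-laden fragment, not taken from the PR: the port, the /v1/models path, the availability of curl in the container, and the backend service name are all hypothetical and would need to match the actual compose file.

```yaml
  textgen-service-endpoint-openai:
    healthcheck:
      # Hypothetical: poll the OpenAI-compatible models route until the
      # endpoint answers; port and path are assumptions.
      test: ["CMD-SHELL", "curl -fs http://localhost:9000/v1/models || exit 1"]
      interval: 10s
      timeout: 5s
      retries: 10

  chatqna-xeon-backend-server:   # backend service name is an assumption
    depends_on:
      textgen-service-endpoint-openai:
        condition: service_healthy
```

With condition: service_healthy, Compose delays backend startup until the healthcheck passes, rather than merely until the container has started.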

@edlee123 edlee123 changed the title from "Chatqna Example with Openai Endpoint" to "ChatQnA Example with OpenAI-Compatible Endpoint" Jun 10, 2025
@edlee123 edlee123 closed this Jun 10, 2025
@edlee123
Contributor Author

I mixed two branches together. Going to make a clean PR.
