Conversation

Contributor

@edlee123 edlee123 commented Jun 10, 2025

Description

Allows ChatQnA to be used with thousands of OpenAI-like endpoints, e.g. OpenRouter.ai, Hugging Face, and Denvr, and improves the developer experience so OPEA can be tried quickly even in low-resource environments. Using endpoints does not require spinning up an inference service.
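For context, an OpenAI-compatible endpoint only needs a base URL, an API key, and a model name. A minimal sketch of assembling such a request (the URL, key, and helper name here are illustrative placeholders, not defaults from this PR):

```python
import json
import urllib.request

def build_chat_request(base_url: str, api_key: str, model: str, prompt: str):
    """Assemble a standard OpenAI-style chat completion request."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": True,
    }
    return urllib.request.Request(
        f"{base_url}/chat/completions",
        data=json.dumps(payload).encode(),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )

req = build_chat_request(
    "https://openrouter.ai/api/v1", "sk-...", "anthropic/claude-3.7-sonnet", "Hello"
)
print(req.full_url)  # https://openrouter.ai/api/v1/chat/completions
```

The same request shape works against any provider exposing the OpenAI API surface, which is what lets the compose file swap vLLM for a remote endpoint.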

Key Changes Made:

  1. Created ChatQnA/docker_compose/intel/cpu/xeon/README_endpoint_openai.md: instructions to spin up the example.
  2. Created ChatQnA/docker_compose/intel/cpu/xeon/compose_endpoint_openai.yaml: replaces vLLM with an OpenAI-like endpoint.

Also:

  • Fixed the align_generator function to properly detect and skip chunks whose content is null in OpenAI-like endpoints. Previously the raw null JSON was shown in the UI.
  • Added better error handling and debug logging for easier troubleshooting of endpoint issues.
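A minimal sketch of the null-skipping behavior described above (an illustrative simplification, not the exact chatqna.py implementation):

```python
import json

def align_generator(gen):
    """Yield only chunks whose delta content is non-null, dropping
    malformed JSON and null-content keep-alive chunks."""
    for line in gen:
        # OpenAI-style SSE lines look like: data: {...}
        if not line.startswith("data: "):
            continue
        chunk = line[len("data: "):].strip()
        if chunk == "[DONE]":
            break
        try:
            data = json.loads(chunk)
        except json.JSONDecodeError:
            continue  # skip invalid JSON instead of surfacing it in the UI
        choices = data.get("choices") or []
        content = choices[0].get("delta", {}).get("content") if choices else None
        if content is None:
            continue  # skip null-content chunks (previously rendered as "null")
        yield content

stream = [
    'data: {"choices": [{"delta": {"content": "Hello"}}]}',
    'data: {"choices": [{"delta": {"content": null}}]}',
    'data: not-json',
    'data: [DONE]',
]
print(list(align_generator(stream)))  # ['Hello']
```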

Issues

Helps address issues:

  • 1972
  • 1973

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)
  • Others (enhancement, documentation, validation, etc.)

Dependencies

No new third-party dependencies, but this uses the recently updated textgen component:
https://github.com/opea-project/GenAIComps/pull/1395

(The component was updated to filter null inputs; some OpenAI-standard null inputs are disallowed by OpenRouter.ai.)
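The kind of null filtering the linked component PR describes can be sketched as follows (the helper name is hypothetical, for illustration only):

```python
def drop_null_fields(payload: dict) -> dict:
    """Remove keys whose value is None; some providers (e.g. OpenRouter.ai)
    reject requests carrying explicit nulls for optional OpenAI fields."""
    return {k: v for k, v in payload.items() if v is not None}

request = {"model": "m", "temperature": None, "top_p": None, "max_tokens": 512}
print(drop_null_fields(request))  # {'model': 'm', 'max_tokens': 512}
```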

Tests

Tested on:

  • OpenRouter.ai: anthropic/claude-3.7-sonnet
  • Denvr: meta-llama/Llama-3.1-70B-Instruct

Copilot AI review requested due to automatic review settings June 10, 2025 16:46

github-actions bot commented Jun 10, 2025

Dependency Review

✅ No vulnerabilities or license issues found.

Scanned Files

None

Contributor

Copilot AI left a comment

Pull Request Overview

This PR enables ChatQnA to run against OpenAI-compatible endpoints via a new Docker Compose setup and enhances the core Python logic to better handle and log streaming responses.

  • Added compose_endpoint_openai.yaml for spinning up ChatQnA with OPEA text-generation service backed by OpenAI-like endpoints.
  • Updated chatqna.py to improve logging, skip null-content chunks in the generator, and bolster error handling.
  • Fixed align_generator to filter out invalid or null JSON and added debug/error logs.

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.

File Description
ChatQnA/docker_compose/intel/cpu/xeon/compose_endpoint_openai.yaml New Compose file replacing vLLM with an OpenAI-compatible endpoint and related services.
ChatQnA/chatqna.py Enhanced input/output alignment, added logging, and refactored align_generator.
Comments suppressed due to low confidence (4)

ChatQnA/docker_compose/intel/cpu/xeon/compose_endpoint_openai.yaml:98

  • [nitpick] Component name 'OpeaTextGenService' is inconsistent with other OPEA naming conventions; consider using uppercase 'OPEA' or aligning with existing style.
LLM_COMPONENT_NAME: OpeaTextGenService

ChatQnA/docker_compose/intel/cpu/xeon/compose_endpoint_openai.yaml:104

  • [nitpick] The comment says 'Add OpenRouter API Key' but the variable is 'OPENAI_API_KEY'; clarify the env var or comment to avoid confusion.
OPENAI_API_KEY: ${OPENAI_API_KEY} # Add OpenRouter API Key

ChatQnA/docker_compose/intel/cpu/xeon/compose_endpoint_openai.yaml:166

  • [nitpick] BACKEND_SERVICE_NAME 'chatqna' does not match the container name 'chatqna-xeon-backend-server'; consider aligning these to prevent misconfiguration.
- BACKEND_SERVICE_NAME=chatqna

ChatQnA/chatqna.py:189

  • The updated align_generator logic introduces new behaviors (skipping null content, error handling) but lacks corresponding unit tests; consider adding tests to cover these scenarios.
def align_generator(self, gen, **kwargs):

@edlee123 edlee123 force-pushed the chatqna_with_endpoints branch from 5b9dbb2 to be85abc Compare June 10, 2025 20:13
@edlee123 edlee123 force-pushed the chatqna_with_endpoints branch from be85abc to 2b7715e Compare June 10, 2025 20:29
@edlee123 edlee123 closed this Jun 10, 2025
@edlee123
Contributor Author

I accepted a Copilot commit, but GitHub wouldn't let me sign the commit after that. Will create a new PR.

