API request delay for Generic OpenAI embedding engine #4317

chaserhkj · 2025-08-20T20:56:52Z

Pull Request Type

Relevant Issues

resolves #3570

What is in this change?

This PR adds a new configurable environment variable that introduces a small delay before each API request is dispatched when using the generic OpenAI embedding engine.

This is meant to be used together with GENERIC_OPEN_AI_EMBEDDING_MAX_CONCURRENT_CHUNKS=1 and is mostly relevant to local LLM backends such as llama.cpp, whose API server implementation is naive and simply rejects requests with a 429 error if the underlying worker is busy.
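To illustrate the idea, here is a minimal sketch of how such a delay setting could be read and applied before a request. It is not the code from this PR: the variable name `EMBEDDING_API_DELAY_MS` and the helpers `readDelayMs`, `sleep`, and `sendWithDelay` are hypothetical, since the description above does not name the new variable.

```javascript
// Hypothetical variable name, for illustration only; the PR text does not state it.
const DELAY_ENV_KEY = "EMBEDDING_API_DELAY_MS";

// Parse the delay in milliseconds from an env-like object.
// Missing, non-numeric, or non-positive values fall back to 0 (no delay).
function readDelayMs(env = process.env) {
  const parsed = Number.parseInt(env[DELAY_ENV_KEY] ?? "", 10);
  return Number.isFinite(parsed) && parsed > 0 ? parsed : 0;
}

// Promise-based sleep helper.
function sleep(ms) {
  return new Promise((resolve) => setTimeout(resolve, ms));
}

// Wait for the configured delay, then dispatch a single request.
// `sendRequest` is any caller-supplied async function (an assumption here).
async function sendWithDelay(sendRequest, payload, delayMs) {
  if (delayMs > 0) await sleep(delayMs);
  return sendRequest(payload);
}
```

With concurrency limited to one chunk at a time, a delay like this gives a single-worker backend a moment to finish before the next request arrives, which is the 429 scenario described above.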

Developer Validations

I ran yarn lint from the root of the repo & committed changes
Relevant documentation has been updated
I have tested my code functionality
Docker build succeeds locally

I am skipping documentation for now since there is no change to the frontend UI, but I can add it to this PR if requested.


timothycarambat · 2025-09-18T03:53:30Z

The previous change here would start all the batches at once, so they all just waited at the same time instead of each batch running sequentially with a wait in between. The refactored code fixes this and has been tested.
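The difference described in this comment can be sketched as follows. This is an illustrative reconstruction, not the code from either commit: `embedAllAtOnce`, `embedSequentially`, and the `embedBatch` callback are hypothetical names.

```javascript
// Promise-based sleep helper.
function sleep(ms) {
  return new Promise((resolve) => setTimeout(resolve, ms));
}

// Problematic pattern: every batch starts its wait immediately, so all
// waits overlap and the requests still hit the server at about the same time.
async function embedAllAtOnce(batches, embedBatch, delayMs) {
  return Promise.all(
    batches.map(async (batch) => {
      await sleep(delayMs);
      return embedBatch(batch);
    })
  );
}

// Sequential pattern: one batch at a time, pausing between consecutive
// requests, so at most one request is ever in flight.
async function embedSequentially(batches, embedBatch, delayMs) {
  const results = [];
  for (let i = 0; i < batches.length; i++) {
    if (i > 0 && delayMs > 0) await sleep(delayMs);
    results.push(await embedBatch(batches[i]));
  }
  return results;
}
```

In the first function the delay only postpones the burst; in the second it spaces the requests out, which is what a backend that rejects requests while its worker is busy needs.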

chaserhkj added 2 commits September 12, 2025 18:57

Add ENV to configure api request delay for generic open ai embedding …

f45ee96

…engine

yarn lint formatting

d19048c

chaserhkj force-pushed the generic-openai-embedding-delay branch from 47ad24d to d19048c Compare September 12, 2025 22:58

timothycarambat added the PR:needs review Needs review by core team label Sep 16, 2025

refactor

9039843

Merge branch 'master' into generic-openai-embedding-delay

b7d888b

timothycarambat merged commit 226802d into Mintplex-Labs:master Sep 18, 2025
