Software Component Configuration

Configuration with Environment Variables
Milvus
Pgvector
Chain Server
RAG Playground

Configuration with Environment Variables

The following sections identify the environment variables that are used in the docker-compose.yaml files.

Milvus

Milvus is the default vector database server. You can configure Milvus using the following environment variable:

DOCKER_VOLUME_DIRECTORY: Specifies the location of the volume mount on the host for the vector database files. The default value is `./volumes/milvus` in the current working directory.

Pgvector

Pgvector is an alternative vector database server. You can configure pgvector using the following environment variables:

DOCKER_VOLUME_DIRECTORY: Specifies the location of the volume mount on the host for the vector database files. The default value is `./volumes/data` in the current working directory.
POSTGRES_PASSWORD: Specifies the password for authenticating to pgvector. The default value is `password`.
POSTGRES_USER: Specifies the user name for authenticating to pgvector. The default value is `postgres`.
POSTGRES_DB: Specifies the name of the database instance. The default value is `api`.

Chain Server

The chain server is the core component that interacts with the LLM Inference Server and the Milvus server to obtain responses. You can configure the server using the following environment variable:

APP_VECTORSTORE_URL: Specifies the URL of the vector database server.
APP_VECTORSTORE_NAME: Specifies the vendor name of the vector database. Values are `milvus` or `pgvector`.
COLLECTION_NAME: Specifies the example-specific collection in the vector database.
APP_LLM_SERVERURL: Specifies the URL of NVIDIA NIM for LLMs.
APP_LLM_MODELNAME: The model name used by NIM for LLMs.
APP_LLM_MODELENGINE: Specifies the backend name hosting the model. The only supported value is `nvidia-ai-endpoints` to use models hosted using NIM for LLMs in cloud based API Catalog or locally.
APP_RETRIEVER_TOPK: Number of relevant results to retrieve. The default value is `4`.
APP_RETRIEVER_SCORETHRESHOLD: The minimum confidence score for the retrieved values to be considered. The default value is `0.25`.
LOGLEVEL: Set the logging verbosity level for the logs printed by container. Chain server uses the standard python logging module. Possible values are NOTSET, DEBUG, INFO, WARN, ERROR, CRITICAL.

RAG Playground

The RAG playground component is the user interface web application that interacts with the chain server to retrieve responses and provide a user interface to upload documents. You can configure the server using the following environment variables:

APP_SERVERURL: Specifies the URL for the chain server.
APP_SERVERPORT: Specifies the network port number for the chain server.
APP_MODELNAME: Specifies the name of the large language model used in the deployment. This information is for display purposes only and does not affect the inference process.
RIVA_API_URI: Specifies the host name and port of the NVIDIA Riva server. This field is optional and provides automatic speech recognition (ASR) and text-to-speech (TTS) functionality.
RIVA_API_KEY: Specifies a key to access the Riva API. This field is optional.
RIVA_FUNCTION_ID: Specifies the function ID to access the Riva API. This field is optional.
TTS_SAMPLE_RATE: Specifies the sample rate in hertz (Hz). The default value is `48000`.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

configuration.md

configuration.md

Software Component Configuration

Configuration with Environment Variables

Milvus

Pgvector

Chain Server

RAG Playground

Files

configuration.md

Latest commit

History

configuration.md

File metadata and controls

Software Component Configuration

Configuration with Environment Variables

Milvus

Pgvector

Chain Server

RAG Playground