OpenAI API Reverse Proxy

A lightweight Dockerized reverse proxy for any OpenAI-compatible API endpoint with streaming response support. Designed for chain-of-thought language models that expose their reasoning through `<think>` tags, which the proxy filters out of responses.

Features

  • Transparent request forwarding to the configured target API
  • Streamed response handling
  • Automatic `<think>` tag removal from responses (see the sketch below)
  • Runtime LLM parameter overrides (temperature, top_k, etc.)
  • Docker-ready deployment with Gunicorn
  • JSON request/response handling
  • Detailed error reporting
  • Configurable target endpoint
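
Conceptually, the think-tag filtering deletes everything between `<think>` and `</think>` before the response reaches the client. A minimal non-streaming sketch in Python (illustrative only, not the proxy's actual implementation; the streamed case additionally needs to buffer tags that are split across chunks):

import re

# Remove complete <think>...</think> blocks, plus any trailing whitespace.
# Hypothetical sketch -- not the proxy's actual code.
THINK_RE = re.compile(r"<think>.*?</think>\s*", re.DOTALL)

def strip_think_tags(text: str) -> str:
    return THINK_RE.sub("", text)

# strip_think_tags("<think>reasoning...</think>The answer is 4.")
# -> "The answer is 4."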

Quick Start

# Build the Docker image
docker build -t openai-proxy .

# Run the container
docker run -p 5000:5000 openai-proxy

Testing with curl

Health Check

# Check proxy health and target URL connectivity
curl http://localhost:5000/health

Chat Completion (Streaming)

# Test streaming chat completion
curl http://localhost:5000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -d '{
    "model": "gpt-4",
    "messages": [{"role": "user", "content": "Hello!"}],
    "stream": true
  }'
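
The same streaming request can be made from Python. A short sketch using the requests library, assuming the upstream speaks the standard OpenAI server-sent-events format (think tags are already stripped by the time chunks arrive here):

import json
import os
import requests

resp = requests.post(
    "http://localhost:5000/v1/chat/completions",
    headers={"Authorization": f"Bearer {os.environ['OPENAI_API_KEY']}"},
    json={
        "model": "gpt-4",
        "messages": [{"role": "user", "content": "Hello!"}],
        "stream": True,
    },
    stream=True,
)
for line in resp.iter_lines():
    # SSE lines look like: data: {...json chunk...}
    if line.startswith(b"data: ") and line != b"data: [DONE]":
        chunk = json.loads(line[len(b"data: "):])
        print(chunk["choices"][0]["delta"].get("content", ""), end="")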

Chat Completion (Non-streaming)

# Test regular chat completion
curl http://localhost:5000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -d '{
    "model": "gpt-4",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Error Handling

The proxy provides detailed error responses for various scenarios:

# Test with invalid API key
curl http://localhost:5000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer invalid_key" \
  -d '{
    "model": "gpt-4",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

# Test with invalid JSON
curl http://localhost:5000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -d '{invalid json}'

Error responses include:

  • 400: Invalid JSON request
  • 502: Connection/forwarding errors
  • 503: Health check failures
  • 504: Connection timeouts
  • Original error codes from target API (401, 403, etc.)
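
Because proxy-generated failures (400 for malformed JSON, 502/503/504 for connectivity) are distinct from upstream errors passed through unchanged, a client can tell the two apart. A hedged sketch in Python:

import requests

resp = requests.post(
    "http://localhost:5000/v1/chat/completions",
    headers={"Authorization": "Bearer invalid_key"},
    json={"model": "gpt-4", "messages": [{"role": "user", "content": "Hello!"}]},
)
if resp.status_code in (502, 503, 504):
    print("Proxy/connectivity failure:", resp.status_code)  # generated by the proxy
elif resp.status_code >= 400:
    print("Upstream API error:", resp.status_code)          # e.g. 401/403 forwarded as-is
else:
    print(resp.json()["choices"][0]["message"]["content"])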

Test with Different Target

# Test with custom API endpoint
docker run -e TARGET_BASE_URL="http://your-api:8080/" -p 5000:5000 openai-proxy

curl http://localhost:5000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -d '{
    "model": "gpt-4",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Development Setup

# Create and activate a virtual environment
python -m venv venv
source venv/bin/activate

# Install dependencies and run the development server
pip install -r requirements.txt
python cot_proxy.py

Configuration

Environment Variables

  • TARGET_BASE_URL: Target API endpoint (default: https://api.openai.com/v1/)
  • DEBUG: Enable debug logging (default: false)
  • LLM_PARAMS: Comma-separated parameter overrides in the format key=value, with model-specific groups separated by semicolons (see the parsing sketch below). Example: model=gemma,temperature=0.5,top_k=40;model=llama2,max_tokens=200

Example with all options:

# Configure target endpoint and LLM parameters
export TARGET_BASE_URL="http://alternate-api.example.com/"
export LLM_PARAMS="model=gemma,temperature=0.5,top_k=40;model=llama2,max_tokens=200"
export DEBUG=true

# Docker usage example with debug logging:
docker run \
  -e TARGET_BASE_URL="http://alternate-api.example.com/" \
  -e LLM_PARAMS="model=gemma,temperature=0.5,top_k=40" \
  -e DEBUG=true \
  -p 5000:5000 openai-proxy
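
To make the LLM_PARAMS format concrete, here is a minimal parsing sketch in Python (illustrative only, not the proxy's actual code; values are kept as strings, numeric coercion omitted):

def parse_llm_params(raw: str) -> dict:
    """Parse semicolon-separated groups of comma-separated key=value pairs,
    keyed by each group's "model" entry."""
    overrides = {}
    for group in filter(None, raw.split(";")):
        params = dict(pair.split("=", 1) for pair in group.split(","))
        model = params.pop("model", None)
        if model:
            overrides[model] = params
    return overrides

# parse_llm_params("model=gemma,temperature=0.5,top_k=40;model=llama2,max_tokens=200")
# -> {"gemma": {"temperature": "0.5", "top_k": "40"}, "llama2": {"max_tokens": "200"}}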

Production Configuration

The service uses Gunicorn with the following settings:

  • 4 worker processes
  • 3000-second timeout for long-running requests
  • SSL verification enabled
  • Automatic error recovery
  • Health check endpoint for monitoring
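
Expressed as a Gunicorn configuration file, these settings would look roughly as follows (a sketch; the repository may instead pass them on the command line, and the app object name in cot_proxy.py is an assumption):

# gunicorn.conf.py -- launched e.g. with: gunicorn -c gunicorn.conf.py cot_proxy:app
workers = 4     # 4 worker processes
timeout = 3000  # seconds; generous limit for long-running LLM requests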

Dependencies

  • Python 3.9+
  • Flask
  • Requests
  • Gunicorn (production)
