OpenGuardrails

Hugging Face · Free Platform · Tech Report

OpenGuardrails

The only production-ready, fully open-source AI guardrails platform for enterprise data protection

OpenGuardrails is an open-source AI security gateway that prevents enterprise AI applications from leaking sensitive data to external LLM providers — without disrupting user workflows.

The Enterprise AI Data Leakage Problem

When employees use AI applications (Copilot, ChatGPT, internal AI agents), sensitive enterprise data flows to external LLM providers:

PII: Employee emails, customer phone numbers, government IDs
Credentials: API keys, database passwords, access tokens in code
Confidentials: Trade secrets, internal project names, financial data

The challenge: Blocking these requests breaks user workflows. Allowing them creates compliance and security risks.

OpenGuardrails solves this with intelligent data protection that keeps AI applications usable while keeping sensitive data secure.

How It Works

OpenGuardrails acts as an AI Gateway between your AI applications and LLM providers, automatically detecting and protecting sensitive data:

Strategy 1: Mask & Restore (Anonymization)

Sensitive data is masked before sending to external LLMs, then restored in the response — users see correct results, but external providers never see real data.

Flow:

User request contains sensitive data (e.g., thomas@openguardrails.com)
Gateway detects and masks: __email_1__@openguardrails.com
Sanitized request sent to external LLM
Response received with masked placeholders
Original values restored before returning to user

Strategy 2: Private Model Routing

For high-risk data, requests are automatically routed to your private/on-premise LLM — the original data never leaves your infrastructure.

Flow:

User request contains high-risk sensitive data
Gateway classifies risk level and applies policy
Request routed to private LLM (with original data intact)
Response returned to user seamlessly

Users experience zero disruption — they don't know their request was rerouted for security.

AI-Powered Sensitive Data Recognition

Unlike rule-based DLP systems, OpenGuardrails uses GenAI-powered recognition to detect sensitive data that regex patterns miss:

Natural Language Anonymization

Detects sensitive information expressed in natural language:

Input	Detection
"My employee ID is 12345"	`employee_id` detected
"Contact John at his personal cell"	`phone_number` context detected
"The Q3 revenue was $2.5M"	`financial_data` detected
"Our AWS secret key starts with AKIA..."	`cloud_credential` detected

Code Logic Anonymization

Detects credentials and secrets embedded in code:

# Input code snippet
db_password = "super_secret_123"
api_key = "sk-proj-abc123..."
conn_string = "postgresql://admin:password@internal-db:5432"

All three sensitive values are detected and masked, even without explicit patterns.

Recognition Methods

Method	Best For	Examples
GenAI Recognition	Context-dependent, natural language	Trade secrets, business confidentials, implicit references
Regex Patterns	Structured data with known formats	Credit cards, SSN, phone numbers, emails
Custom Keywords	Organization-specific terms	Project codenames, internal system names

Policy-Based Data Classification

Define what's sensitive according to your organization's policy, not just generic PII:

# Example: Enterprise Data Classification Policy
high_risk:
  - customer_pii
  - financial_records
  - source_code
  - api_credentials
  action: switch_private_model

medium_risk:
  - employee_emails
  - internal_project_names
  action: anonymize

low_risk:
  - general_business_info
  action: anonymize

Configurable actions per risk level:

block — Reject the request entirely
switch_private_model — Route to on-premise LLM
anonymize — Mask and restore sensitive data
pass — Allow with audit logging

Quick Start

Option 1: Try Online (Fastest)

https://www.openguardrails.com/platform/

Option 2: Python SDK

pip install openguardrails

from openguardrails import OpenGuardrails

client = OpenGuardrails("your-api-key")
result = client.check_prompt("Send report to john@company.com with Q3 revenue $2.5M")

print(result.detected_entities)
# [{"type": "email", "value": "john@company.com"},
#  {"type": "financial_data", "value": "$2.5M"}]
print(result.suggest_action)  # anonymize

Option 3: OpenAI-Compatible Gateway (Zero Code Change)

from openai import OpenAI

# Just change the base_url — all other code stays the same
client = OpenAI(
    base_url="http://localhost:5002/v1",
    api_key="sk-xxai-your-key"
)

# Automatic data protection — no code changes needed!
response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Summarize the contract for Acme Corp deal #12345"}]
)
# Sensitive data automatically masked/routed based on your policy

Enterprise Deployment

OpenGuardrails is designed for air-gapped and regulated environments:

Fully on-premise — No data leaves your infrastructure
Private cloud ready — Deploy on AWS, Azure, GCP VPC
Compliance friendly — SOC2, HIPAA, GDPR compatible architecture
High performance — <50ms latency overhead, 1000+ req/sec

Deployment Options

# Quick start with Docker
docker compose up -d

# Production deployment
curl -O https://raw.githubusercontent.com/openguardrails/openguardrails/main/docker-compose.prod.yml
docker compose -f docker-compose.prod.yml up -d

Integration Points

API Gateway — Nginx, Kong, AWS API Gateway
AI Platforms — Dify, n8n, LangChain, LlamaIndex
Direct Integration — SDK for Python, REST API

See Deployment Guide for detailed instructions.

Additional Security Features

Beyond data leakage prevention, OpenGuardrails provides:

Prompt Injection Detection — Block jailbreak attempts
Content Safety — 19 risk categories (violence, hate speech, etc.)
Policy Enforcement — Business rules beyond "unsafe"
Agent Protection — Pre/post tool-call validation
Audit Logging — Complete request/response history

Model

OpenGuardrails-Text-2510
- 3.3B parameters
- 119 languages
- Purpose-built for guardrails & sensitive data detection
- Download on HuggingFace

Documentation

Deployment Guide — Complete deployment instructions
Data Leakage Guide — DLP configuration details
Custom Scanners — Build your own detectors
API Reference — Complete API documentation
Architecture — System architecture & design
Enterprise PoC Guide — PoC deployment guide
Technical Report (arXiv)

Community & Support

Star us on GitHub if this project helps you
Contributions welcome — see Contributing Guide
Contact: thomas@openguardrails.com
Website: https://openguardrails.com
Issues: GitHub Issues

Citation

If you find our work helpful, feel free to give us a cite.

@misc{openguardrails,
      title={OpenGuardrails: A Configurable, Unified, and Scalable Guardrails Platform for Large Language Models},
      author={Thomas Wang and Haowen Li},
      year={2025},
      url={https://arxiv.org/abs/2510.19169},
}

Protect enterprise data in AI workflows — automatically.

Made with care by OpenGuardrails

Name		Name	Last commit message	Last commit date
Latest commit History 258 Commits
.github/workflows		.github/workflows
backend		backend
docs		docs
frontend		frontend
integrations		integrations
landing		landing
scripts		scripts
standards		standards
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
OpenGuardrailsTechReport.pdf		OpenGuardrailsTechReport.pdf
README.md		README.md
SECURITY.md		SECURITY.md
VERSION		VERSION
docker-compose.prod.yml		docker-compose.prod.yml
docker-compose.yml		docker-compose.yml
entrypoint.sh		entrypoint.sh
supervisord.conf		supervisord.conf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OpenGuardrails

The Enterprise AI Data Leakage Problem

How It Works

Strategy 1: Mask & Restore (Anonymization)

Strategy 2: Private Model Routing

AI-Powered Sensitive Data Recognition

Natural Language Anonymization

Code Logic Anonymization

Recognition Methods

Policy-Based Data Classification

Quick Start

Option 1: Try Online (Fastest)

Option 2: Python SDK

Option 3: OpenAI-Compatible Gateway (Zero Code Change)

Enterprise Deployment

Deployment Options

Integration Points

Additional Security Features

Model

Documentation

Community & Support

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

OpenGuardrails

The Enterprise AI Data Leakage Problem

How It Works

Strategy 1: Mask & Restore (Anonymization)

Strategy 2: Private Model Routing

AI-Powered Sensitive Data Recognition

Natural Language Anonymization

Code Logic Anonymization

Recognition Methods

Policy-Based Data Classification

Quick Start

Option 1: Try Online (Fastest)

Option 2: Python SDK

Option 3: OpenAI-Compatible Gateway (Zero Code Change)

Enterprise Deployment

Deployment Options

Integration Points

Additional Security Features

Model

Documentation

Community & Support

Citation

About

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages