Burgonet Gateway

AI enterprise gateway implemented in Rust 🦀 that provides secure, governed access to LLMs (Large Language Models) for organizations. It acts as a single entry point for employees and projects to access both cloud-based and self-hosted AI models while enforcing security, compliance, and usage controls. The software helps organizations manage AI governance by providing centralized control over model access, usage tracking, and security policies.

Features

🔑 Token Management: Generate, view, and delete API tokens
🎯 Quota Management: Set token quotas per user, group or project
📊 Usage Monitoring: Real-time usage tracking and analytics
🤖 Provider Management: Configure multiple LLM providers (OpenAI, Claude, DeepSeek, Ollama, etc.)
⏱️ Rate Limiting: Built-in rate limiting with configurable thresholds
📝 Audit Logs: Detailed logging of API requests and responses
🖥️ Embedded Web UI: Built-in admin interface for configuration and monitoring
🔒 PII Protection: Built-in Personally Identifiable Information detection and blocking
📈 Prometheus Metrics: Built-in Prometheus endpoint for monitoring and alerting
🔐 Trusted Header Authentication: Support for authentication via trusted HTTP headers
🚫 Content Filtering: Block requests containing blacklisted words (e.g. "confidential")
🚷 Group Access Control: Restrict access by user groups with disabled_groups configuration
📝 Audit Logs: Detailed logging of API requests and responses
💾 Audit Storage: Persistent storage of all gateway exchanges with configurable retention periods

Need another feature? Don't hesitate to send an email or create a GitHub ticket!

Getting Started

This guide will walk you through the process of setting up and using the Burgonet Gateway.

Installation

Download: Download the appropriate binary for your operating system from the releases page.
Configuration: Download the default configuration file conf.yml
Run: Execute the binary using the following command, replacing <path/to/conf.yml> with the actual path to your configuration file:

./burgonet-gw -c <path/to/conf.yml>

Accessing the Web UI

Once the gateway is running, you can access the administration web UI at: http://127.0.0.1:6189/

Screenshot

When you have created your token, you can test it via the convenience chat web app embedded, open the following URL in your browser:

http://127.0.0.1:6190/ configure the the server url (default port is 6191) and your token:

Chat Web UI

Use Cases

Explore common scenarios and configurations:

Departmental Access Control
Cost Management with Quotas
Preventing Data Leakage

Departmental Access Control

Let's consider an organization with three departments:

Department	Access Level	Restrictions
IT	All models	Quotas & rate limits
Finance	Self-hosted only	No external models
HR	All models	Unlimited budget

overview

Alice works in the IT Department
Bob works in the Finance Department
Charlie works in the Human Resources Department

The organization's governance rules are configured as follows:

The IT Department has access to all models, with quotas and rate limits
The Finance Department does not have access to external models only self-hosted.
The Human Resources Department has access to all models with unlimited budget

Restrict keywords

⚠️ Implemented, to be documented

Prevent leakage of personal idenfier & information leak

Note

⚠️ Implemented, to be documented

curl http://127.0.0.1:8080/ollama/gemma2/2b/ -i  \
-H "Authorization: Bearer your-token-here" \
-d '{
  "model": "gemma2:2b-instruct-q6_K",                                                                                                                        "messages": [
    {
      "role": "user",
      "content": "Hi my name is Jean-Claude Dusse"
    }
  ] ,
  "stream": false
}'
HTTP/1.1 403 Forbidden
Server: openresty/1.21.4.1
Date: Thu, 16 Jan 2025 13:01:06 GMT
Content-Type: text/plain
Transfer-Encoding: chunked
Connection: keep-alive

Request contains sensitive personal information

Cost Management with Quotas

Administrators can effectively control costs by configuring quotas for model usage and user access.

Key Features:

Per-Model Quotas: Set usage limits for individual models to manage resource allocation.

This granular approach allows administrators to optimize costs while maintaining flexibility and control over system resources.

models:
  - location: "/echo"
    model_name: "echo"
    proxy_pass: "http://localhost:9999"
    api_key: "$DEEPSEEK_API_KEY"
    parser: "ollama"
    disabled_groups: "mammals, birds"
    blacklist_words: "confidential, mycorp"
    pii_protection_url: "http://127.0.0.1:8001/check-pii-base64"
    quotas:
      - max_tokens:
          minute: 500
          hour: 6000
          day: 10000
          week: 40000
      - max_requests:
          second: 1
          minute: 15