Open Source Private AI MIT Licence

Supercharge Your Native Ollama.

Ollama Gateway: Supercharge your native Ollama with enterprise-grade API authentication, request auditing, and virtual model management—your secure, private AI gateway.

OllamaGateway Main screenshot
Enterprise Ready

Security and Auditing by Design.

Bearer Token Auth

Create multiple API keys for different users and applications, each with its own fine-grained permissions.

Clickhouse Auditing

Detailed auditing for every request and response, stored in high-performance Clickhouse storage for compliance.

Team Management

A built-in Role-Based Access Control (RBAC) system for managing your team's access to AI models.

Model Keep-alive

Periodically pings underlying models to ensure they stay loaded in memory for instant response times.

Default Model Support

Automatically redirect requests to a default virtual model if no model is specified in the API call.

Native Passthrough

Seamless support for native Ollama features including MCP tools, embedding, images, and stream mode.

OllamaGateway in Docker
Smart Proxy

Intelligent
Virtual Models

Create multiple aliases for your models with custom system prompts and parameter overrides.

Chat Models

Create virtual chat models with persistent system prompts.

Embedding Models

Dedicated management for embedding models for RAG applications.

Parameter Overrides

Override temperature, top_k, and other Ollama options per virtual model.

Smart Mapping
Alias -> Real
Validation
Auto Sync
Agent & Ecosystem Ready

Perfect for Agent Deployment

OllamaGateway has been officially tested and is fully compatible with popular ecosystem tools including Open-WebUI, Opencode, and Roocode. Its robust API translation makes it the ideal choice for deploying autonomous AI agents.

Recommended Models
  • qwen3.5:27b-q8_0
  • qwen3.5:35b-a3b-q4_K_M
Native vs. Gateway

Why choose OllamaGateway?

Native Ollama is great for personal use, but it lacks the enterprise features required for team collaboration and production deployment. OllamaGateway fills those gaps without changing your workflow.

Feature Comparison Native Ollama OllamaGateway
Model Hosting & Inference
Multimodal
MCP
Function call
Streaming
OpenAI API Translation
API Authentication (Bearer)
Multiple API Keys Management
Request & Response Auditing
Virtual Model Overrides
Default Model Support
Model Keep-alive (Ping)
Admin Management GUI
Chat/Embedding Segregation

Ready to secure your AI gateway?

Go to Dashboard View on GitHub