LLM Gateway

Overview

The LLM Gateway works as a standalone product or alongside your existing Aptible infrastructure. No apps or databases required. Sign up and start sending requests right away. See the pricing page for plan and usage details.

Aptible’s LLM Gateway gives you a single, compliant API for leading model providers, so your team can build securely with AI without having to configure directly with LLM providers. Through one endpoint and one key, you can:

Access 400+ models across Claude, GPT, Nova, Llama, Qwen, and more through a single API
Govern which models your team can use with environment-level model access policies
Control and monitor spend with organization-wide limits, plus per-environment and per-key cost visibility
Review request history and inspect full request and response logs to troubleshoot and refine prompts
Drain traces to Langfuse for deeper observability
Send PHI safely under Aptible’s HIPAA BAA, with automatic audit logging and encryption

Get Started

Create your first LLM key

Step-by-step guide covering key creation, initial setup, and the LLM Gateway’s core features.

Supported Models

The LLM Gateway supports 400+ models across Claude, GPT, Nova, Llama, Qwen, and more. See Supported Models for full details.

Claude

bedrock/anthropic Haiku, Sonnet, and Opus

GPT & o-series

openai GPT-4o, GPT-4.1, GPT-5, o3, o4

Nova, Llama, Qwen & more

bedrock Amazon Nova, Meta Llama, Qwen & more

Features

Security & Compliance

The LLM Gateway is designed so compliance is built in at the infrastructure layer: not something your team has to configure or maintain. You route requests through the gateway; the controls apply automatically.

Principle	How it works
Centrally managed credentials	Your raw provider API keys never leave Aptible. Developers and applications authenticate with scoped LLM keys. Rotate or revoke a key without touching any application code or provider settings.
Model access controls	Each environment has an allowlist of approved models. Any model not on the list is blocked for every key in that environment. Unapproved models are structurally unavailable, not just discouraged.
Full audit trail	Every request and response is logged automatically — prompt content, response, model, key, token counts, cost, and timestamp. No configuration required.
No PHI model training	Providers accessed through the gateway are contractually prohibited from using your data for model training. This is enforced at the infrastructure layer, not by relying on provider-side policy.
HIPAA BAA coverage	Aptible’s BAA covers all models and providers accessed through the gateway. You don’t need separate BAAs per provider, and you don’t need a dedicated stack to send PHI.
Spend controls	A hard monthly spending cap prevents runaway costs from a misconfigured agent or a compromised key.

Learn more about Security & Compliance →

Model Access Policies

Control which models your team can use by configuring model access policies at the environment level. Policies apply to all LLM keys within an environment, giving you centralized control over model usage across your applications and developers. Learn more about Model Access Policies →

Cost Visibility & Control

Set an organization-wide monthly spending limit and track LLM usage at the organization, environment, and per-key level so you always know what’s being spent and where. Learn more about Cost Visibility & Control →

Audit Logging

Every LLM request and response is automatically logged. View request history and full request and response logs in the Aptible dashboard for 7 days, and drain traces to Langfuse for long-term retention and deeper observability. Learn more about Audit Logging →

Support and Feedback

We’d love to hear from you as you use the LLM Gateway. If you have questions, run into issues, or have feature requests, contact us.

Getting Started

Platform

Managed AI