← Selected work
04

Model-agnostic AI gateway

An AI gateway on top of AWS Bedrock and Azure OpenAI, with direct Anthropic API access for newer Claude models. Around 8 product teams route through it for model selection, automatic failover, and centralized cost tracking across Claude, GPT-4, Llama, and Titan.

AWS BedrockAzure OpenAIAnthropic APICost trackingProvider failover
Product teams routing through
~8
Providers supported
3 primary, 4+ models

The problem

Each product team had picked a provider. Costs were everywhere and nowhere. When a provider degraded, the team that depended on it noticed in customer reports, not in dashboards.

The shape

One gateway with a unified request schema. Routes by use case, not by team preference: chat goes to Claude, structured extraction to whichever model is cheapest per token that week, multimodal to Vision-capable models. Automatic failover when a primary provider 5xx’s. Cost tracked per team, per use case, per request.

Key decisions

← All work Get in touch →