<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>Synthorai Engineering Blog</title><description>LLM gateway 工程笔记 — BYOK / prompt cache / 计费 / 可观测</description><link>https://blog.synthorai.io/</link><item><title>What is an LLM API Gateway? A Practical Introduction</title><link>https://blog.synthorai.io/blog/what-is-an-llm-api-gateway/</link><guid isPermaLink="true">https://blog.synthorai.io/blog/what-is-an-llm-api-gateway/</guid><description>A technical introduction to LLM API gateways — what they do, when you need one, and how they differ from traditional API gateways. Covers architecture, BYOK, billing, and cross-protocol translation.</description><pubDate>Tue, 12 May 2026 00:00:00 GMT</pubDate></item><item><title>Choosing an LLM Gateway in 2026: SaaS, Library, or Self-Hosted</title><link>https://blog.synthorai.io/blog/choosing-an-llm-gateway/</link><guid isPermaLink="true">https://blog.synthorai.io/blog/choosing-an-llm-gateway/</guid><description>The LLM gateway category has settled into three architectural shapes — managed SaaS, in-app library, and self-hosted service. A practical comparison of trade-offs, plus a decision matrix and migration paths.</description><pubDate>Tue, 12 May 2026 00:00:00 GMT</pubDate></item><item><title>BYOK Pricing for LLM Gateways: The Billing Invariants That Matter</title><link>https://blog.synthorai.io/blog/byok-billing-invariants/</link><guid isPermaLink="true">https://blog.synthorai.io/blog/byok-billing-invariants/</guid><description>BYOK looks like a feature checkbox but is the largest class of correctness bugs in production LLM gateways. Five billing invariants we enforce by test, with concrete examples of what breaks when each is violated.</description><pubDate>Tue, 12 May 2026 00:00:00 GMT</pubDate></item><item><title>Prompt Caching Across LLM Providers: What Translates and What Doesn&apos;t</title><link>https://blog.synthorai.io/blog/prompt-caching-across-providers/</link><guid isPermaLink="true">https://blog.synthorai.io/blog/prompt-caching-across-providers/</guid><description>Anthropic, OpenAI, and Gemini all support prompt caching with different APIs, TTLs, and pricing. A gateway has to translate between them — and some of the translation is lossy. The rules, the edge cases, and the tests that catch the bugs.</description><pubDate>Tue, 12 May 2026 00:00:00 GMT</pubDate></item></channel></rss>