
doteb Is Now SOC 2 Type II Compliant
doteb has successfully completed its SOC 2 Type II audit. Here's what that means, why it matters for teams routing LLM traffic through us, and how to request our report.
Product updates, tutorials, benchmarks, and announcements.

doteb has successfully completed its SOC 2 Type II audit. Here's what that means, why it matters for teams routing LLM traffic through us, and how to request our report.

Nine text-to-speech models from ElevenLabs, OpenAI, and Google are now one OpenAI-compatible API call away — plus a new Audio Studio in the Playground to compare voices and models side by side.

Our new LLM SDK lets your end-users buy credits inside your app and chat with any model — billed through doteb, with your markup as margin. Here's how it works and how to ship it in ~40 lines.

Looking for a Portkey alternative? A straightforward comparison of doteb and Portkey — features, pricing, deployment, and trade-offs — so you can pick the right AI gateway for your stack.

LLM orchestration is the layer that coordinates models, providers, and steps into one reliable workflow. A practical guide to the patterns, the tools, and when you need an LLM orchestrator.

All three ByteDance Seedance video generation models — 2.0, 2.0 Fast, and 1.5 Pro with native audio — are now live on doteb. Same API key. Same billing. Same dashboard.

Generate vectors for semantic search, clustering, and RAG through the same gateway you already use for chat. OpenAI-compatible, drop-in, and tracked alongside your model spend.

Calling OpenAI or Anthropic directly is the right first call. Here's the honest case for when a gateway starts paying for itself — and when you don't need one yet.

How LLM response caching actually works, where it helps, where it doesn't, and how to turn it on without rewriting your app.

A practical guide to forecasting LLM costs: the token formula, real-world examples across GPT-5.4, Claude, and Gemini, and a free calculator to run the numbers.

A straightforward comparison of doteb and LiteLLM — features, operational cost, and trade-offs — so you can pick the right one for your stack.

No config files, no env vars. OpenCode ships doteb as a first-class provider — select it, paste your key, and start coding with 210+ models.

A practical framework for picking the right model — based on task type, budget, latency requirements, and context window — instead of chasing benchmarks.

A deep dive into the routing, retry, and failover systems that keep doteb reliable when upstream providers go down.

A straightforward comparison of doteb and OpenRouter — features, pricing, and trade-offs — so you can pick the right one for your stack.

What LLM guardrails are, why they matter in production, and how to implement content safety without building it yourself.

An honest comparison of the top AI gateways — features, pricing, and trade-offs — so you can pick the right one for your stack.

doteb offers exclusive discounts on DeepSeek V3.2, V3.1, and R1 through partner providers — up to 30% off base pricing, applied automatically to every request.

We compared DeepSeek V3.2 pricing across every major API provider. Here's the definitive ranking — and how our Token Cost Calculator can help you estimate exact savings.

Three months of updates: video generation, Image Studio, sessions, GPT-5.4 family, enterprise guardrails, 5+ new providers, and much more.

A practical breakdown of how intelligent routing, caching, and model selection through an LLM gateway can dramatically reduce your AI infrastructure costs.

Side-by-side pricing comparison of GPT-5, Claude Opus 4.6, and Gemini 2.5 Pro with real cost calculations for production workloads.

doteb partners with Alibaba Cloud to bring you 20% off 26 Qwen AI models — including Qwen3 Max, Qwen3 Coder, QwQ reasoning, vision-language, and image generation models.

A step-by-step guide to making your first LLM API request through doteb — from signup to seeing results in your dashboard.

Why building directly against a single LLM provider's API is riskier than you think, and how a gateway layer protects your AI investment.

What an LLM gateway does, why it matters, and how it lets you ship AI features faster by abstracting away provider complexity.

Learn why simple LLM proxies aren't enough and how a unified AI gateway delivers centralized access control, cost visibility, compliance, and security.

Learn what a doteb is, why you need one, and how it simplifies integrating, managing, and deploying large language models in production.

How we deploy our Next.js apps on Google Cloud Platform without relying on Vercel.

Three months of updates: 15+ new models, team management, referral program, tiered pricing, data retention, new providers, and much more.

Use the CYBERMONDAY promo code to get 50% off credits for a limited time.

Use the BLACKFRIDAY promo code to get 50% off credits for a limited time.

Compare GPT-5, Claude, Gemini, and 180+ other models side by side. Generate images, test prompts, and find the best model for your use case.

Use GPT-5, Gemini, or any model with Claude Code. Three environment variables, zero code changes.

Connect your internal LLM deployments or any OpenAI-compatible API to doteb—and get the same analytics, caching, and routing.

Run doteb on your own infrastructure in under 5 minutes. Full control, zero platform fees.

One API for 180+ models across 60+ providers. Route requests, track costs, and switch models without changing your code.