One key. Every model. Anywhere. A curated, actively maintained list of AI API relay / proxy stations ("中轉站"), global LLM gateways, and the tools to evaluate them — with an evidence-based look at why this market exists and where it's risky.
Languages: English · 繁體中文 · 简体中文
Last reviewed: 2026-05-26 · Maintained by @howardpen9 · Contributions welcome — see CONTRIBUTING.md
- What is an AI API relay?
- Why this exists (globally)
- China / Asia relay stations
- Global gateways & aggregators
- Self-hosted alternatives
- Comparison & monitoring tools
- How to choose one safely
- Canary prompts (detect silent downgrade)
- Risks (read this)
- Market context
- FAQ
- Contributing
An AI API relay (中轉站) is a forwarding endpoint that sits between your code and official LLM APIs. You point
base_urlat the relay and use its key to call OpenAI / Claude / Gemini and dozens more models — one key, local payment (Alipay/WeChat), lower latency, typically 30–90% cheaper.
An AI API relay (Chinese: API 中轉站; also "AI API proxy" / "relay
station") is a middleman endpoint. Instead of calling OpenAI, Anthropic,
Google, etc. directly, you point your base_url at the relay and use its key.
The relay forwards the request upstream and returns the result — usually with
full OpenAI API compatibility, so existing code needs almost no changes.
One key → dozens to hundreds of models across vendors.
This differs from a mirror site (镜像站), which is a proxied chat web UI for end users. A relay is for developers and applications.
Relays are not a China-only phenomenon. They are the product of two constraints — access blocking and payment friction — wherever those appear:
| Region | Access barrier | Payment barrier |
|---|---|---|
| China | Cross-border latency; some APIs blocked | Needs foreign card; relays take Alipay/WeChat |
| Russia / Belarus / Iran | OpenAI hard-blocks (even VPN detected) | Sanctioned cards (e.g. Iran Shetab) rejected |
| APAC incl. Taiwan | Generally reachable | Card rejection, quota limits, billing friction |
| Sanctioned / restricted markets | Account denial | No accepted payment rail |
The same need produces different ecosystems: Russia leans on IP proxies and state alternatives (e.g. GigaChat); China evolved a full relay retail industry. The global, "clean" version of this is the authorized aggregator (see OpenRouter below) — proof that unified LLM access is a real, growing market, not a workaround niche.
Status:
active= independently reachable ·unverified= community-sourced, not independently confirmed. Last verified = date the maintainer last visited the site; absent on unverified entries. Prices change constantly — verify on-site. Generated fromdata/providers.yaml.
| Station | Type | Payment | Status | Last verified | Entity | Notes |
|---|---|---|---|---|---|---|
| 云雾 API (YUNWU) | mixed | Alipay/WeChat | active | 2026-05-26 | unknown | Marketed on speed/stability; widely cited top-tier station. |
| 柏拉图 AI (bltcy) | mixed | Alipay/WeChat | active | 2026-05-26 | unknown | Azure-backed channel; positions as lowest-price. |
| No.1-API | aggregator | Alipay/WeChat | active | 2026-05-26 | unknown | One-stop aggregation + relay platform. |
| UiUiAPI | official-relay | Alipay/WeChat | active | 2026-05-26 | unknown | Claims official channels + official multipliers; ~49% cheaper (claimed), 300+ models. |
| DMXAPI | mixed | Alipay/WeChat | unverified | — | unknown | Community-listed; homepage not independently verified. |
| MKEAI | mixed | Alipay/WeChat | unverified | — | unknown | Community-forum + relay hybrid; pushes DeepSeek. |
| GPTGOD | reverse | Alipay | unverified | — | unknown | Reverse-engineered; cheap, stability not guaranteed. |
| CloseAI | official-relay | Alipay/WeChat/Invoice | active | 2026-05-26 | registered | Issues enterprise invoices; self-describes as largest enterprise-grade relay in Asia. |
Inclusion ≠ endorsement. Listing documents the market. Always run the evaluation checklist before sending money or data.
| Service | Type | Payment | Notes |
|---|---|---|---|
| OpenRouter | aggregator | Card/Crypto | Official-authorized routing, ~5% markup; 400+ models, 60+ providers. ARR reportedly ~$5M (2025-05) → ~$50M (early 2026). |
| LiteLLM | gateway-oss | — | Open-source gateway (100+ providers) + enterprise tier. Self-hosted, bring your own keys. |
| Helicone | observability | — | LLM observability gateway; logging/cost analytics. |
| AIMLAPI | aggregator | Card/Crypto | 400+ models, prepaid from $20; crypto support implies payment-friction workaround. |
Not relays — self-hosted gateways are the credible alternative when you don't want a third party in the request path. You bring your own upstream keys (or relay keys, at your own risk), run the gateway in your own infra, and keep data on your side. Most Chinese relay stations are themselves built on these templates.
| Project | Type | Notes |
|---|---|---|
| One-API | gateway-oss | Popular Go-based multi-vendor gateway; the de-facto OSS template behind many relay stations. |
| new-api | gateway-oss | Fork of One-API with extra channel types; same self-hosted model — you supply keys. |
| LiteLLM | gateway-oss | Listed above. Python-first, 100+ providers, used by enterprises. |
When to choose self-hosted over a relay: regulated data, production workloads, you already have foreign-card billing, or you want auditable logs. You give up the relay's Alipay/WeChat convenience and accept ops overhead.
| Tool | Notes |
|---|---|
| 中轉站競技場 (AI API PK) | Price wall across OpenAI / Reverse / Claude / DeepSeek for ~40 stations. |
| awesome-ai-proxy (mn-api) | The original list (~31 stations). Unmaintained as of 2026 — this repo aims to continue the effort. |
See docs/evaluation.md for the full framework. Short version:
- Identify the channel type —
official-relay>mixed>aggregator>reverse. - Run a canary test — known-hard prompts, official vs relay, weekly. Detects silent model downgrade ("降智"). Reproducible prompt set: docs/canary-prompts.md.
- Top up small — relays are prepaid; never prepay large balances.
- Match workload — sensitive/regulated data → official API or self-hosted
gateway-oss, neverreverse.
The single most effective defense against a relay quietly substituting a cheaper model is a small set of reproducible canary prompts you re-run against both the official API and the relay on a weekly cadence.
The full prompt set, expected baseline behavior per family (GPT-4 class, Claude Opus class, Gemini 2.5 Pro class), and a pass/fail rubric live in docs/canary-prompts.md. Use it as-is or fork it.
The market is largely unregulated. From public research:
- A study of 28 relays found 45.83% of endpoints served a model that didn't match the one requested (up to 47% performance gap on medical tasks).
- An audit of 428 stations found 9 injecting malicious code and 1 stealing funds outright.
- Of 17 top stations surveyed, 15 had no registered company.
Full breakdown with sources and mitigations: docs/risks.md.
- The original community list (mn-api/awesome-ai-proxy) catalogued ~31 stations before going stale; aggregators like aiapipk.com track ~40 — and these are the visible tip only.
- The "clean" global analogue, OpenRouter, reportedly grew ARR ~10x in under a year (≈$5M → ≈$50M), routing trillions of tokens monthly — strong evidence that unified, multi-vendor LLM access is a durable market, not a hack.
- Conclusion: relay stations are a global response to blocking + payment friction. China's ecosystem is the deepest because both pains peak there.
An AI API relay is a forwarding endpoint between your code and official LLM
APIs (OpenAI, Anthropic, Google). You change base_url to the relay and use
its key; it forwards upstream and returns the result, usually with full
OpenAI-compatible formatting. One key gives access to dozens–hundreds of
models across vendors.
Treat every relay as an untrusted intermediary. It can see every prompt, response, and any data you send — a hostile operator can log, modify, or exfiltrate it. An audit of 428 stations found 9 injecting malicious code and 1 stealing funds. Never send secrets or regulated data through a relay; for sensitive workloads use the official API or a self-hosted gateway.
Three reasons: wholesale upstream pricing, mixed channels (e.g. Azure quotas),
and — for the cheapest tier — reverse-engineered access to vendors' web
clients. Prices 1/10 to 1/50 of official usually mean a reverse channel or
silent model downgrade, not a genuine bargain.
official-relay forwards real official keys (purest, most stable, safest for
production). mixed blends official with other channels. reverse is
reverse-engineered from web clients (cheapest, least stable, highest ToS
risk). Trust order: official-relay > mixed > aggregator > reverse.
Run a canary test: keep 3–5 known-hard prompts, record the official API's baseline, then run them against the relay weekly and compare reasoning depth and format adherence. A study of 28 relays found 45.83% of endpoints served a model that didn't match the request — degradation often appears after you top up.
Official API: most reliable, needs a foreign card. OpenRouter and similar global aggregators: officially authorized, ~5% markup, card/crypto — the "clean" choice if you can pay. China/Asia relays: cheapest and accept Alipay/WeChat, but variable quality and operator risk. Use official or a self-hosted gateway for production or sensitive data.
A mirror site is a proxied chat web UI for end users (a re-hosted ChatGPT-like page). A relay is a programmatic API endpoint for developers and applications. This list only covers relays and gateways.
No. Relays appear wherever there is access blocking + payment friction. China has the deepest ecosystem; Russia/Belarus/Iran face hard blocks (even VPNs are detected) and sanctioned-card rejection; APAC including Taiwan mostly has access but hits card rejection and billing friction. It is a global response, not a China-only phenomenon.
There is no single "best" relay that is safe to recommend by name — Anthropic's terms are stricter than OpenAI's, Claude keys get rotated and banned more aggressively, and relays that look stable today can degrade overnight. Pragmatic guidance:
- For production Claude usage: use the official Anthropic API or a registered global aggregator like OpenRouter (official-authorized, ~5% markup, supports Claude).
- For experimentation: prefer
official-relaytype entries above; verify the station actually returns the real model via the canary prompts in docs/canary-prompts.md before topping up. - Treat any relay claiming "Claude Opus at 1/10 price" as a
reversechannel by default — that price implies web-client reverse-engineering or silent downgrade.
Empirically, exit scams share a pattern. Red flags, roughly in order of severity:
- No registered company / ICP filing — 15 of 17 surveyed top stations had none.
- Aggressive prepaid bonuses ("recharge ¥100 get ¥150") to attract balances before disappearing.
- Prices below physical cost — 1/10 to 1/50 of official; the math has to come from somewhere.
- Canary regressions — quality drops on prompts that previously passed (preceded ~70% of documented exits).
- Slow / vanishing support, broken billing pages, expired ICP records, social channels going quiet.
Mitigation: top up small amounts only; never prepay large balances; subscribe to relay-tracking channels (V2EX, the risks doc).
Use a self-hosted gateway when any of these apply: regulated or customer data, production workloads, you already have foreign-card billing, or you need auditable logs. You keep keys in your own infra; nothing transits an unaccountable third party.
Use a relay only for: low-stakes experimentation, learning, hobby projects, or when you genuinely cannot get a foreign card. Treat anything you send through it as public. See Self-hosted alternatives.
Marginal. Network access is generally fine, so the only real wins are (a) Alipay/WeChat payment vs foreign credit card, and (b) one key across multiple vendors. The trade-off — your prompts transit a Chinese-jurisdiction third party with no contract — is steep.
For most TW/HK/SEA developers, the better stack is:
- Official APIs where you have card billing,
- OpenRouter for the multi-vendor convenience, and
- A self-hosted One-API or LiteLLM instance if you really need a unified key.
Relays make sense mainly when you have no card option at all.
This list is actively maintained and community-driven. Add or correct a station, flag one that ran away, or contribute a documented risk case:
- Read CONTRIBUTING.md and the field schema.
- Edit
data/providers.yaml— not the README tables directly. - Open an issue with the provider template for additions.
We do not accept referral links or marketing copy. Factual, dated, source-backed entries only.

This repository is an informational catalogue. It is not an endorsement of any service. Many relays operate in legal grey areas and may violate upstream providers' Terms of Service. Using a relay can expose your prompts, code, and data to an unaccountable third party. Evaluate risk yourself and comply with all applicable laws and contracts.
