Awesome AI API Proxy

One key. Every model. Anywhere. A curated, actively maintained list of AI API relay / proxy stations ("中轉站"), global LLM gateways, and the tools to evaluate them — with an evidence-based look at why this market exists and where it's risky.

Languages: English · 繁體中文 · 简体中文

Last reviewed: 2026-05-26 · Maintained by @howardpen9 · Contributions welcome — see CONTRIBUTING.md

What is an AI API relay?

An AI API relay (中轉站) is a forwarding endpoint that sits between your code and official LLM APIs. You point base_url at the relay and use its key to call OpenAI / Claude / Gemini and dozens more models — one key, local payment (Alipay/WeChat), lower latency, typically 30–90% cheaper.

An AI API relay (Chinese: API 中轉站; also "AI API proxy" / "relay station") is a middleman endpoint. Instead of calling OpenAI, Anthropic, Google, etc. directly, you point your base_url at the relay and use its key. The relay forwards the request upstream and returns the result — usually with full OpenAI API compatibility, so existing code needs almost no changes.

One key → dozens to hundreds of models across vendors.

This differs from a mirror site (镜像站), which is a proxied chat web UI for end users. A relay is for developers and applications.

Why this exists (globally)

Relays are not a China-only phenomenon. They are the product of two constraints — access blocking and payment friction — wherever those appear:

Region	Access barrier	Payment barrier
China	Cross-border latency; some APIs blocked	Needs foreign card; relays take Alipay/WeChat
Russia / Belarus / Iran	OpenAI hard-blocks (even VPN detected)	Sanctioned cards (e.g. Iran Shetab) rejected
APAC incl. Taiwan	Generally reachable	Card rejection, quota limits, billing friction
Sanctioned / restricted markets	Account denial	No accepted payment rail

The same need produces different ecosystems: Russia leans on IP proxies and state alternatives (e.g. GigaChat); China evolved a full relay retail industry. The global, "clean" version of this is the authorized aggregator (see OpenRouter below) — proof that unified LLM access is a real, growing market, not a workaround niche.

China / Asia relay stations

Status: active = independently reachable · unverified = community-sourced, not independently confirmed. Last verified = date the maintainer last visited the site; absent on unverified entries. Prices change constantly — verify on-site. Generated from data/providers.yaml.

Station	Type	Payment	Status	Last verified	Entity	Notes
云雾 API (YUNWU)	mixed	Alipay/WeChat	active	2026-05-26	unknown	Marketed on speed/stability; widely cited top-tier station.
柏拉图 AI (bltcy)	mixed	Alipay/WeChat	active	2026-05-26	unknown	Azure-backed channel; positions as lowest-price.
No.1-API	aggregator	Alipay/WeChat	active	2026-05-26	unknown	One-stop aggregation + relay platform.
UiUiAPI	official-relay	Alipay/WeChat	active	2026-05-26	unknown	Claims official channels + official multipliers; ~49% cheaper (claimed), 300+ models.
DMXAPI	mixed	Alipay/WeChat	unverified	—	unknown	Community-listed; homepage not independently verified.
MKEAI	mixed	Alipay/WeChat	unverified	—	unknown	Community-forum + relay hybrid; pushes DeepSeek.
GPTGOD	reverse	Alipay	unverified	—	unknown	Reverse-engineered; cheap, stability not guaranteed.
CloseAI	official-relay	Alipay/WeChat/Invoice	active	2026-05-26	registered	Issues enterprise invoices; self-describes as largest enterprise-grade relay in Asia.

Inclusion ≠ endorsement. Listing documents the market. Always run the evaluation checklist before sending money or data.

Global gateways & aggregators

Service	Type	Payment	Notes
OpenRouter	aggregator	Card/Crypto	Official-authorized routing, ~5% markup; 400+ models, 60+ providers. ARR reportedly ~$5M (2025-05) → ~$50M (early 2026).
LiteLLM	gateway-oss	—	Open-source gateway (100+ providers) + enterprise tier. Self-hosted, bring your own keys.
Helicone	observability	—	LLM observability gateway; logging/cost analytics.
AIMLAPI	aggregator	Card/Crypto	400+ models, prepaid from $20; crypto support implies payment-friction workaround.

Self-hosted alternatives

Not relays — self-hosted gateways are the credible alternative when you don't want a third party in the request path. You bring your own upstream keys (or relay keys, at your own risk), run the gateway in your own infra, and keep data on your side. Most Chinese relay stations are themselves built on these templates.

Project	Type	Notes
One-API	gateway-oss	Popular Go-based multi-vendor gateway; the de-facto OSS template behind many relay stations.
new-api	gateway-oss	Fork of One-API with extra channel types; same self-hosted model — you supply keys.
LiteLLM	gateway-oss	Listed above. Python-first, 100+ providers, used by enterprises.

When to choose self-hosted over a relay: regulated data, production workloads, you already have foreign-card billing, or you want auditable logs. You give up the relay's Alipay/WeChat convenience and accept ops overhead.

Comparison & monitoring tools

Tool	Notes
中轉站競技場 (AI API PK)	Price wall across OpenAI / Reverse / Claude / DeepSeek for ~40 stations.
awesome-ai-proxy (mn-api)	The original list (~31 stations). Unmaintained as of 2026 — this repo aims to continue the effort.

How to choose one safely

See docs/evaluation.md for the full framework. Short version:

Identify the channel type — official-relay > mixed > aggregator > reverse.
Run a canary test — known-hard prompts, official vs relay, weekly. Detects silent model downgrade ("降智"). Reproducible prompt set: docs/canary-prompts.md.
Top up small — relays are prepaid; never prepay large balances.
Match workload — sensitive/regulated data → official API or self-hosted gateway-oss, never reverse.

Canary prompts (detect silent downgrade)

The single most effective defense against a relay quietly substituting a cheaper model is a small set of reproducible canary prompts you re-run against both the official API and the relay on a weekly cadence.

The full prompt set, expected baseline behavior per family (GPT-4 class, Claude Opus class, Gemini 2.5 Pro class), and a pass/fail rubric live in docs/canary-prompts.md. Use it as-is or fork it.

Risks (read this)

The market is largely unregulated. From public research:

A study of 28 relays found 45.83% of endpoints served a model that didn't match the one requested (up to 47% performance gap on medical tasks).
An audit of 428 stations found 9 injecting malicious code and 1 stealing funds outright.
Of 17 top stations surveyed, 15 had no registered company.

Full breakdown with sources and mitigations: docs/risks.md.

Market context

The original community list (mn-api/awesome-ai-proxy) catalogued ~31 stations before going stale; aggregators like aiapipk.com track ~40 — and these are the visible tip only.
The "clean" global analogue, OpenRouter, reportedly grew ARR ~10x in under a year (≈$5M → ≈$50M), routing trillions of tokens monthly — strong evidence that unified, multi-vendor LLM access is a durable market, not a hack.
Conclusion: relay stations are a global response to blocking + payment friction. China's ecosystem is the deepest because both pains peak there.

FAQ

What is an AI API relay (中轉站)?

An AI API relay is a forwarding endpoint between your code and official LLM APIs (OpenAI, Anthropic, Google). You change base_url to the relay and use its key; it forwards upstream and returns the result, usually with full OpenAI-compatible formatting. One key gives access to dozens–hundreds of models across vendors.

Is an API relay safe? Can they steal my key or read my prompts?

Treat every relay as an untrusted intermediary. It can see every prompt, response, and any data you send — a hostile operator can log, modify, or exfiltrate it. An audit of 428 stations found 9 injecting malicious code and 1 stealing funds. Never send secrets or regulated data through a relay; for sensitive workloads use the official API or a self-hosted gateway.

Why are relays so much cheaper than official APIs?

Three reasons: wholesale upstream pricing, mixed channels (e.g. Azure quotas), and — for the cheapest tier — reverse-engineered access to vendors' web clients. Prices 1/10 to 1/50 of official usually mean a reverse channel or silent model downgrade, not a genuine bargain.

What's the difference between 官轉 (official-relay), 混合 (mixed) and 逆向 (reverse)?

official-relay forwards real official keys (purest, most stable, safest for production). mixed blends official with other channels. reverse is reverse-engineered from web clients (cheapest, least stable, highest ToS risk). Trust order: official-relay > mixed > aggregator > reverse.

How do I detect a relay silently swapping the model (降智)?

Run a canary test: keep 3–5 known-hard prompts, record the official API's baseline, then run them against the relay weekly and compare reasoning depth and format adherence. A study of 28 relays found 45.83% of endpoints served a model that didn't match the request — degradation often appears after you top up.

Relay vs OpenRouter vs official API — which should I use?

Official API: most reliable, needs a foreign card. OpenRouter and similar global aggregators: officially authorized, ~5% markup, card/crypto — the "clean" choice if you can pay. China/Asia relays: cheapest and accept Alipay/WeChat, but variable quality and operator risk. Use official or a self-hosted gateway for production or sensitive data.

What's the difference between a relay (中轉站) and a mirror site (镜像站)?

A mirror site is a proxied chat web UI for end users (a re-hosted ChatGPT-like page). A relay is a programmatic API endpoint for developers and applications. This list only covers relays and gateways.

Which countries need API relays? Is this only a China thing?

No. Relays appear wherever there is access blocking + payment friction. China has the deepest ecosystem; Russia/Belarus/Iran face hard blocks (even VPNs are detected) and sanctioned-card rejection; APAC including Taiwan mostly has access but hits card rejection and billing friction. It is a global response, not a China-only phenomenon.

Which relay is the best for Claude API (Sonnet / Opus)?

There is no single "best" relay that is safe to recommend by name — Anthropic's terms are stricter than OpenAI's, Claude keys get rotated and banned more aggressively, and relays that look stable today can degrade overnight. Pragmatic guidance:

For production Claude usage: use the official Anthropic API or a registered global aggregator like OpenRouter (official-authorized, ~5% markup, supports Claude).
For experimentation: prefer official-relay type entries above; verify the station actually returns the real model via the canary prompts in docs/canary-prompts.md before topping up.
Treat any relay claiming "Claude Opus at 1/10 price" as a reverse channel by default — that price implies web-client reverse-engineering or silent downgrade.

How do I tell if a relay station is about to run away (跑路)?

Empirically, exit scams share a pattern. Red flags, roughly in order of severity:

No registered company / ICP filing — 15 of 17 surveyed top stations had none.
Aggressive prepaid bonuses ("recharge ¥100 get ¥150") to attract balances before disappearing.
Prices below physical cost — 1/10 to 1/50 of official; the math has to come from somewhere.
Canary regressions — quality drops on prompts that previously passed (preceded ~70% of documented exits).
Slow / vanishing support, broken billing pages, expired ICP records, social channels going quiet.

Mitigation: top up small amounts only; never prepay large balances; subscribe to relay-tracking channels (V2EX, the risks doc).

Should I use a relay or self-host One-API / LiteLLM?

Use a self-hosted gateway when any of these apply: regulated or customer data, production workloads, you already have foreign-card billing, or you need auditable logs. You keep keys in your own infra; nothing transits an unaccountable third party.

Use a relay only for: low-stakes experimentation, learning, hobby projects, or when you genuinely cannot get a foreign card. Treat anything you send through it as public. See Self-hosted alternatives.

Are relays worth it for users in Taiwan / Hong Kong / Southeast Asia?

Marginal. Network access is generally fine, so the only real wins are (a) Alipay/WeChat payment vs foreign credit card, and (b) one key across multiple vendors. The trade-off — your prompts transit a Chinese-jurisdiction third party with no contract — is steep.

For most TW/HK/SEA developers, the better stack is:

Official APIs where you have card billing,
OpenRouter for the multi-vendor convenience, and
A self-hosted One-API or LiteLLM instance if you really need a unified key.

Relays make sense mainly when you have no card option at all.

Contributing

This list is actively maintained and community-driven. Add or correct a station, flag one that ran away, or contribute a documented risk case:

Read CONTRIBUTING.md and the field schema.
Edit data/providers.yaml — not the README tables directly.
Open an issue with the provider template for additions.

We do not accept referral links or marketing copy. Factual, dated, source-backed entries only.

License

— To the extent possible under law, contributors have waived all copyright and related rights to this work (CC0 1.0).

Disclaimer

This repository is an informational catalogue. It is not an endorsement of any service. Many relays operate in legal grey areas and may violate upstream providers' Terms of Service. Using a relay can expose your prompts, code, and data to an unaccountable third party. Evaluate risk yourself and comply with all applicable laws and contracts.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.github		.github
assets		assets
data		data
docs		docs
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
README.zh-CN.md		README.zh-CN.md
README.zh-TW.md		README.zh-TW.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Awesome AI API Proxy

Contents

What is an AI API relay?

Why this exists (globally)

China / Asia relay stations

Global gateways & aggregators

Self-hosted alternatives

Comparison & monitoring tools

How to choose one safely

Canary prompts (detect silent downgrade)

Risks (read this)

Market context

FAQ

What is an AI API relay (中轉站)?

Is an API relay safe? Can they steal my key or read my prompts?

Why are relays so much cheaper than official APIs?

What's the difference between 官轉 (official-relay), 混合 (mixed) and 逆向 (reverse)?

How do I detect a relay silently swapping the model (降智)?

Relay vs OpenRouter vs official API — which should I use?

What's the difference between a relay (中轉站) and a mirror site (镜像站)?

Which countries need API relays? Is this only a China thing?

Which relay is the best for Claude API (Sonnet / Opus)?

How do I tell if a relay station is about to run away (跑路)?

Should I use a relay or self-host One-API / LiteLLM?

Are relays worth it for users in Taiwan / Hong Kong / Southeast Asia?

Contributing

License

Disclaimer

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Awesome AI API Proxy

Contents

What is an AI API relay?

Why this exists (globally)

China / Asia relay stations

Global gateways & aggregators

Self-hosted alternatives

Comparison & monitoring tools

How to choose one safely

Canary prompts (detect silent downgrade)

Risks (read this)

Market context

FAQ

What is an AI API relay (中轉站)?

Is an API relay safe? Can they steal my key or read my prompts?

Why are relays so much cheaper than official APIs?

What's the difference between 官轉 (official-relay), 混合 (mixed) and 逆向 (reverse)?

How do I detect a relay silently swapping the model (降智)?

Relay vs OpenRouter vs official API — which should I use?

What's the difference between a relay (中轉站) and a mirror site (镜像站)?

Which countries need API relays? Is this only a China thing?

Which relay is the best for Claude API (Sonnet / Opus)?

How do I tell if a relay station is about to run away (跑路)?

Should I use a relay or self-host One-API / LiteLLM?

Are relays worth it for users in Taiwan / Hong Kong / Southeast Asia?

Contributing

License

Disclaimer

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages