Skip to content

howardpen9/awesome-ai-api-proxy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Awesome AI API Proxy Awesome

Awesome AI API Proxy — a curated map of AI API relay stations (中轉站), LLM gateways, and OpenAI / Anthropic Claude / Google Gemini / Meta Llama proxy services for developers in China, Taiwan, Southeast Asia, Russia, and the Middle East

One key. Every model. Anywhere. A curated, actively maintained list of AI API relay / proxy stations ("中轉站"), global LLM gateways, and the tools to evaluate them — with an evidence-based look at why this market exists and where it's risky.

Languages: English · 繁體中文 · 简体中文

Last reviewed: 2026-05-26 · Maintained by @howardpen9 · Contributions welcome — see CONTRIBUTING.md


Contents


What is an AI API relay?

An AI API relay (中轉站) is a forwarding endpoint that sits between your code and official LLM APIs. You point base_url at the relay and use its key to call OpenAI / Claude / Gemini and dozens more models — one key, local payment (Alipay/WeChat), lower latency, typically 30–90% cheaper.

An AI API relay (Chinese: API 中轉站; also "AI API proxy" / "relay station") is a middleman endpoint. Instead of calling OpenAI, Anthropic, Google, etc. directly, you point your base_url at the relay and use its key. The relay forwards the request upstream and returns the result — usually with full OpenAI API compatibility, so existing code needs almost no changes.

One key → dozens to hundreds of models across vendors.

This differs from a mirror site (镜像站), which is a proxied chat web UI for end users. A relay is for developers and applications.

Why this exists (globally)

Relays are not a China-only phenomenon. They are the product of two constraints — access blocking and payment friction — wherever those appear:

Region Access barrier Payment barrier
China Cross-border latency; some APIs blocked Needs foreign card; relays take Alipay/WeChat
Russia / Belarus / Iran OpenAI hard-blocks (even VPN detected) Sanctioned cards (e.g. Iran Shetab) rejected
APAC incl. Taiwan Generally reachable Card rejection, quota limits, billing friction
Sanctioned / restricted markets Account denial No accepted payment rail

The same need produces different ecosystems: Russia leans on IP proxies and state alternatives (e.g. GigaChat); China evolved a full relay retail industry. The global, "clean" version of this is the authorized aggregator (see OpenRouter below) — proof that unified LLM access is a real, growing market, not a workaround niche.

China / Asia relay stations

Status: active = independently reachable · unverified = community-sourced, not independently confirmed. Last verified = date the maintainer last visited the site; absent on unverified entries. Prices change constantly — verify on-site. Generated from data/providers.yaml.

Station Type Payment Status Last verified Entity Notes
云雾 API (YUNWU) mixed Alipay/WeChat active 2026-05-26 unknown Marketed on speed/stability; widely cited top-tier station.
柏拉图 AI (bltcy) mixed Alipay/WeChat active 2026-05-26 unknown Azure-backed channel; positions as lowest-price.
No.1-API aggregator Alipay/WeChat active 2026-05-26 unknown One-stop aggregation + relay platform.
UiUiAPI official-relay Alipay/WeChat active 2026-05-26 unknown Claims official channels + official multipliers; ~49% cheaper (claimed), 300+ models.
DMXAPI mixed Alipay/WeChat unverified unknown Community-listed; homepage not independently verified.
MKEAI mixed Alipay/WeChat unverified unknown Community-forum + relay hybrid; pushes DeepSeek.
GPTGOD reverse Alipay unverified unknown Reverse-engineered; cheap, stability not guaranteed.
CloseAI official-relay Alipay/WeChat/Invoice active 2026-05-26 registered Issues enterprise invoices; self-describes as largest enterprise-grade relay in Asia.

Inclusion ≠ endorsement. Listing documents the market. Always run the evaluation checklist before sending money or data.

Global gateways & aggregators

Service Type Payment Notes
OpenRouter aggregator Card/Crypto Official-authorized routing, ~5% markup; 400+ models, 60+ providers. ARR reportedly ~$5M (2025-05) → ~$50M (early 2026).
LiteLLM gateway-oss Open-source gateway (100+ providers) + enterprise tier. Self-hosted, bring your own keys.
Helicone observability LLM observability gateway; logging/cost analytics.
AIMLAPI aggregator Card/Crypto 400+ models, prepaid from $20; crypto support implies payment-friction workaround.

Self-hosted alternatives

Not relays — self-hosted gateways are the credible alternative when you don't want a third party in the request path. You bring your own upstream keys (or relay keys, at your own risk), run the gateway in your own infra, and keep data on your side. Most Chinese relay stations are themselves built on these templates.

Project Type Notes
One-API gateway-oss Popular Go-based multi-vendor gateway; the de-facto OSS template behind many relay stations.
new-api gateway-oss Fork of One-API with extra channel types; same self-hosted model — you supply keys.
LiteLLM gateway-oss Listed above. Python-first, 100+ providers, used by enterprises.

When to choose self-hosted over a relay: regulated data, production workloads, you already have foreign-card billing, or you want auditable logs. You give up the relay's Alipay/WeChat convenience and accept ops overhead.

Comparison & monitoring tools

Tool Notes
中轉站競技場 (AI API PK) Price wall across OpenAI / Reverse / Claude / DeepSeek for ~40 stations.
awesome-ai-proxy (mn-api) The original list (~31 stations). Unmaintained as of 2026 — this repo aims to continue the effort.

How to choose one safely

See docs/evaluation.md for the full framework. Short version:

  1. Identify the channel typeofficial-relay > mixed > aggregator > reverse.
  2. Run a canary test — known-hard prompts, official vs relay, weekly. Detects silent model downgrade ("降智"). Reproducible prompt set: docs/canary-prompts.md.
  3. Top up small — relays are prepaid; never prepay large balances.
  4. Match workload — sensitive/regulated data → official API or self-hosted gateway-oss, never reverse.

Canary prompts (detect silent downgrade)

The single most effective defense against a relay quietly substituting a cheaper model is a small set of reproducible canary prompts you re-run against both the official API and the relay on a weekly cadence.

The full prompt set, expected baseline behavior per family (GPT-4 class, Claude Opus class, Gemini 2.5 Pro class), and a pass/fail rubric live in docs/canary-prompts.md. Use it as-is or fork it.

Risks (read this)

The market is largely unregulated. From public research:

  • A study of 28 relays found 45.83% of endpoints served a model that didn't match the one requested (up to 47% performance gap on medical tasks).
  • An audit of 428 stations found 9 injecting malicious code and 1 stealing funds outright.
  • Of 17 top stations surveyed, 15 had no registered company.

Full breakdown with sources and mitigations: docs/risks.md.

Market context

  • The original community list (mn-api/awesome-ai-proxy) catalogued ~31 stations before going stale; aggregators like aiapipk.com track ~40 — and these are the visible tip only.
  • The "clean" global analogue, OpenRouter, reportedly grew ARR ~10x in under a year (≈$5M → ≈$50M), routing trillions of tokens monthly — strong evidence that unified, multi-vendor LLM access is a durable market, not a hack.
  • Conclusion: relay stations are a global response to blocking + payment friction. China's ecosystem is the deepest because both pains peak there.

FAQ

What is an AI API relay (中轉站)?

An AI API relay is a forwarding endpoint between your code and official LLM APIs (OpenAI, Anthropic, Google). You change base_url to the relay and use its key; it forwards upstream and returns the result, usually with full OpenAI-compatible formatting. One key gives access to dozens–hundreds of models across vendors.

Is an API relay safe? Can they steal my key or read my prompts?

Treat every relay as an untrusted intermediary. It can see every prompt, response, and any data you send — a hostile operator can log, modify, or exfiltrate it. An audit of 428 stations found 9 injecting malicious code and 1 stealing funds. Never send secrets or regulated data through a relay; for sensitive workloads use the official API or a self-hosted gateway.

Why are relays so much cheaper than official APIs?

Three reasons: wholesale upstream pricing, mixed channels (e.g. Azure quotas), and — for the cheapest tier — reverse-engineered access to vendors' web clients. Prices 1/10 to 1/50 of official usually mean a reverse channel or silent model downgrade, not a genuine bargain.

What's the difference between 官轉 (official-relay), 混合 (mixed) and 逆向 (reverse)?

official-relay forwards real official keys (purest, most stable, safest for production). mixed blends official with other channels. reverse is reverse-engineered from web clients (cheapest, least stable, highest ToS risk). Trust order: official-relay > mixed > aggregator > reverse.

How do I detect a relay silently swapping the model (降智)?

Run a canary test: keep 3–5 known-hard prompts, record the official API's baseline, then run them against the relay weekly and compare reasoning depth and format adherence. A study of 28 relays found 45.83% of endpoints served a model that didn't match the request — degradation often appears after you top up.

Relay vs OpenRouter vs official API — which should I use?

Official API: most reliable, needs a foreign card. OpenRouter and similar global aggregators: officially authorized, ~5% markup, card/crypto — the "clean" choice if you can pay. China/Asia relays: cheapest and accept Alipay/WeChat, but variable quality and operator risk. Use official or a self-hosted gateway for production or sensitive data.

What's the difference between a relay (中轉站) and a mirror site (镜像站)?

A mirror site is a proxied chat web UI for end users (a re-hosted ChatGPT-like page). A relay is a programmatic API endpoint for developers and applications. This list only covers relays and gateways.

Which countries need API relays? Is this only a China thing?

No. Relays appear wherever there is access blocking + payment friction. China has the deepest ecosystem; Russia/Belarus/Iran face hard blocks (even VPNs are detected) and sanctioned-card rejection; APAC including Taiwan mostly has access but hits card rejection and billing friction. It is a global response, not a China-only phenomenon.

Which relay is the best for Claude API (Sonnet / Opus)?

There is no single "best" relay that is safe to recommend by name — Anthropic's terms are stricter than OpenAI's, Claude keys get rotated and banned more aggressively, and relays that look stable today can degrade overnight. Pragmatic guidance:

  • For production Claude usage: use the official Anthropic API or a registered global aggregator like OpenRouter (official-authorized, ~5% markup, supports Claude).
  • For experimentation: prefer official-relay type entries above; verify the station actually returns the real model via the canary prompts in docs/canary-prompts.md before topping up.
  • Treat any relay claiming "Claude Opus at 1/10 price" as a reverse channel by default — that price implies web-client reverse-engineering or silent downgrade.

How do I tell if a relay station is about to run away (跑路)?

Empirically, exit scams share a pattern. Red flags, roughly in order of severity:

  1. No registered company / ICP filing — 15 of 17 surveyed top stations had none.
  2. Aggressive prepaid bonuses ("recharge ¥100 get ¥150") to attract balances before disappearing.
  3. Prices below physical cost — 1/10 to 1/50 of official; the math has to come from somewhere.
  4. Canary regressions — quality drops on prompts that previously passed (preceded ~70% of documented exits).
  5. Slow / vanishing support, broken billing pages, expired ICP records, social channels going quiet.

Mitigation: top up small amounts only; never prepay large balances; subscribe to relay-tracking channels (V2EX, the risks doc).

Should I use a relay or self-host One-API / LiteLLM?

Use a self-hosted gateway when any of these apply: regulated or customer data, production workloads, you already have foreign-card billing, or you need auditable logs. You keep keys in your own infra; nothing transits an unaccountable third party.

Use a relay only for: low-stakes experimentation, learning, hobby projects, or when you genuinely cannot get a foreign card. Treat anything you send through it as public. See Self-hosted alternatives.

Are relays worth it for users in Taiwan / Hong Kong / Southeast Asia?

Marginal. Network access is generally fine, so the only real wins are (a) Alipay/WeChat payment vs foreign credit card, and (b) one key across multiple vendors. The trade-off — your prompts transit a Chinese-jurisdiction third party with no contract — is steep.

For most TW/HK/SEA developers, the better stack is:

  1. Official APIs where you have card billing,
  2. OpenRouter for the multi-vendor convenience, and
  3. A self-hosted One-API or LiteLLM instance if you really need a unified key.

Relays make sense mainly when you have no card option at all.

Contributing

This list is actively maintained and community-driven. Add or correct a station, flag one that ran away, or contribute a documented risk case:

We do not accept referral links or marketing copy. Factual, dated, source-backed entries only.

License

CC0 — To the extent possible under law, contributors have waived all copyright and related rights to this work (CC0 1.0).

Disclaimer

This repository is an informational catalogue. It is not an endorsement of any service. Many relays operate in legal grey areas and may violate upstream providers' Terms of Service. Using a relay can expose your prompts, code, and data to an unaccountable third party. Evaluate risk yourself and comply with all applicable laws and contracts.

Releases

No releases published

Packages

 
 
 

Contributors