Skip to content
#

token-cost-optimization

Here are 3 public repositories matching this topic...

Language: All
Filter by language

OpenLLM Monitor ๐Ÿ“Š is a plug-and-play, real-time observability dashboard ๐Ÿ” for monitoring and debugging LLM API calls across OpenAI ๐Ÿค–, Ollama ๐Ÿฆ™, OpenRouter ๐ŸŒ, and more. It tracks tokens ๐Ÿงฎ, latency โฑ๏ธ, cost ๐Ÿ’ธ, retries ๐Ÿ”, and lets you replay prompts ๐Ÿ”„. Fully open-source ๐ŸŒ and self-hostable ๐Ÿ› ๏ธ.

  • Updated Jun 26, 2025
  • JavaScript

Context-Ring is a **production-grade reverse proxy** that places session IDs and agent virtual nodes onto a **consistent hash ring**. Prompts from the same long-running task are deterministically routed to the exact same agent instance that already holds the chat history in local memory.

  • Updated Jun 3, 2026
  • Python

Improve this page

Add a description, image, and links to the token-cost-optimization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the token-cost-optimization topic, visit your repo's landing page and select "manage topics."

Learn more