token-cost-optimization

Here are 3 public repositories matching this topic...

Ansh-Sarkar / OpenLLM-Monitor

OpenLLM Monitor 📊 is a plug-and-play, real-time observability dashboard 🔍 for monitoring and debugging LLM API calls across OpenAI 🤖, Ollama 🦙, OpenRouter 🌐, and more. It tracks tokens 🧮, latency ⏱️, cost 💸, retries 🔁, and lets you replay prompts 🔄. Fully open-source 🌍 and self-hostable 🛠️.

llm llm-inference llm-monitoring token-cost token-cost-optimization

Updated Jun 26, 2025
JavaScript

david-spies / context-ring

Star

Context-Ring is a **production-grade reverse proxy** that places session IDs and agent virtual nodes onto a **consistent hash ring**. Prompts from the same long-running task are deterministically routed to the exact same agent instance that already holds the chat history in local memory.

python load-balancer reverse-proxy fastapi local-memory consistent-hash-ring state-preservation deterministic-routing token-cost-optimization ai-agent-swarm context-ring token-cost-reduction graceful-scaling streaming-passthrough zero-state-transfer

Updated Jun 3, 2026
Python

snehalshirolikar11111 / AI-PM-Workflow-Platform

Star

a super-agent orchestration layer with 5 autonomous AI agents (Executive Briefing, Autonomous PRD, Sprint Intelligence, Release Readiness, Research: Competitive/Market/Customer) and 7 AI-powered workflow automations

ai-agents super-agent ai-workflows token-cost-optimization product-manager-workflow

Updated May 8, 2026
TypeScript

Improve this page

Add a description, image, and links to the token-cost-optimization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the token-cost-optimization topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly