Background
Our Autogen-based agent pods are stateless; all conversation history and outputs vanish on restart or scale-down. This blocks multi-turn workflows and wastes tokens.
Objectives
- Persist both short-term and long-term memory.
- Use Kubernetes-native storage (PVC, StatefulSet) or an in-cluster vector DB.
- Wire the chosen backend into Autogen’s memory API.
Approach (high-level)
- Start with Autogen’s built-in memory classes.
- Compare simple file-based storage on a PVC vs. a lightweight vector DB.
- Evaluate mem0 / openmemory and crewAI for deployment fit.
Action Plan (chronological)
- Spike: integrate memory with a local sqlite file on a PVC; confirm it survives pod restarts.
- Evaluate storage options: sqlite + PVC or a vector DB.
- PoC deploy vector DB: install a single-node Chroma/Milvus in dev namespace
- Refactor agent service: inject a pluggable memory provider (sqlite or vectordb) via config.
- Library review: test-drive mem0 and crewAI memory modules; note pros/cons and k8s deployment steps.
- Choose Kubernetes artifacts: finalise StatefulSet vs. Deployment + external DB; define backup & capacity plan.
Definition of Done
- Agents recall context after pod restarts and scaling.
- Backend is configurable (
sqlite or vectordb).
References
Background
Our Autogen-based agent pods are stateless; all conversation history and outputs vanish on restart or scale-down. This blocks multi-turn workflows and wastes tokens.
Objectives
Approach (high-level)
Action Plan (chronological)
Definition of Done
sqliteorvectordb).References