vector-db-sizer

Analytical CLI estimator for vector database disk and RAM sizing.

What it is

Use it for fast pre-implementation sizing work, such as:

early architecture decisions;
comparing vector dimensions;
comparing engines;
comparing index types;
estimating metadata/payload impact;
generating Markdown/CSV/JSON artifacts for architecture discussions.

What it does not do

No live database connections.
No ingestion or load execution.
No latency/recall benchmarking.
No pricing calculations.
No production guarantee.

Quick start

Run directly from PyPI with uvx:

uvx vector-db-sizer --help
uvx vector-db-sizer list-engines

Input YAML

name: qdrant_text_hnsw

dataset:
  source_type: text
  total_tokens: 50000000
  chunk_tokens: 512
  chunk_overlap: 64

embedding:
  kind: dense
  dimensions: 1536
  dtype: float32

database:
  engine: qdrant
  index_type: hnsw

Validate and estimate

uvx vector-db-sizer validate scenario.yaml
uvx vector-db-sizer estimate scenario.yaml --format markdown --out report.md

Single-scenario example (from the local repository)

uv run vector-db-sizer estimate examples/qdrant_text_hnsw.yaml --format markdown

Multi-scenario example (from the local repository)

uv run vector-db-sizer estimate examples/multi_scenario.yaml --format csv
uv run vector-db-sizer estimate examples/multi_scenario.yaml --format json

Output formats

json (machine-readable)
markdown (human report)
csv (comparison table)

Supported engines

generic
pgvector
qdrant
milvus
elasticsearch
opensearch
weaviate
pinecone

How to interpret the report

Raw vectors: uncompressed/base vector bytes.
Quantized vectors: additional quantized representation when modeled.
Record payload: IDs + metadata/text/provenance payload bytes.
Index disk: index structure bytes on disk.
Engine overhead: engine/profile-level overhead approximation.
Final disk estimate: replicated storage plus WAL/snapshot/safety factors.
Final RAM estimate: vectors + payload + index + overhead RAM approximation.
Warnings: profile caveats and scenario assumptions to review.
Confidence: per-component confidence levels for planning.

Confidence levels

high: formulaic or type-level estimate.
medium: useful engineering approximation.
low: heuristic and engine-dependent; validate with pilot load.

Production sizing warning

The estimates are analytical and should be calibrated with a representative pilot load before production capacity planning.

Development

uv sync
uv run pytest
uv run ruff check .

Current limitations

Engine profiles are approximate.
No vendor pricing model.
No actual DB measurements from live systems.
No latency/recall estimation.
No automatic database selection.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
docs/assets		docs/assets
examples		examples
src/vector_db_sizer		src/vector_db_sizer
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

vector-db-sizer

What it is

What it does not do

Quick start

Input YAML

Validate and estimate

Single-scenario example (from the local repository)

Multi-scenario example (from the local repository)

Output formats

Supported engines

How to interpret the report

Confidence levels

Production sizing warning

Development

Current limitations

About

Uh oh!

Releases

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

vector-db-sizer

What it is

What it does not do

Quick start

Input YAML

Validate and estimate

Single-scenario example (from the local repository)

Multi-scenario example (from the local repository)

Output formats

Supported engines

How to interpret the report

Confidence levels

Production sizing warning

Development

Current limitations

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Contributors

Uh oh!

Languages