av — Agentic Video CLI

Index once, search many. Video memory tool for agentic workflows by Pixel ML.

pip install pixelml-av

Quick Start

# 1. Set up your provider (interactive wizard)
av config setup

# 2. Ingest a video (transcript + embeddings)
av ingest video.mp4

# 3. Search across all indexed videos
av search "what was discussed about pricing"

# 4. Ask questions with RAG (citations included)
av ask "what were the key decisions made?"

# 5. List indexed videos
av list

# 6. Get transcript as VTT subtitles
av transcript <video_id> --format vtt

# 7. Export all data as JSONL
av export --format jsonl

Configuration

Interactive Setup (Recommended)

av config setup

Choose from four providers:

#	Provider	Auth	Transcription	Embeddings
1	OpenAI (Codex OAuth)	Auto-detected	Whisper	text-embedding-3-small
2	OpenAI (API key)	`sk-...` key	Whisper	text-embedding-3-small
3	Anthropic (Claude)	API key	Not supported	Not supported
4	Google (Gemini)	API key	Not supported	text-embedding-004

Config is saved to ~/.config/av/config.json and persists across sessions.

Note: Anthropic and Gemini don't support Whisper transcription. With these providers, use av ingest --captions for frame-based captioning, or set AV_OPENAI_API_KEY for transcription fallback.

Environment Variables

Env vars always override config.json:

export AV_API_KEY="sk-..."
export AV_API_BASE_URL="https://api.openai.com/v1"  # or any OpenAI-compatible endpoint
export AV_TRANSCRIBE_MODEL="whisper"
export AV_VISION_MODEL="gpt-4-1"
export AV_EMBED_MODEL="text-embedding-3-small"
export AV_CHAT_MODEL="gpt-4-1"

Requirements

Python 3.11+
FFmpeg (brew install ffmpeg)
An API key from OpenAI, Anthropic, or Google — or Codex CLI OAuth

Commands

Command	Description
`av config setup`	Interactive provider setup wizard
`av config show`	Show current configuration
`av ingest <path>`	Ingest video file(s) into the index
`av search <query>`	Full-text + semantic search
`av ask <question>`	RAG Q&A with citations
`av list`	List all indexed videos
`av info <video_id>`	Detailed video metadata
`av transcript <id>`	Output transcript (VTT/SRT/text)
`av export`	Export as JSONL/VTT/SRT
`av open <id> --at <sec>`	Open video at timestamp
`av version`	Print version JSON

License

Apache License 2.0 — see LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.github/workflows		.github/workflows
prompts		prompts
samples		samples
skills/agentic-video-memory		skills/agentic-video-memory
src/av		src/av
tests		tests
.gitignore		.gitignore
.python-version		.python-version
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

av — Agentic Video CLI

Quick Start

Configuration

Interactive Setup (Recommended)

Environment Variables

Requirements

Commands

License

About

Uh oh!

Releases

Packages

Languages

License

PixelML/av

Folders and files

Latest commit

History

Repository files navigation

av — Agentic Video CLI

Quick Start

Configuration

Interactive Setup (Recommended)

Environment Variables

Requirements

Commands

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages