safeclipper

safeclipper is a local-first image and text redaction tool for screenshots and agent workflows. It OCRs an image, detects sensitive text with OpenAI Privacy Filter q4 ONNX weights, and masks the matching image regions before the image leaves the user's machine.

Why

Proactive agents increasingly need screen context: screenshots, browser pages, app windows, documents, support tickets, dashboards, and other user data. That context is useful, but it can also contain names, emails, phone numbers, addresses, API keys, account numbers, student IDs, customer records, and other private information.

safeclipper exists to put a local privacy layer in front of those workflows. The intended use is simple:

Capture or receive a screenshot locally.
Run OCR and sensitive-span detection on-device.
Produce a redacted image with private regions masked.
Send only the redacted image to a proactive agent, cloud model, support workflow, or evaluation pipeline.

The goal is not to replace agent memory or permission systems. The goal is to reduce accidental data collection by making local screenshot sanitization cheap, repeatable, and available as both a CLI and a native macOS app.

Repository Layout

crates/safeclipper-cli/: Rust CLI. Text redaction is cross-platform. Image OCR/redaction supports Apple Vision and Tesseract on macOS, and Tesseract on non-macOS platforms.
apps/macos/SafeClipper/: native SwiftUI macOS app. It links the Rust library for OCR, privacy detection, and image redaction.
models/: model download scripts and ignored local model cache.
fixtures/: small local test images.
eval/: OCR/evaluation artifacts from dataset experiments, including the desktop-pii Hugging Face screenshot benchmark runner.

Benchmark

Measured on May 25, 2026 with a release build on an Apple M3 Max, 64 GB memory, macOS 26.5, and model_q4_embedded.onnx.

wall time is the end-to-end CLI process time, including process startup, tokenizer/model loading, OCR when present, inference, decoding, image masking, and output serialization. model time is the CLI-reported privacy-model inference path latency from the JSON summary, excluding model load and OCR.

Use case	Input	Provider	Spans	Warm wall time	Model time
Short text redaction	`Alice Smith email alice@example.com`	CPU	2	~0.65 s	~70 ms
Support-ticket text	255 chars with name, email, phone, address, API token	CPU	8	~1.04 s	~423 ms
Screenshot redaction	`fixtures/privacy-screenshot.png` with Vision OCR	CPU	8	~1.39 s	~419 ms

Desktop PII Evaluation

The eval/desktop-pii harness downloads paperboy-ai/desktop-pii-210 from Hugging Face, runs safeclipper over all 210 synthetic desktop screenshots, and scores Gemini answers on the safeclipper-redacted images.

Latest full run on May 25, 2026. All rows use Gemini redacted-image QA over the same 210 synthetic desktop screenshots and the same calibrated 2,717-question subset. The safeclipper row uses model_q4_embedded.onnx, CPU provider, and Apple Vision OCR.

Redactor	QA questions	Utility score	Privacy block rate	Privacy leak rate	Overall
WebRedact large	2,717	0.6412	0.5761	0.4239	0.6087
ScreenPipe PII image redactor	2,717	0.9043	0.2395	0.7605	0.5719
Gemini 3 Flash preview redaction	2,717	0.9681	0.7647	0.2353	0.8664
safeclipper q4 + OCR	2,717	0.9152	0.7039	0.2961	0.8095

Run it with OPENROUTER_API_KEY or OPENAI_API_KEY plus an OpenAI-compatible endpoint from OPENROUTER_BASE_URL, OPENAI_BASE_URL, LLM_BASE_URL, or --llm-base-url:

uv --project eval run python \
  eval/desktop-pii/run_eval.py --qa-mode gemini --resume

Detailed methodology and results live in eval/desktop-pii/README.md.

Setup

Download the q4 model files:

./models/download-openai-privacy-filter-q4.sh

Build the Rust CLI and dylib:

cargo +1.88.0 build --release -p safeclipper-cli

Run text detection:

./target/release/safeclipper \
  --provider cpu \
  --model models/openai-privacy-filter/onnx/model_q4_embedded.onnx \
  --tokenizer models/openai-privacy-filter/tokenizer.json \
  --config models/openai-privacy-filter/config.json \
  --text "Alice Smith email alice@example.com"

Output only the redacted text:

./target/release/safeclipper \
  --provider cpu \
  --model models/openai-privacy-filter/onnx/model_q4_embedded.onnx \
  --tokenizer models/openai-privacy-filter/tokenizer.json \
  --config models/openai-privacy-filter/config.json \
  --text "Alice Smith email alice@example.com" \
  --output text

Redact an image directly from the CLI on macOS:

./target/release/safeclipper \
  --provider cpu \
  --model models/openai-privacy-filter/onnx/model_q4_embedded.onnx \
  --tokenizer models/openai-privacy-filter/tokenizer.json \
  --config models/openai-privacy-filter/config.json \
  --image fixtures/privacy-screenshot.png \
  --output-image /tmp/privacy-screenshot-redacted.png

For image input, the Rust CLI runs OCR, feeds the OCR text into the same ONNX privacy model, maps detected spans back to OCR token boxes, and writes a redacted PNG/JPEG with black masks.

OCR backends:

--ocr-backend auto: default. Uses Apple Vision on macOS and Tesseract elsewhere.
--ocr-backend vision: macOS only. Calls Apple Vision through a small native bridge.
--ocr-backend tesseract: uses the tesseract command-line binary on macOS, Linux, or Windows.

Tesseract options:

./target/release/safeclipper \
  --provider cpu \
  --model models/openai-privacy-filter/onnx/model_q4_embedded.onnx \
  --tokenizer models/openai-privacy-filter/tokenizer.json \
  --config models/openai-privacy-filter/config.json \
  --image fixtures/privacy-screenshot.png \
  --output-image /tmp/privacy-screenshot-redacted.png \
  --ocr-backend tesseract \
  --tesseract-bin tesseract \
  --tesseract-lang eng \
  --tesseract-psm 6

Build and run the macOS app:

cd apps/macos/SafeClipper
./build-app.sh debug
open build/safeclipper.app

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
apps/macos/SafeClipper		apps/macos/SafeClipper
assets		assets
crates/safeclipper-cli		crates/safeclipper-cli
eval		eval
fixtures		fixtures
models		models
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

safeclipper

Why

Repository Layout

Benchmark

Desktop PII Evaluation

Setup

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

safeclipper

Why

Repository Layout

Benchmark

Desktop PII Evaluation

Setup

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages