ShadowCheck - SIGINT Forensics Platform

🛡️ Production-grade SIGINT forensics and wireless network analysis platform. Real-time threat detection, geospatial correlation via PostGIS, and interactive analysis dashboards.

Current System Status

React/Vite frontend with TypeScript support is fully integrated (client/src/ routes like /geospatial-explorer, /analytics, /wigle, /kepler, /endpoint-test)
Modern modular backend architecture with organized services in server/src/api/, server/src/services/, and server/src/repositories/
Integrated Asset Serving: The main Express server natively serves compiled frontend assets from dist/ with optimized security headers.
Universal filter system with 20+ filter types supporting complex queries across all pages
DevContainer support for consistent development environments with VS Code integration
Integrated asset serving via the main Express server plus an optional static server for Lighthouse/security-header benchmarking
PostGIS materialized views for fast explorer pages with precomputed threat intelligence
ETL pipeline lives in etl/ with modular load/transform/promote steps feeding the explorer views; staging tables remain UNLOGGED for ingestion speed
Machine learning with multiple algorithms (Logistic Regression, Random Forest, Gradient Boosting) and hyperparameter optimization
Redis Integration handles caching, session management, and rate limiting.

Features

Dashboard: Real-time network environment overview with threat indicators and interactive metrics cards.
Geospatial Analysis: Interactive Mapbox visualization with spatial correlation, clustering, heatmaps, routes, and Unified Network Tooltips.
Infrastructure Datasets: 56 FBI Field Offices, 334 Resident Agencies, and 357 Federal Courthouses with 100% PostGIS coordinate coverage for geospatial correlation.
Geospatial Explorer: Map-based network exploration with overlays and timeline views.
Network Analysis: Deep dive into individual network characteristics and behavior patterns with universal filtering.
Threat Detection: ML-powered identification of surveillance devices and anomalies with multiple algorithms.
Analytics: Advanced charts and graphs for network pattern analysis with Chart.js visualizations.
Address Enrichment: Multi-API venue and business identification (4 sources: OpenCage, LocationIQ, Abstract, Overpass).
Device Classification: Standardized OUI-to-vendor resolution (74k+ records) with behavioral profiling.
Network Tagging: Manual classification and tag retrieval for networks.
Trilateration: AP location calculation from multiple observations with accuracy estimation.
Machine Learning: Multi-algorithm threat detection with hyperparameter optimization and model versioning.
Universal Filters: 20+ filter types supporting complex temporal, spatial, and behavioral queries.
WiGLE Integration: Local WiGLE database search with live API lookups.
Kepler Integration: Kepler.gl-ready GeoJSON endpoints with filter support.
Home Location & Markers: Saved locations and distance-from-home filters.
Data Export & Backup: CSV/JSON/GeoJSON exports plus admin-protected backups.
Authentication & Roles: Session-based login with admin-gated operations.
Admin Settings: AWS Secrets Manager-backed Mapbox/WiGLE/Google Maps configuration.
DevContainer Support: Consistent development environment with VS Code integration.
Security Headers: Production-ready deployment with CSP, HTTPS enforcement, and Lighthouse optimization.
Admin Features: System administration interface with configuration management, user settings, and role-based gating.
Admin Database Security: Multi-user model with read-only shadowcheck_user and privileged shadowcheck_admin.

See docs/FEATURES.md for the full feature catalog.

Project Commandments

ShadowCheck’s core engineering constraints live in:

AGENTS.md for repository-local automation and coding agents
CONTRIBUTING.md for human contributors

Those “Ten Commandments” are the canonical short-form rules for:

secrets handling and AWS Secrets Manager usage
canonical core data versus enrichment data boundaries
precision preservation and presentation-only rounding
refactor cleanliness and test expectations
separate validation of bootstrap, restore, import, and upgrade paths

Kepler.gl Data Policy

No default limits on Kepler endpoints unless explicitly requested via query params.
Filters are used instead of caps; Kepler.gl is designed for large datasets.

Recent Improvements (February 2026)

✅ Agency Offices Dataset Completion

Field Offices: 56 total, 56 ZIP+4 (100%), 0 ZIP5-only.
Resident Agencies: 334 total, 312 ZIP+4 (93.4%), 22 ZIP5-only.
Data Completeness: 0 missing cities, states, postal codes, phones, websites, or coordinates across all 390 primary records.
Normalization: Full address and phone normalization (10 digits) with original value preservation.

✅ TypeScript Migration & Build Pipeline

Complete TypeScript migration: Converted 60+ files including server utilities, middleware, ETL scripts, and build tools
Production build pipeline: Compiled TypeScript server for Docker deployment eliminates runtime ts-node overhead
Type safety: Added comprehensive interfaces and types for database operations, API responses, and service layers
Build optimization: Frontend and server compile separately with proper path resolution for containerized deployment

✅ Redis Implementation

Threat Score Caching: Redis caches threat scores at 5-minute intervals.
Analytics Caching: Redis caches analytics aggregations.
Session Management: Redis handles session storage.
Rate Limiting: Redis backend enforces 1000 req/15min per IP.

✅ Data Integrity Fixes

Fixed GeoSpatial table showing incorrect default values (signal: 0 dBm, channel: 0, frequency: 0 MHz)
Resolved analytics widgets failures (Temporal Activity, Radio Types Over Time, Threat Score Trends)
Fixed max distance calculation to use real PostGIS geographic distances instead of ~238m signal approximation
Resolved threat score column sorting issues (rule_score, ml_score, ml_weight, ml_boost now sortable)

✅ API & Backend Improvements

Networks API now uses latest observation data for accurate real-time information
Added manufacturer field population via radio_manufacturers table with OUI prefix matching
Fixed WiGLE observation points rendering with correct schema namespace (app vs public)
Enhanced analytics endpoints with proper null value handling and appropriate data sources

✅ Frontend Enhancements

Fixed data transformer field name mismatches (network_type → type, avg_score → avgScore)
Added missing API calls for temporal, radio-time, and threat-trends analytics
Improved geographic distance display and invalid value handling

✅ Testing & Quality

Added comprehensive regression tests for networks API data integrity
Enhanced error handling and validation across all endpoints

Architecture

Backend: Node.js/Express REST API with PostgreSQL + PostGIS + Redis (Modular architecture with Repositories and Services) Frontend: React 19 + Vite 7 with TypeScript (explorers and dashboards) Database: PostgreSQL 18 with PostGIS extension (566,400+ location records, 173,326+ unique networks) Cache: Redis v7 for sessions, rate limiting, and analytics Development: DevContainer support with VS Code integration Deployment: Production builds are served via the integrated asset handler in server/server.ts.

Prerequisites

Node.js 22+
PostgreSQL 18+ with PostGIS
Redis 7.0+
TypeScript 5.0+ (included in devDependencies)

Quick Start

Local Development

git clone https://github.com/cyclonite69/shadowcheck-web.git
cd shadowcheck-web
npm install
docker compose up -d

docker compose up -d starts a self-contained local PostgreSQL, Redis, API, and frontend stack. Local Docker uses DB_HOST=postgres by default. No .env file is required unless you want to override ports, AWS settings, or other defaults.

Local Shell Helpers

For frequent local development tasks, source the included helper aliases:

source ./scripts/local-dev-aliases.sh

Available helpers:

scroot - cd to the repository root.
sclocal - Wrapper for docker compose up -d --build.
scapi - Recreate the API container with standard AWS development defaults.
scgrafana - Start local Grafana with AWS-backed secrets and reader-role sync.
scps - Formatted docker ps view for ShadowCheck services.
scdb - Open psql as shadowcheck_user on the local database.
scdba - Open psql as shadowcheck_admin on the local database.
scsecrets - Restart the API + reload frontend so the stack re-reads AWS Secrets Manager before you log in.

Home Lab Deployment

git clone https://github.com/cyclonite69/shadowcheck-web.git
cd shadowcheck-web
./deploy/homelab/scripts/setup.sh

See deploy/homelab/README.md for hardware requirements and detailed setup.

AWS Production

🚀 Quick Start: See deploy/aws/QUICKSTART.md

# 1. Launch instance (from local machine)
./deploy/aws/scripts/launch-shadowcheck-spot.sh

# 2. Connect via SSM
aws ssm start-session --target INSTANCE_ID --region us-east-1

# 3. Run automated setup
bash
curl -fsSL https://raw.githubusercontent.com/cyclonite69/shadowcheck-web/master/deploy/aws/scripts/setup-instance.sh | sudo bash
cd /home/ssm-user
git clone https://github.com/cyclonite69/shadowcheck-web.git shadowcheck
cd shadowcheck
./deploy/aws/scripts/deploy-complete.sh

# 4. Update deployments
git pull origin master
./deploy/aws/scripts/scs_rebuild.sh

Documentation:

QUICKSTART.md - Complete deployment guide
WORKFLOW.md - Development workflow
README.md - AWS infrastructure details
DEPLOYMENT_CHECKLIST.md - Verification checklist
ssm-embedded-session-policy.json - IAM policy required for Admin UI embedded SSM

Detailed Setup

1. Clone and Install

git clone https://github.com/your-username/shadowcheck-web.git
cd shadowcheck-web
npm install

2. Database & Redis Setup

Use Docker for local setup (recommended) so PostGIS, Redis, bootstrap grants, and migrations stay in sync:

docker compose up -d

For AWS/EC2, use the deployment scripts in deploy/aws/scripts/ (especially scs_rebuild.sh) rather than running legacy SQL manually.

3. Environment Configuration

Use the example env files to keep local dev separate from AWS/deployed settings:

.env.example For shared non-secret defaults.
.env.local.example For local-machine overrides when your backend runs on the host instead of inside Docker.

Local Docker behavior:

docker compose up -d starts a local postgres service and uses DB_HOST=postgres by default
docker compose up -d also starts the backend API on 127.0.0.1:3001 and the frontend on http://127.0.0.1:8080
No .env file is required for container-to-container DB connectivity
PostgreSQL data is stored in the local postgres_data volume
Preferred local secrets path: export AWS_PROFILE=shadowcheck-sso, AWS_REGION=us-east-1, and optionally SHADOWCHECK_AWS_SECRET, then let the API container read shadowcheck/config through the read-only ${HOME}/.aws mount
Alternative local secrets path: export AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, and AWS_SESSION_TOKEN directly in your shell before docker compose up
Local-only mock path: if you intentionally do not use AWS Secrets Manager, export DB_PASSWORD, DB_ADMIN_PASSWORD, and any needed API keys such as MAPBOX_TOKEN or OPENCAGE_API_KEY in your shell before docker compose up
Optional shell helpers can be loaded with source ./scripts/local-dev-aliases.sh
sclocal runs docker compose up -d --build
scapi recreates the local api container with AWS defaults: AWS_PROFILE=shadowcheck-sso, AWS_REGION=us-east-1, SHADOWCHECK_AWS_SECRET=shadowcheck/config
sclocal api will refuse to run unless those three env vars are already set
scdb opens psql as shadowcheck_user
scdba opens psql as shadowcheck_admin

Host-based local development behavior:

The server now defaults to DB_HOST=postgres, which is correct for local Docker
If your backend runs on the host, set DB_HOST=localhost explicitly so it can use Postgres published on 127.0.0.1:5432

Typical local dev values when PostgreSQL and Redis are published on localhost:

DB_HOST=localhost
DB_PORT=5432
DB_USER=shadowcheck_user
DB_ADMIN_USER=shadowcheck_admin
DB_NAME=shadowcheck_db
REDIS_HOST=127.0.0.1
REDIS_PORT=6379
PORT=3001
NODE_ENV=development

Credentials needed for local dev:

DB_PASSWORD Required for the normal application DB pool.
DB_ADMIN_PASSWORD Required for admin DB routes, including /api/admin/geocoding/daemon.

If the backend runs inside Docker, DB_HOST=postgres is the expected local value.

Production/deployed environments can keep an explicit .env with the deployed database host, for example:

DB_HOST=34.204.161.164

Production keeps Secrets Manager as the source of truth for db_password, db_admin_password, mapbox_token, opencage_api_key, and related keys. The production .env should point at the deployed DB host, not duplicate those secrets.

Do not point local .env at the deployed EC2 database unless you intentionally want your local app to use the remote environment.

S3_BACKUP_BUCKET is configuration, not a secret. For local work, export it in your shell only if you plan to use S3 backup features. On EC2, it can come from .env or AWS SSM Parameter Store.

To test locally against production-derived data, pull the latest .dump backup from S3 and restore it into the local Docker PostgreSQL container:

docker compose up -d postgres redis api
export AWS_PROFILE=shadowcheck-sso
export AWS_REGION=us-east-1
export S3_BACKUP_BUCKET=dbcoopers-briefcase-161020170158
./scripts/fetch-latest-s3-backup.sh
./scripts/restore-local-backup.sh ./backups/s3/<latest-backup>.dump

If explorer/network list endpoints still fail after restoring a snapshot, refresh the materialized view that powers them:

docker exec shadowcheck_postgres_local psql -U shadowcheck_user -d shadowcheck_db -c \
  "REFRESH MATERIALIZED VIEW app.api_network_explorer_mv;"

4. Run Migrations

Migrations are applied by the standard runners:

# Local/dev
docker compose up -d

# AWS/EC2
./deploy/aws/scripts/scs_rebuild.sh

If you must run manually, use sql/run-migrations.sh with the same credentials and schema/search_path settings used by deployment scripts.

5. Start Server

npm start

Server runs on http://localhost:3001

Pages

Dashboard (React): / and /dashboard
Start Page (React): /start
Geospatial Explorer (React): /geospatial-explorer
Analytics (React): /analytics
API Test (React): /endpoint-test
Admin: /admin
WiGLE (React): /wigle
Kepler (React): /kepler

API Endpoints

Networks & Observations

GET /api/networks - Paginated network list with universal filtering
GET /api/v2/networks - Version 2 paginated networks API
GET /api/networks/observations/:bssid - All observation records for a specific network
GET /api/networks/search/:ssid - Search networks by SSID
GET /api/networks/tagged - List user-tagged networks
GET /api/location-markers - Retrieve saved map markers
GET /api/home-location - Get current home/base coordinates

Threat Analysis & ML

GET /api/threats/quick - High-performance threat detection overview
GET /api/threats/detect - Detailed movement-based forensic analysis
GET /api/ml/status - Check ML model training status and stats
POST /api/ml/train - Trigger ML model retraining on tagged data

Analytics & Intelligence

GET /api/analytics/dashboard-metrics - Key metrics for dashboard cards
GET /api/analytics/* - Various statistical distribution endpoints (temporal, signal, etc.)
GET /api/wigle/api-status - Check WiGLE API connectivity
GET /api/mapbox-token - Securely retrieve Mapbox API token

Data Management & Admin

POST /api/network-tags/:bssid - Manually classify a network (admin)
POST /api/admin/import-sqlite - Turbo SQLite database import (admin)
POST /api/admin/cleanup-duplicates - Remove redundant observation data (admin)
GET /api/csv - Export observations as CSV (full dataset)
GET /api/json - Export observations + networks as JSON (full dataset)
GET /api/geojson - Export observations as GeoJSON (full dataset)
POST /api/admin/aws/instances/:id/start - Start EC2 instance (admin)
POST /api/admin/aws/instances/:id/stop - Stop EC2 instance (admin)

Authentication

POST /api/auth/login - Session-based authentication
POST /api/auth/logout - Invalidate current session
GET /api/auth/status - Check current authentication state

See server/server.ts and server/src/utils/routeMounts.ts for implementation details.

Machine Learning

ShadowCheck includes multi-algorithm threat detection with model training and hyperparameter optimization.

Training Endpoint

POST /api/ml/train

Trains logistic regression model on all tagged networks in database.

Request:

curl -X POST http://localhost:3001/api/ml/train

Response:

{
  "ok": true,
  "model": {
    "type": "logistic_regression",
    "accuracy": 0.92,
    "precision": 0.88,
    "recall": 0.95,
    "f1": 0.91,
    "rocAuc": 0.94
  },
  "trainingData": {
    "totalNetworks": 45,
    "threats": 18,
    "falsePositives": 27
  },
  "message": "Model trained successfully"
}

Errors:

400: Fewer than 10 tagged networks (minimum required)
503: ML model module unavailable

Status Endpoint

GET /api/ml/status

Check model training status and tag statistics.

Advanced ML Iteration

Test multiple algorithms with grid search and cross-validation:

pip install -r scripts/ml/requirements.txt
python3 scripts/ml/ml-iterate.py

Tests Logistic Regression, Random Forest, and Gradient Boosting with hyperparameter tuning.

Features Used for Training

Observation count (network detections)
Unique days seen
Geographic distribution (location clustering)
Signal strength (RSSI max)
Distance range from home location
Behavioral flags (seen at home vs. away)

Project Structure

shadowcheck-web/
├── client/
│   ├── src/
│   │   ├── components/    # ⚛️ React components (TypeScript)
│   │   ├── App.tsx        # ⚛️ Main React app
│   │   └── main.tsx       # ⚛️ Frontend entry point
│   └── vite.config.ts     # ⚛️ Frontend build config (TypeScript)
├── server/
│   ├── src/
│   │   ├── api/           # 🔧 REST API routes (TypeScript)
│   │   ├── services/      # 🔧 Business logic (TypeScript)
│   │   ├── middleware/    # 🔧 Express middleware (TypeScript)
│   │   └── utils/         # 🔧 Server utilities (TypeScript)
│   ├── server.ts          # 🔧 Main server entry point
│   └── static-server.ts   # 🛠️ Benchmark static server
├── deploy/                # 🚀 Deployment configs (AWS, etc.)
│   └── aws/               # AWS-specific deployment
├── etl/                   # 📊 ETL pipeline (TypeScript)
├── scripts/               # 🛠️ Utility scripts (TypeScript)
├── tests/                 # 🧪 Jest tests (TypeScript)
├── sql/                   # 🗄️ Database migrations & functions
├── docs/                  # 📚 Documentation
├── tsconfig.json          # ⚙️ TypeScript config (client)
├── tsconfig.server.json   # ⚙️ TypeScript config (server)
└── docker-compose.yml     # 🐳 Docker configuration

📖 See docs/ARCHITECTURE.md for detailed frontend/backend organization.

Also see docs/README.md for the documentation index.

Development

Run dev server:

npm run dev

Run tests:

npm test

Configuration

Key environment variables (see .env.example):

DB_* - PostgreSQL connection
REDIS_* - Redis connection
PORT - Server port (default: 3001)
NODE_ENV - development or production

Security

Use strong database credentials in production.
Rotate passwords every 60-90 days (see deploy/aws/docs/PASSWORD_ROTATION.md).
Enable HTTPS/TLS at reverse proxy layer.
Restrict API access via rate limiting (enabled via Redis).
See SECURITY.md for detailed security guidelines.

Security Headers & Lighthouse Audits

For accurate Lighthouse Best Practices audits, use the benchmark static server:

npm run build
npm run serve:dist
# Then run Lighthouse against http://localhost:4000

The integrated server (server/server.ts) also applies high-security headers by default.

Third-Party Cookies Notice

Expected Lighthouse warning: "Uses third-party cookies" from api.mapbox.com.

This is an expected behavior when using Mapbox GL JS. Mapbox sets cookies for:

Session management and rate limiting
Telemetry (can be disabled via mapboxgl.config.COLLECT_TELEMETRY = false)
Tile caching

These cookies are required for Mapbox functionality and cannot be eliminated without self-hosting all map tiles.

SEO Indexing

By default, robots.txt disallows all crawling (dev/staging). For production:

npm run build:public  # Sets ROBOTS_ALLOW_INDEXING=true

Documentation

Additional documentation is available in the docs directory. See docs/README.md for navigation.

Wiki (Diagrams & Visual Docs)

The wiki in .github/wiki/ contains diagram-heavy documentation and is the primary source for architecture and flow visuals.

Contributing

See CONTRIBUTING.md for code standards and workflow.

Code of Conduct

See CODE_OF_CONDUCT.md.

License

MIT. See LICENSE for details.

EC2 Infrastructure Deployment

For production deployment on AWS EC2 (Graviton/ARM64):

Clone & Prep:

git clone <repo_url> /home/ssm-user/shadowcheck
cd /home/ssm-user/shadowcheck
cp .env.example .env # And fill in AWS/DB secrets

Bootstrap:

chmod +x scripts/setup-ec2.sh
./scripts/setup-ec2.sh

Start Services:

cd docker/infrastructure
docker-compose -f docker-compose.postgres.yml up -d

Name		Name	Last commit message	Last commit date
Latest commit History 1,965 Commits
.circleci		.circleci
.devcontainer		.devcontainer
.github		.github
.husky		.husky
.kiro		.kiro
.vscode		.vscode
certs		certs
client		client
config		config
deploy		deploy
docker		docker
docs		docs
etl		etl
grafana		grafana
reports		reports
scripts		scripts
server		server
sql		sql
tests		tests
.dockerignore		.dockerignore
.editorconfig		.editorconfig
.env.example		.env.example
.env.local.example		.env.local.example
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
.node-version		.node-version
.nvmrc		.nvmrc
.prettierignore		.prettierignore
.prettierrc.json		.prettierrc.json
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
GEMINI.md		GEMINI.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
docker-compose.dev.yml		docker-compose.dev.yml
docker-compose.monitoring.yml		docker-compose.monitoring.yml
docker-compose.yml		docker-compose.yml
eslint.config.js		eslint.config.js
jest.config.js		jest.config.js
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json
tsconfig.server.json		tsconfig.server.json

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

ShadowCheck - SIGINT Forensics Platform

Current System Status

Features

Project Commandments

Kepler.gl Data Policy

Recent Improvements (February 2026)

Architecture

Prerequisites

Quick Start

Local Development

Local Shell Helpers

Home Lab Deployment

AWS Production

Detailed Setup

1. Clone and Install

2. Database & Redis Setup

3. Environment Configuration

4. Run Migrations

5. Start Server

Pages

API Endpoints

Networks & Observations

Threat Analysis & ML

Analytics & Intelligence

Data Management & Admin

Authentication

Machine Learning

Training Endpoint

Status Endpoint

Advanced ML Iteration

Features Used for Training

Project Structure

Development

Configuration

Security

Security Headers & Lighthouse Audits

Third-Party Cookies Notice

SEO Indexing

Documentation

Wiki (Diagrams & Visual Docs)

Contributing

Code of Conduct

License

EC2 Infrastructure Deployment

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages