Mohawk Inference Engine is a local inference and management stack for splitting model execution across multiple devices while keeping transport and session handling secure. The project focuses on three capabilities that are hard to get in one place in lightweight desktop tools: multi-device layer splitting, PQC-secured edge offload, and high-concurrency session management.
- A toy layer-splitting runtime that partitions a model into slices and runs them across workers.
- A secure controller/worker path with X25519 plus optional liboqs-backed hybrid KEM support.
- A session manager and load harness for concurrent encrypted inference runs.
- Telemetry and timing hooks for preload and execute paths.
python -m pip install -r prototype/requirements.txt
python prototype/run_demo.pyFor the secure prototype, start the secure worker in a separate terminal:
python prototype/worker_secure.py --port 8003If you have a local liboqs install, set:
export OQS_INSTALL_PATH=/usr/localRun the focused prototype checks with:
python -m pytest -q prototype/test_oqs_hybrid.py prototype/test_secure_hybrid_integration.py prototype/test_concurrency_smoke.py prototype/test_secure_run.py -q -rA- Release
v0.1.0includes the liboqs devcontainer, secure prototype path, telemetry, and the scaling harness. - The repository is licensed under Apache-2.0. See LICENSE.