Skip to content

rwilliamspbg-ops/Mohawk-Inference-Engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Mohawk Inference Engine

Mohawk Inference Engine is a local inference and management stack for splitting model execution across multiple devices while keeping transport and session handling secure. The project focuses on three capabilities that are hard to get in one place in lightweight desktop tools: multi-device layer splitting, PQC-secured edge offload, and high-concurrency session management.

What This Repo Contains

  • A toy layer-splitting runtime that partitions a model into slices and runs them across workers.
  • A secure controller/worker path with X25519 plus optional liboqs-backed hybrid KEM support.
  • A session manager and load harness for concurrent encrypted inference runs.
  • Telemetry and timing hooks for preload and execute paths.

Quick Start

python -m pip install -r prototype/requirements.txt
python prototype/run_demo.py

For the secure prototype, start the secure worker in a separate terminal:

python prototype/worker_secure.py --port 8003

If you have a local liboqs install, set:

export OQS_INSTALL_PATH=/usr/local

Recommended Reading

Testing

Run the focused prototype checks with:

python -m pytest -q prototype/test_oqs_hybrid.py prototype/test_secure_hybrid_integration.py prototype/test_concurrency_smoke.py prototype/test_secure_run.py -q -rA

Release Notes

  • Release v0.1.0 includes the liboqs devcontainer, secure prototype path, telemetry, and the scaling harness.
  • The repository is licensed under Apache-2.0. See LICENSE.

About

A lightweight, secure local inference and management engine featuring multi-device layer splitting, PQC-secured edge offloading, and high-concurrency session management.

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Packages

 
 
 

Contributors