Historical Windows temporal memory-state research artifact for studying time-bound memory observations, validation limits, and defensive visibility.
-
Updated
May 15, 2026 - Python
Historical Windows temporal memory-state research artifact for studying time-bound memory observations, validation limits, and defensive visibility.
PCBench: Benchmark for Python API parameter compatibility issues
Code, data, and ontologies for FAOS research papers on ontology-powered enterprise AI agent verification (RA-3 neurosymbolic, RA-6 trust certification).
Behavioral HIDS that survives baseline poisoning. Sediment robust estimator, Linux eBPF collector, tamper-evident SHA-256 audit chain.
JSON Schema for decision events as governance evidence units in automated decision and real-time risk systems. MIT.
Reproducibility package for fixed-ontology GraphRAG court-form filling experiments
Curated code and result summary for world-model inputs in Atari policy experiments.
PCART-LLM: Research artifact for LLM-based API compatibility analysis
Side-channel profiler that detects deceptive intent in LLMs by measuring the computational cost of lying.
Python library for evidence sufficiency scoring in governance assessments under delayed ground truth, drift, and decision-readiness constraints.
REQBench: Benchmark for compatible requirements inference in Python third-party library upgrades
Python toolkit for label-free monitoring of governance evidence degradation in delayed-label risk decision systems using proxy drift monitors and response chains.
PGP-inspired Post-Quantum text encryption. Features Hybrid Crypto (Kyber + X25519), TPM Hardware Binding, and paranoid memory hygiene.
PCREQ-evaluation: Evaluation artifact for PCREQ
Reference prototype and reproducibility artifact for an ML-KEM-768-based incompleteness-secured commitment framework with claim guards, benchmark scripts, and wrapper portability probes.
Comparative determinism experiment on a 285v D6 substrate (P_95 □ K_3) — Zer0pa Computation portfolio. Pure-rational deterministic Rust pipeline; 31,560 byte-identical SHA-256 hashes on commodity Android.
Benchmark dataset and evaluation harness for comparing governance evidence feasibility across rule-based, hybrid ML, streaming, and agentic AI decision systems.
Research artifact for "Duty, Defect, and Disclosure: Reassessing Developer Liability for LLM Chatbots in Suicidal Crises under Swiss and European Law" (UZH FS26, AI: Technology and Law)
Research artifact, paper, and frozen evaluation outputs for selective revocation and replay after persistent indirect prompt injection in memory-augmented LLM agents.
Python SDK for collecting raw system signals into Decision Event Schema evidence units with provenance, attribution, temporal metadata, and validation.
Add a description, image, and links to the research-artifact topic page so that developers can more easily learn about it.
To associate your repository with the research-artifact topic, visit your repo's landing page and select "manage topics."