ARK Governance Gateway — Epistemic Integrity for AI

The Problem

LLMs generate text.
They don't generate truth.

Every AI response is a mix of verified facts, reasonable inferences, and confident fabrications. Basic guardrails check for harmful content. Nobody checks for epistemic integrity.

⚠

Manufactured Certainty

Models present inferences as verified facts. "The GDP of Kiribati in 2024 was $220M" — said with full confidence, zero evidence.

↻

Sycophantic Drift

Over multiple turns, models silently shift their claims to agree with the user. Yesterday's "uncertain" becomes today's "confirmed."

◇

Unauditable Outputs

No claim-level audit trail. No evidence linking. No way to replay what the model said, why it was flagged, or what policy was active.

Architecture

Drop-in proxy.
Deep governance.

Change one line — your base_url. ARK Gateway sits between your application and any LLM, applying protocol-driven governance to every response.

📱

Your App

Any OpenAI SDK

base_url swap

§

ARK Gateway

Governance Engine

governed

🧠

Any LLM

DeepSeek · Claude · Llama

The Protocol Stack

§6.7

Calibration Gate

Every factual claim is extracted and tiered: VERIFIED, GROUNDED, INFERRED, or UNVERIFIED. Breach thresholds are configurable per tenant.

§17

Extractive Grounding

VERIFIED claims require a quotable source span from the evidence set. Manufactured citations are automatically demoted. Numeric consistency enforced — if the claim says "1%" but the span says "5%", automatic demotion via SPAN_NUMBER_MISMATCH.

The Moat

§11

Truth Budget

100-point epistemic budget per response. Each claim deducts based on tier. Budget breach = governance action. No unlimited hallucination.

§29.6

Anti-Sycophancy (SCR)

Multi-turn drift detection with risk accumulation. If the model silently shifts its position to agree with the user, the Naseeha lock fires.

§29.1

Maqasid Intent Gate

Pre-inference intent classification. Flags requests that could cause harm across five ethical dimensions before the model even generates.

AIS

Ark Integrity Score

Single 0.0–1.0 composite health score per response. Four components: truth budget health, grounded ratio, drift stability, citation coverage.

Empirical Proof

Measured. Not promised.

Every number below comes from real test runs — 6 protocol tests executed against both local infrastructure and DigitalOcean Serverless with H100-class GPUs.

~10s median per governed call

Full governance pipeline: inference → claim extraction → calibration → citation audit → truth budget → response. On cloud infrastructure.

vs ~125s on commodity local hardware 12× faster

1.27× strict vs baseline ratio

Compliance-grade governance (larger judge model) costs only 27% more latency than standard. Architecture-constant — holds on both local and cloud.

6/6 protocol tests pass

Extractive grounding, AIS scoring, shadow mode, industry profile overlay, truthful disclosure, and SCR risk accumulation — all verified on cloud infrastructure.

$0.0004 per governed call (Strict tier)

Measured live from audit-log token telemetry. Includes inference + judge extraction + all governance protocol calls. ~95% gross margin at $0.008/req customer price.

8.3× concurrency speedup at 16 parallel requests

Zero errors across 16-way concurrent governed calls. Per-call latency degraded only 27% under 16× load. One gateway instance sustains ~2,600 governed requests/hour.

One line to govern your AI

from openai import OpenAI

client = OpenAI(
    base_url="https://ark-api.yourdomain.com/v1",  # ← only change
    api_key="sk-your-tenant-key",
)

response = client.chat.completions.create(
    model="deepseek-3.2",
    messages=[{"role": "user", "content": "..."}],
    extra_body={"ark": {
        "protocols": ["calibration", "citation", "truth_budget", "scr"],
        "mode": "annotate",
    }}
)

# → response.ark.governance_report.ais.score = 0.87
# → response.ark.governance_report.claims = [{tier: "GROUNDED", ...}]
# → response.ark.governance_report.truth_budget = {remaining: 65, breach: false}

Try it. Right now.

Ask anything. Every response comes back tiered claim-by-claim. No signup. No download. No API keys (we provide a public demo key throttled to ~$0.05/day of upstream cost).

Demo feels cramped here? ↗ Open in new tab for full screen

Deployment Tiers

Choose your governance depth.

Same gateway. Same protocols. Different strictness levels — each empirically validated.

ARK Standard

$49/mo

+ $0.003 per governed request

COGS: ~$0.0005/call

Calibration Gate (§6.7)
Citation Anchor (§17)
Truth Budget (§11)
Anti-Sycophancy SCR (§29.6)
AIS composite score
Annotate mode
10K requests/month included
JSONL audit log

Get Started

Pre-configured for regulated sectors.

Each profile ships with calibrated thresholds, required disclaimers, banned-term flagging, and protocol defaults designed for the compliance reality of the sector.

☪

Islamic Finance

Shariah-compliance governance. Maqasid intent gate on by default. Scholarly-review disclaimer appended. Banned-term flagging for riba-based language — flag only, never rewrite.

Threshold: 25% Mode: Annotate Maqasid: On

⚕

Healthcare

Clinical advisory governance. Strict calibration with hard-fail on unverified claims. Clinical disclaimer on every output. Block mode default — halts on breach.

Threshold: 20% Mode: Block Budget: 80

🛡

Defense

Maximum calibration for sovereign and national-security deployments. Full audit-log content capture. Maqasid Mal/Nafs intent gate. Hard block default.

Threshold: 15% Mode: Block Audit: Full capture

Operational Modes

From observation to enforcement.

Annotate

Attach governance report to every response. Never modify content. Default for development and evaluation.

Shadow

Same as annotate, but explicitly tracks would_have_blocked. For regulated-buyer onboarding — prove governance value before enforcement.

Block

Replace content with halt notice on calibration breach, truth budget breach, or risk accumulation halt. Zero tolerance for unverified output.

Regenerate

Auto-retry with corrective system prompt on breach. The model gets a second chance with explicit guidance. If it still breaches, halt.

Compliance & Security

Truthful by design. Hardened by default.

ARK Gateway never lies about being governed. When sincerely asked, the model acknowledges: "Yes, my responses are checked for accuracy and safety." Truthful disclosure replaces the denial layer — compliant with EU AI Act Article 5 and the Sound Heart principle (§0).

Every governed call produces a full audit trail: claims extracted, tiers assigned, evidence hashes, policy snapshots. Replay-ready for compliance review.

🔒

Tenant Isolation

Per-tenant cache keys, rate limits, and audit logs. Zero cross-tenant data leakage — architecturally enforced, regression-tested.

🛡

Prompt Injection Defense

Evidence payloads are structurally delimited with anti-injection boundaries. Malicious evidence cannot hijack the extractor into false VERIFIED tags.

🔢

Numeric Consistency Gate

If a claim says "1%" but the source span says "5%", automatic demotion via SPAN_NUMBER_MISMATCH. The numbers in the claim must match the numbers in the evidence.

📋

Published Threat Model

Full SECURITY.md with threat matrix, mitigations, and verification mapping. Every security claim has a corresponding regression test.

Ready to govern your AI?

Early access is open for qualified teams in regulated industries. We onboard every customer personally.

Every AI claim.Verified or flagged.

LLMs generate text.They don't generate truth.