RUNTIME GOVERNANCE FOR ENTERPRISE AI

The Trust Fabric layer that composes with ServiceNow AICT and Microsoft Agent365.

The Trust Fabric layer that composes with ServiceNow AICT and Microsoft Agent365.

APERION is the runtime governance layer for enterprise AI agents in regulated industries. SmartFlow on-premises. Shield open-source. Built for financial services, healthcare, and defense.

APERION is the runtime governance layer for enterprise AI agents in regulated industries. SmartFlow on-premises. Shield open-source. Built for financial services, healthcare, and defense.

THE PROBLEM

Enterprise AI has a governance crisis.

Enterprise AI has a governance crisis.

Cloud-based AI gateways require you to trust a third party with your model traffic, your prompt data, and your compliance posture. For regulated industries, that trust model is structurally broken.


The March 2026 LiteLLM supply chain attack proved exactly what happens when enterprises trust open-source AI infrastructure with no supply chain governance: credential theft across 36% of cloud environments and a fully quarantined PyPI package.

Cloud-based AI gateways require you to trust a third party with your model traffic, your prompt data, and your compliance posture. For regulated industries, that trust model is structurally broken.


The March 2026 LiteLLM supply chain attack proved exactly what happens when enterprises trust open-source AI infrastructure with no supply chain governance: credential theft across 36% of cloud environments and a fully quarantined PyPI package.

INCIDENT TIMELINE — MARCH 2026

INCIDENT TIMELINE —
MARCH 2026

Mar 19

Trivy security scanner compromised via GitHub Action tag hijack

Mar 23

Checkmarx KICS GitHub Actions compromised using Trivy-exfiltrated credentials

Mar 24

LiteLLM v1.82.7 & v1.82.8 published to PyPI with credential-stealing malware

Mar 24

Entire LiteLLM package quarantined. 95M monthly downloads affected.

The Trust Fabric

Four layers. Two distinct planes.

Four layers. Two distinct planes.

Four layers. Two distinct planes.

Workflow agent governance and runtime model governance are different categories with different buyers, different budgets, and different failure modes. The Trust Fabric integrates both.

TRUST FABRIC · 4 LAYERS WORKFLOW PLANE · PARALLEL INFRASTRUCTURE LAYER 4 APERION Audit & Evidence Can we prove what happened to a regulator? LAYER 3 APERION Runtime Governance What did the agent actually send to the model? LAYER 2 Okta · Entra · AD · Veza Access Governance What is the human allowed to do? LAYER 1 APERION · NIST IAL2/AAL2 Identity Proofing Who is the actual human, really? WORKFLOW AGENT GOVERNANCE Which agent runs, on whose authority, doing what work? ServiceNow AICT Action Fabric · May 5 GA Microsoft Agent 365 "AI control plane" · May 1 GA Sits above the call between agent and model. Spawns agents, scopes authority, audits workflow actions. The Trust Fabric composes with both. APERION does not compete on workflow orchestration. APERION 2026

OUR SOLUTIONS

Flexible For Any Framework

Flexible For Any Framework

Flexible For Any Framework

Our on-premise AI firewall + control plane that enforces policy, optimizes cost, and proves ROI.

user@smartflow

:

~/config

$

smartflow deploy --mode production

✓ Validating configuration...

✓ Connecting to gateway cluster...

Providers detected:

• OpenAI GPT-4

(active)

• Anthropic Claude 3.5

(active)

• Google Gemini Pro

(standby)

✓ Cache layer initialized (Redis cluster)

✓ Policy rules loaded: 12 active

Routing configuration:

model_routing:

gpt-4:

70%

claude-3.5:

30%

cache_strategy:

ttl:

3600s

hit_rate_target:

85%

✓ Deployment successful! Gateway live at gateway.internal:8443

user@smartflow

:

~/config

$

terminal — smartflow-config

Smartflow Gateway

Smartflow Gateway

Unified routing across OpenAI, Anthropic, Google, and on-prem models. One control plane. No code changes.

Unified routing across OpenAI, Anthropic, Google, and on-prem models. One control plane. No code changes.

Unified AI provider access

Real-time compliance filtering

Granular usage tracking

Smartflow MetaCache

Smartflow MetaCache

Four-phase BERT semantic cache. 55–75% hit rates on production traffic. Published NVIDIA GTC 2026 benchmarks.

Four-phase BERT semantic cache. 55–75% hit rates on production traffic. Published NVIDIA GTC 2026 benchmarks.

55-75% cache hit rates

4x performance improvement

Intelligent routing

Smartflow Compliance

Smartflow Compliance

Inline policy enforcement before prompts reach any model. EU AI Act, NIST AI RMF, FINRA, HIPAA mapped out of the box.

Inline policy enforcement before prompts reach any model. EU AI Act, NIST AI RMF, FINRA, HIPAA mapped out of the box.

HIPAA/SOX/SEC/GDPR support

Custom blacklist/whitelist

Complete audit trail

p95

Published semantic cache benchmarks

Published semantic cache benchmarks

<5ms

<5ms

Routing overhead (Rust)

5

5

Patents filed

99.999%

99.999%

Production uptime

CAPABILITIES

Built for the architecture that is winning.

Built for the architecture that is winning.

On-Premises Deployment

Runs in your data center or private cloud. No cloud dependency. No PyPI supply chain risk. No third-party data exposure.

Identity-Aware Governance

Every AI interaction authenticated against your enterprise IdP. Entra ID, LDAP, SAML, OIDC. Per-user audit trails tied to real identities.

Inline Policy Enforcement

No-code compliance engine. Policies enforced before prompts reach any model. EU AI Act, NIST AI RMF, FINRA, HIPAA mapping.

Semantic Caching at p95

Four-phase BERT semantic cache. 55–75% hit rates. Published benchmarks from NVIDIA GTC 2026. Not marketing claims.

MCP Proxy Governance

Inline governance for agent-to-agent workflows. As agentic AI proliferates, MCP servers are the new attack surface. SmartFlow governs them.

Sub-5ms Overhead

Rust-based infrastructure. Not a Python library adding 20–80ms per request. Infrastructure-grade performance for production workloads.

99.999%

99.999%

Production uptime

Production uptime


Enterprise customers in financial services. 7+ months continuous operation.



Enterprise customers in financial services. 7+ months continuous operation.


5

Filed patents

runtime governance, identity binding, audit evidence

Fortune 500

Fortune 500

Active evaluations


Enterprise evaluations underway at institutions that define what production-grade means.


Compatible with leading AI providers and frameworks.

Compatible with leading AI providers and frameworks.

2026 Test Flight

2026 Test Flight

Smartflow is in production with named financial services partners. Currently scoping a limited set of design partners for the runtime governance plane.

Runtime governance for enterprise AI.

RESOURCES

Runtime governance for enterprise AI.

RESOURCES

Runtime governance for enterprise AI.

RESOURCES

Runtime governance for enterprise AI.

RESOURCES