Knowledge register · 2026Updated · week 20

What we know, you know too

Roadmap, release notes, architecture whitepapers, benchmark studies against hyperscalers and a full glossary of the AI terms that matter – dated, versioned and free of marketing varnish. So you can evaluate Agenivo before you commit.

6 sections
in the register
biweekly
update cadence
100% EU
sources & hosting
The register in numbers

A register that lives – not one that gets archived

We keep every section actively maintained instead of publishing once and walking away. These numbers show how much sits in the register – and how regularly we touch it.

  • 01
    0

    Knowledge sections

    Roadmap to compliance FAQ

  • 02
    0+

    Glossary terms

    A–Z, each with a source

  • 03
    0

    Whitepapers & decks

    Architecture · compliance · TCO

  • 04
    0

    Release notes

    Dated & versioned since 2024

Update cadence

Entries we revised or newly added per calendar week.

Editorial rhythm

Every 14 days

Current week

+8

4
6
3
7
5
8
4
9
6
7
5
8
W09W10W11W12W13W14W15W16W17W18W19W20

As of May 15, 2026. Values are carried forward from our knowledge board with every editorial cycle.

Public roadmap

What is coming next – dated, not promised

What is being built sits here in plain sight. Every item carries its real status – not "soon", but quarters that actually appear in engineering planning.

ShippedIn progressNextLater

Now · Q2 2026

In active development

03
  • In progressStrato

    Multi-region failover (Munich · standby)

    Full hot standby in Munich with DNS failover under 30 seconds. Quarterly failover drill with customer participation.

  • In progressLogica

    Hybrid engine v2 (symbol + LLM)

    Rule-based reasoning combined with LLM inference. Reduces hallucinations in regulated domains by 64%.

  • ShippedCustos

    WORM audit logs (365 days)

    Immutable, audit-grade logs per BSI C5 – exportable as JSON or S3 object lock.

Next · Q3 2026

Spec finalised

04
  • NextIntelligence

    Retrieval module "Sovereign Vault"

    BYOK encryption for the knowledge base and embeddings. Keys remain in the customer HSM.

  • NextKinetic

    Workflow designer (no-code)

    Visual definition of multi-step agent workflows. Triggers, tools and escalation paths by drag-and-drop.

  • NextStrato

    Sovereign cloud option (IONOS)

    Full platform distribution on IONOS Sovereign Cloud – alternative to T-Systems, BSI C5 attested.

  • NextCustos

    EU AI Act risk classifier

    Automatic risk classification of each agent per Annex III. Model cards and disclosure duties built in.

Later · Q4 2026

Strategic outlook

04
  • LaterLogica

    Explainability API (token-level)

    Explainability trace per answer: which knowledge source, which token span, which confidence score.

  • LaterKinetic

    Voice channel (telephony · SIP)

    Native telephony integration with ASR/TTS in German and Swiss German. End-to-end latency < 800 ms.

  • LaterIntelligence

    Multimodal inference (image + document)

    OCR, diagram understanding and image classification in the knowledge base – without external US models.

  • LaterPlatform

    On-premise distribution v3

    Full platform distribution for your own Kubernetes incl. update rollout via VPN and air-gap mode.

The roadmap reflects current planning. Order may shift – we communicate changes 30 days ahead via release note.

Release notes
live

What has changed in the last few weeks

Versioned changes, dated and split per component. Filter by the product area that touches your architecture.

Filter scope:
  • Custos

    v2026.5.1

    May 12, 2026

    WORM audit logs now generally available

    Immutable audit logs per BSI C5 are now live for all Stratus and Sovereign tier customers. 365 days standard retention, extensible to 10 years.

    • Object Lock export

      Logs exportable to S3-compatible object stores with hardware lock mode.

    • PII pseudonymisation

      Automatic PII masking in the audit trail before persistence – configurable per tenant.

    • Hash chain verification

      Tamper detection via SHA-256 hash chain with daily anchoring in an independent TSA.

  • Strato

    v2026.4.3

    April 28, 2026

    Predictive pre-scaling for e-commerce peaks

    The load-prediction ML model now detects campaign days 7 days in advance and keeps warm pools ready. Cuts tail-latency spikes by 73%.

    • Cron-aware pre-scale

      Calendar integration: marketing campaigns, shipping cut-offs and Black Friday as pre-scale triggers.

    • Cold start: 0 s

      Warm pool replicas keep container image and model weights in RAM. First request < 80 ms.

    • HPA race condition

      Fixed: rare race between HPA and VPA during simultaneous scale event (#2841).

  • Logica

    v2026.4.1

    April 14, 2026

    Hybrid reasoning · symbol engine beta

    First beta of the symbol reasoning engine. Rule-based inference alongside LLM inference, with a confidence score per answer.

    • Rule DSL

      Domain-specific language for deterministic business rules – validatable in the IDE plugin.

    • Confidence routing

      Answers below a configurable confidence threshold are automatically escalated to human reviewers.

    • −64% hallucinations

      On a domain-specific eval set (insurance, legal, medical) error rate dropped vs. pure LLM setup.

  • Intelligence

    v2026.3.2

    March 20, 2026

    Native retrieval with sovereign embeddings

    Embeddings are now computed with an EU-hosted model. No transmission to US providers, not even for indexing.

    • EU embedding service

      Own embedding inference on the Frankfurt cluster. Model updated yearly, version pinning possible.

    • Multi-lingual DE+EN

      Cross-lingual retrieval: a German query finds an English document without translation.

    • Vector store encryption

      Embeddings are encrypted per tenant before persistence – only authorised agents can decrypt.

  • Kinetic

    v2026.3.1

    March 7, 2026

    Escalation routing with skill matching

    Agents now escalate not "to the queue" but to the human whose skill profile matches the request.

    • Skill index

      Employee profiles with languages, expertise, shifts. Matching by embedding similarity, not tags.

    • −41% escalation bounce

      Fewer re-escalations because the first human handover already fits.

  • Strato

    v2026.2.4

    February 18, 2026

    EU AI Act compliant model cards

    Every agent going to production is now automatically issued a model card per EU AI Act – risk class, training data origin, confidence metrics.

    • Auto-generated model cards

      Model cards refreshed on every deploy. Version history in the audit trail.

    • Risk class wizard

      Interactive questionnaire walks through Annex III classification. Output: signed PDF for regulators.

Architecture whitepapers

The documents our architects share with yours

Deep technical and regulatory whitepapers – the same documents that sit on the table in architecture reviews with our pilot customers.

09 documents

All whitepapers are free and delivered without lead tracking. Confidential documents (full pentest, SOC 2) follow a two-step NDA process.

Benchmarks & comparative studies

Where we win – and where we do not. Honest.

Four reproducible comparison studies against hyperscaler offerings and in-house builds. You can rerun the workloads yourself – we ship the eval code.

Vergleich

Agenivo

Hyperscaler (Azure / AWS)
Build in-house
  • Time-to-first-production

    14 days
    6–10 weeks
    4–6 months
  • EU data residency guaranteed

    Contractual · 100%
    Configurable
    Self-guaranteed
  • Hallucination rate (regulated domain)

    2.1%
    5.8%
    6.4%
  • Audit trail per BSI C5

    Out-of-the-box
    Add-on · config
    Self-build required
  • Exit guarantee

    90 days · contractual
    Standard terms
    Own asset
  • Inference latency (P95)

    480 ms
    420 ms
    380 ms
  • 3-year TCO (mid-size tenant)

    −38%
    Baseline
    +22%
  • Innovation cadence

    2-week releases
    Weekly
    Sprint-bound

Methodology

All benchmarks: identical hardware classes, identical eval dataset, three independent runs. Cost incl. personnel effort for setup and operations (full cost, not just cloud bill). Sources versioned in Git, eval notebook bundled with each report.

Study 02/2026

Insurance · claim intake

Initial classification of 5,000 claim notifications, document attachments, escalation routing.

  • 94.2%Accuracy
  • 88.1%Build accuracy
  • −61%Cost/case

Hybrid reasoning (symbol + LLM) reaches higher accuracy in regulated domains than pure LLM, at lower inference cost thanks to the symbol pre-filter.

Study 11/2025

E-commerce · Black Friday scale

Spike from 120 to 14,800 req/min within 18 minutes, 24-hour sustain plateau.

  • 99.98%Successful requests
  • 99.84%Hyperscaler
  • −68%Idle cost

Predictive pre-scaling cuts tail-latency spikes vs. purely reactive auto-scaling. Idle cost drops because no safety capacity is left running.

Study 09/2025

Banking · compliance research

420 anti-money-laundering queries with source citations and confidence annotations.

  • 96.4%Faithfulness
  • 89.7%Hyperscaler
  • +22%Source precision

Sovereign retrieval with EU embeddings and confidence routing delivers higher faithfulness. Sources are cited, not hallucinated.

Study 06/2025

Industrial · service knowledge

12,000 service tickets, multi-lingual (de, en, it), technical diagrams as input.

  • +34%First-contact resolution
  • −74%Translation need
  • −42%3-year TCO

Cross-lingual retrieval unlocks knowledge across language borders without a translation pipeline. TCO drops because translation tooling can be retired.

AI glossary

The terms that come up in every architecture review

Over 120 terms around AI agents, RAG, compliance and cloud infrastructure – compact explanations with pointers to deeper sources.

34 / 34terms
02
  • Agent (AI agent)

    Autonomous system with skills

    AI system that understands requests, takes actions (tools, skills) and escalates to humans. At Agenivo: configurable per use case and channel-agnostic.

    Siehe auch
  • DPA

    Data processing agreement (Art. 28 GDPR)

    Regulates data processing by a service provider on behalf of the controller. Agenivo signs a full DPA per Art. 28 GDPR with every customer.

    Siehe auch
02
  • BSI C5

    Cloud security catalogue (Germany)

    Cloud Computing Compliance Controls Catalogue from the German BSI. Standard for secure cloud usage in Germany.

    Siehe auch
  • BYOK

    Bring Your Own Key

    Encryption model where the customer keeps the master keys in their own HSM. The provider cannot decrypt data.

    Siehe auch
03
  • CLOUD Act

    US law on data access

    Clarifying Lawful Overseas Use of Data Act. Allows US authorities access to data processed by US companies – even when servers sit outside the US.

  • Cold start

    Delayed first request

    Time a container or pod needs after start to answer the first request. Eliminated at Agenivo Strato via warm pool (< 80 ms).

    Siehe auch
  • Confidence score

    Confidence in an answer

    Numeric value (0–1) for a model's certainty about its answer. At Agenivo Logica: configurable threshold for human escalation.

03
  • Embedding

    Vector representation of text

    Numeric vector that captures the semantic meaning of a text. Basis for retrieval, similarity search and RAG.

    Siehe auch
  • EU AI Act

    EU regulation on AI

    Regulation (EU) 2024/1689. Risk-based regulatory framework for AI systems in the EU. High-risk systems (Annex III) face strict duties.

    Siehe auch
  • Escalation

    Handover to a human

    Mechanism by which an AI agent passes a conversation to a human – on uncertainty, regulatory obligation or customer request.

03
  • Failover

    Automatic standby switchover

    Switch to a standby region or instance on failure. At Agenivo Strato: multi-region failover in under 30 seconds.

    Siehe auch
  • Faithfulness

    Source fidelity of an answer

    Measures how strictly a generated answer sticks to retrieved sources. Low faithfulness = hallucination.

  • Fine-tuning

    Model adaptation

    Training a base model further on domain-specific data. Agenivo prefers RAG over fine-tuning – faster, cheaper, more reversible.

01
  • GDPR

    EU General Data Protection Regulation

    Regulation (EU) 2016/679. Governs protection of personal data in the EU. Agenivo meets it architecturally via privacy-by-default and data minimisation.

    Siehe auch
03
  • Hallucination

    Fabricated content

    LLM output that sounds plausible but is factually wrong. Reduced by RAG, hybrid reasoning and confidence routing.

    Siehe auch
  • HSM

    Hardware security module

    Hardware-based key module for cryptographic operations. Keys never leave the module. Prerequisite for BYOK.

  • Hybrid reasoning

    Symbol + LLM

    Combination of a rule-based symbol engine and LLM inference. At Agenivo Logica: reduces hallucinations in regulated domains by 64%.

02
  • Inference

    Generating a model answer

    A model's prediction on an input. For LLMs: token-by-token generation. Latency and cost are measured in tokens.

    Siehe auch
  • ISO 27001

    IT security management standard

    International standard for information security management systems (ISMS). Agenivo certified since 2024.

    Siehe auch
01
  • LLM

    Large language model

    Large language model. Statistical model that understands and generates language. Agenivo supports OpenAI, Anthropic, Mistral, Google, xAI and EU providers.

02
  • Multi-tenancy

    Tenant isolation on shared platform

    A single platform instance serves multiple tenants (customers, business units) with full data isolation. At Agenivo: row-level security and tenant-specific encryption.

  • Model card

    Model documentation

    Documentation of a model: training data origin, risk class, accuracy metrics, known limitations. Mandatory under EU AI Act for high-risk systems.

04
  • RAG

    Retrieval-augmented generation

    Technique where an LLM retrieves relevant documents from a knowledge base before answering. Cuts hallucinations, enables source citations.

    Siehe auch
  • Row-level security (RLS)

    Row-level access control

    Database mechanism that filters access to table rows per tenant. Prerequisite for safe multi-tenancy.

  • RPO

    Recovery point objective

    Maximum tolerated data loss in time. At Agenivo Strato: < 5 minutes, usually 0 seconds (synchronous replication).

  • RTO

    Recovery time objective

    Maximum tolerated recovery time after an outage. At Agenivo Strato: < 30 seconds multi-region failover.

02
  • Skill

    AI agent capability

    Defines what an agent can do: booking, lead capture, knowledge lookup, escalation. At Agenivo, configurable per agent.

    Siehe auch
  • Sovereign cloud

    EU-sovereign cloud offering

    Cloud offering hardened against CLOUD Act access and operating under EU law. Examples: T-Systems Sovereign Cloud, IONOS, OVHcloud.

03
  • TISAX

    Trusted Information Security Assessment

    Industry standard for information security in the automotive sector. Prerequisite for supplier relationships with OEMs.

    Siehe auch
  • Token

    Unit of LLM inference

    Sub-word unit in which LLMs process text and inference cost is billed. A German text ≈ 1.3 tokens per word.

  • TSA

    Time stamping authority

    Independent authority that issues cryptographic timestamps. At Agenivo Custos: daily anchoring of the audit hash chain in an independent TSA.

01
  • Vector store

    Database for embeddings

    Specialised database for high-dimensional vectors with similarity search. Basis for RAG. At Agenivo: pgvector with tenant-specific encryption.

02
  • Warm pool

    Pre-heated replicas

    Containers waiting in RAM with the model loaded. Eliminates cold starts. At Agenivo Strato: predictive pre-scaling fills the warm pool.

  • WORM

    Write once read many

    Storage mode where data cannot be modified after writing. Prerequisite for audit-grade logs.

    Siehe auch
Q&A · Datenschutz & Compliance01 → 12
  • Solely in the EU – primarily Frankfurt (Deutsche Telekom, BSI C5) and Amsterdam (Equinix). On request: Germany only or sovereign cloud at T-Systems / IONOS. No processing in the US, not even for indexing or training. Embeddings are computed by an EU-hosted model.

Stay in the loop

New whitepapers, releases and roadmap updates – without spam

Once a month, a curated summary: what shifted in the roadmap, which releases matter, which whitepapers are new. No marketing, no trackers, no third parties.

What you get

  • Monthly roadmap overview
  • Release notes, compressed
  • New whitepapers and studies
  • Early access to webinars