AI inference that never leaves your institution.

Xinity runs open, OpenAI-compatible language models entirely on your own infrastructure. No data sent to cloud providers, no third-party access, no regulatory grey zone. Inference under your sole control.

Start 30-day trial See the platform

The situation

Cloud APIs are not a neutral tool for banks.

Every call to an external model API leaves your system boundary. For a regulated institution that means ICT third-party dependency, a potential data-residency problem, and an attack surface outside your control. Sovereign AI inverts the logic: the model comes to your data, not the other way around.

Compliance

How on-premise supports concrete obligations.

DORA

Digital Operational Resilience Act

DORA (Regulation (EU) 2022/2554) has applied since 17 January 2025 and requires management of ICT third-party risk, a documented exit strategy, and limits on concentration risk. Inference that runs entirely inside your own infrastructure eliminates the critical dependency on an external cloud provider and makes an exit strategy trivial: there is no provider to disentangle from.

EU AI Act

High-risk obligations from 2 August 2026

Annex III point 5(b) of the AI Act (Regulation (EU) 2024/1689) classifies systems that evaluate the creditworthiness of natural persons as high-risk. From 2 August 2026 these carry obligations on logging, data governance, human oversight, and post-market monitoring. Running inference in-house keeps you in full control of the logs and processing those obligations depend on, with no data passing through a third party.

MiFID II

Record-keeping and retention

Article 16 MiFID II and its technical standards require traceable recording and tamper-resistant retention of relevant communication and decisions. When model processing happens in-house, you keep full control over logging, retention periods, and auditability, which makes meeting these obligations easier, with no data passing through a third party.

FINMA

Outsourcing and data residency (CH)

FINMA circular 2018/3 sets clear requirements for outsourcing and for keeping outsourced functions controllable. On-premise operation keeps data and processing inside your system boundary and your jurisdiction, so the question of data residency and third-party access never arises.

The architecture is the proof

Data and model stay inside your infrastructure.

Xinity runs on your hardware, on your network, behind your firewall. There is no outbound data path to a model provider that you would have to secure or cover by contract. Apache 2.0 open source also means full auditability: your security team can read every line.

Let us talk about a pilot inside your institution.

Book a demo