How One Fintech's Engineering Team Rebuilt Their Entire Data Pipeline Around Confidential Computing Enclaves After a Third-Party AI Vendor Breach Shattered Every Trust Assumption They'd Ever Made

In early 2026, a mid-size payments fintech called Arcveil Financial (a composite case study based on real architectural patterns observed across the industry, with identifying details abstracted) received a notification that no engineering team ever wants to see: their primary third-party AI inference vendor had suffered a breach. Customer transaction embeddings, behavioral scoring vectors, and partial PII used for fraud detection had been exposed inside a shared, multi-tenant inference environment. The breach was not a failure of Arcveil's own systems. It was a failure of a trust assumption so deeply embedded in their vendor contracts that most of the engineering team hadn't even known it existed.

This is the story of what they found, what they tore down, and how they rebuilt. It is also a warning for every fintech engineering team still running AI workloads on borrowed trust.

The Breach: What Actually Happened

Arcveil had been routing live transaction data through a third-party AI inference API since late 2024. The vendor, a well-funded AI platform company, offered low-latency fraud scoring models on a shared infrastructure model. Arcveil's data was encrypted in transit and at rest. The vendor's SOC 2 Type II report was clean. The contract included a data processing addendum that explicitly prohibited cross-tenant data access.

What the contract did not address, and what no one on Arcveil's legal or engineering team had thought to ask, was what happened to data while it was being processed, inside the model's inference runtime, in plaintext, in memory, on hardware shared with dozens of other tenants.

The attacker exploited a hypervisor-level vulnerability in the vendor's cloud infrastructure. They were able to read in-memory inference buffers across tenant boundaries. The data exposed was not raw transaction records but inference inputs: tokenized sequences, numerical feature vectors, and in some cases reconstructed partial account identifiers that had been embedded as model context.

The regulatory fallout was swift. Arcveil received inquiries from two state financial regulators within three weeks. Their CISO spent the better part of a quarter in documentation reviews. But the deeper crisis was architectural. Once the engineering team understood the mechanics of what had happened, they realized the problem was not one bad vendor. It was a systemic blind spot baked into how they had designed every AI-dependent workflow.

The Hidden Trust Assumption in Every Multi-Tenant Inference Contract

To understand what Arcveil's team discovered, you need to understand a concept that most software engineers encounter only at the edges of their security training: the trusted computing base, or TCB.

When you send data to a cloud-hosted AI inference endpoint, you are implicitly trusting a stack that includes the model runtime, the container orchestration layer, the hypervisor, the physical host hardware, and the cloud provider's management plane. Your vendor's contract governs their intent to protect your data. It does not govern the technical reality of what that data is exposed to while it is being processed.

This is the hidden trust assumption: encryption protects data at rest and in transit, but not data in use. The moment your data enters an inference pipeline, it is decrypted into memory so the model can operate on it. In a multi-tenant environment, that memory lives on hardware that is, by design, shared and managed by a party other than you.

Arcveil's engineering lead, reviewing their vendor contracts after the breach, found that every single inference agreement they had signed contained some version of the same clause: the vendor would implement "industry-standard security controls" for data protection. Not one contract specified any technical mechanism for protecting data in use. Not one mentioned trusted execution environments, memory encryption, or attestation. The trust was entirely contractual, not cryptographic.

The Architecture Audit: Mapping Every Trust Boundary

Before rebuilding anything, Arcveil's team spent six weeks conducting what their principal engineer called a "trust boundary autopsy." The goal was to map every point in their data pipeline where sensitive data was decrypted and processed by a system they did not fully control.

The findings were sobering. Their audit identified five distinct categories of exposure:

  • Third-party inference APIs: Transaction feature vectors sent to external fraud scoring models, processed in shared multi-tenant runtimes.
  • Embedded analytics SDKs: Behavioral data collected client-side and processed server-side by vendor infrastructure before being returned as aggregated signals.
  • Managed data warehouse integrations: ETL pipelines that temporarily decrypted records inside vendor-managed transformation environments for schema normalization.
  • LLM-assisted compliance tooling: A newer workflow where compliance officers used a third-party LLM interface to summarize transaction narratives. The prompts contained account context.
  • Model fine-tuning pipelines: A batch process that periodically sent anonymized transaction samples to a vendor for model retraining. The anonymization was weaker than assumed.

Each of these workflows had passed security review. Each had vendor contracts with appropriate data protection language. None of them had any technical guarantee that the data would remain confidential during processing. The trust was entirely based on vendor assurances and contractual obligations, both of which are only as good as the vendor's own security posture.

The Decision: Rebuild Around Confidential Computing

The engineering team evaluated three broad remediation paths. The first was to simply move all AI workloads in-house, eliminating third-party processing entirely. The second was to implement aggressive data minimization and anonymization before any data left their perimeter. The third was to adopt confidential computing as a foundational architectural principle.

They chose the third path, with elements of the second woven in. Here is why.

Bringing all AI workloads in-house was economically and operationally impractical. The models they relied on for fraud detection were sophisticated, continuously updated, and would have required a dedicated ML platform team to replicate. Data minimization and anonymization helped reduce exposure but could not eliminate it; sufficiently detailed behavioral features can be re-identified, and the fraud detection use case specifically requires rich, contextual data to be effective.

Confidential computing, by contrast, offered something neither of the other approaches could: a cryptographic guarantee that data remains protected even while being processed, enforced at the hardware level, independent of the vendor's operational security posture.

What Confidential Computing Actually Means in Practice

For engineers who have not worked with confidential computing before, a brief grounding is useful here.

Confidential computing refers to the use of Trusted Execution Environments (TEEs), hardware-isolated regions of a processor where code and data are encrypted and protected from access by the host operating system, hypervisor, or any other process, including the cloud provider's own management software. The two dominant TEE implementations in production use as of 2026 are Intel TDX (Trust Domain Extensions) and AMD SEV-SNP (Secure Encrypted Virtualization with Secure Nested Paging), both of which operate at the virtual machine level and are available on all three major cloud platforms.

The key properties TEEs provide are:

  • Memory encryption: Data in the TEE's memory is encrypted with keys that are generated and managed by the processor itself, inaccessible to the host or hypervisor.
  • Attestation: Before sending data into a TEE, the client can cryptographically verify that the correct, unmodified code is running inside the enclave, and that the hardware is genuine and properly configured. This is the property that makes TEEs fundamentally different from contractual trust.
  • Isolation: The TEE is isolated from all other processes on the same physical host, including other tenants in a multi-tenant environment.

In practical terms, this means that if Arcveil sends transaction data into a TEE-hosted inference endpoint, they can verify before sending the data that the correct model code is running, that the hardware is a genuine Intel TDX or AMD SEV-SNP instance, and that even the vendor operating the infrastructure cannot read the data while it is being processed. The trust is no longer contractual. It is cryptographic.

The Rebuild: A New Pipeline Architecture in Four Layers

Arcveil's engineering team spent approximately five months rebuilding their AI-dependent data pipelines around a four-layer confidential computing architecture. Here is how each layer works.

Layer 1: Confidential Ingestion Gateway

All sensitive data entering the AI pipeline now passes through a confidential ingestion gateway running inside a TEE on their own cloud infrastructure. This gateway handles tokenization, feature extraction, and data normalization. The key design decision here was to ensure that raw sensitive data never leaves a TEE-protected environment once it enters the pipeline. Downstream systems receive only encrypted, attested payloads.
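The tokenization step inside the gateway can be sketched as keyed, deterministic tokenization. This is a minimal illustration, not Arcveil's actual implementation: the field names are hypothetical, and the tokenization key, which in a real deployment would be generated inside the TEE and sealed to it, is a fixed value here only to keep the sketch reproducible.

```python
import hashlib
import hmac

# In production this key would be generated inside the TEE and sealed to it;
# a fixed value is used here only to make the sketch self-contained.
TOKENIZATION_KEY = b"enclave-sealed-demo-key"

def tokenize(value: str) -> str:
    """Deterministic keyed tokenization: equal inputs map to equal tokens,
    but the mapping cannot be reversed or recomputed without the key."""
    return hmac.new(TOKENIZATION_KEY, value.encode(), hashlib.sha256).hexdigest()[:16]

def build_feature_payload(txn: dict) -> dict:
    """Replace raw identifiers with tokens before anything leaves the TEE."""
    return {
        "account_token": tokenize(txn["account_id"]),
        "merchant_token": tokenize(txn["merchant_id"]),
        "amount_cents": txn["amount_cents"],  # non-identifying features pass through
        "hour_of_day": txn["hour_of_day"],
    }

payload = build_feature_payload({
    "account_id": "acct-8842-1190",
    "merchant_id": "mrch-0031",
    "amount_cents": 4250,
    "hour_of_day": 14,
})
```

Because the tokenization is deterministic, downstream fraud models can still correlate activity across transactions for the same account without ever seeing the raw identifier.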

Layer 2: Attested Inference Endpoints

For third-party AI inference, Arcveil now exclusively uses vendors who support confidential inference, meaning inference endpoints that run inside TEEs and support remote attestation. Before any batch or real-time inference job runs, their pipeline automatically performs an attestation handshake: it verifies the enclave's identity, confirms the model version hash matches what was agreed upon, and only then releases the encrypted inference payload. If attestation fails for any reason, the job halts and alerts are triggered.
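The handshake described above can be sketched as a fail-closed gate in Python. Everything here is illustrative rather than a real attestation protocol: the AttestationReport structure, the HMAC-based signature, and the pinned model hash are hypothetical stand-ins for a genuine TDX or SEV-SNP quote and its hardware-rooted signing chain.

```python
import hashlib
import hmac
import os
from dataclasses import dataclass

# Hypothetical report shape, standing in for a real TDX/SEV-SNP quote.
@dataclass
class AttestationReport:
    hardware_type: str  # e.g. "intel-tdx" or "amd-sev-snp"
    model_hash: str     # measurement of the model code loaded in the enclave
    nonce: str          # client-supplied freshness value
    signature: bytes    # in reality, signed by a hardware-rooted key

# A shared HMAC key keeps the sketch self-contained; real attestation chains
# up to the CPU vendor's signing infrastructure instead.
ROOT_KEY = b"demo-root-of-trust"
PINNED_MODEL_HASH = hashlib.sha256(b"fraud-model-v3.2").hexdigest()

def sign(report: AttestationReport) -> bytes:
    body = f"{report.hardware_type}|{report.model_hash}|{report.nonce}".encode()
    return hmac.new(ROOT_KEY, body, hashlib.sha256).digest()

def attest_and_release(report: AttestationReport, nonce: str, payload: bytes) -> str:
    """Fail-closed gate: release the encrypted payload only if every check passes."""
    if report.hardware_type not in ("intel-tdx", "amd-sev-snp"):
        raise RuntimeError("unrecognized TEE hardware")
    if report.nonce != nonce:
        raise RuntimeError("stale or replayed attestation report")
    if report.model_hash != PINNED_MODEL_HASH:
        raise RuntimeError("enclave is not running the agreed-upon model version")
    if not hmac.compare_digest(report.signature, sign(report)):
        raise RuntimeError("attestation signature invalid")
    return f"released {len(payload)} bytes to attested endpoint"

# Simulated handshake: the client picks a nonce, the enclave answers with a report.
nonce = os.urandom(16).hex()
report = AttestationReport("amd-sev-snp", PINNED_MODEL_HASH, nonce, b"")
report.signature = sign(report)
result = attest_and_release(report, nonce, b"\x00" * 512)
```

The exception path corresponds to the halt-and-alert behavior: any failed check stops the job before a single byte of sensitive data is released.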

This was the most operationally complex part of the rebuild. As of early 2026, the ecosystem of vendors offering attested inference is growing but not yet universal. Arcveil had to replace two of their five AI vendors entirely, as those vendors had no roadmap for TEE-based inference. The replacements came at a modest cost increase, which the team justified to leadership by quantifying the regulatory and reputational cost of the breach they had just experienced.

Layer 3: Confidential ETL and Transformation

The managed ETL workflows that had previously run in vendor-controlled transformation environments were rebuilt using confidential VMs on their primary cloud provider. Schema normalization, data enrichment, and aggregation jobs now run inside AMD SEV-SNP protected virtual machines. The team implemented a policy that any transformation job touching data classified above a defined sensitivity threshold must run in a confidential VM, enforced through their infrastructure-as-code pipeline with automated compliance checks at deployment time.
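A deployment-time check of this kind can be sketched as a simple policy function. The field names, sensitivity scale, and VM type labels below are hypothetical; a real implementation would more likely live in an infrastructure-as-code policy engine or CI gate, but the logic is the same.

```python
# Hypothetical deployment-time policy: jobs at or above a sensitivity threshold
# must target a confidential VM type. Labels and scale are illustrative.
CONFIDENTIAL_VM_TYPES = {"amd-sev-snp", "intel-tdx"}
SENSITIVITY_THRESHOLD = 2  # 0=public, 1=internal, 2=confidential, 3=restricted

def validate_job(job: dict) -> list:
    """Return a list of policy violations; an empty list means the job may deploy."""
    violations = []
    if job["data_sensitivity"] >= SENSITIVITY_THRESHOLD:
        if job.get("vm_isolation") not in CONFIDENTIAL_VM_TYPES:
            violations.append(
                f"job '{job['name']}': sensitivity {job['data_sensitivity']} "
                f"requires a confidential VM, got {job.get('vm_isolation')!r}"
            )
    return violations

ok_job = {"name": "schema-normalize", "data_sensitivity": 3, "vm_isolation": "amd-sev-snp"}
bad_job = {"name": "enrich-batch", "data_sensitivity": 2, "vm_isolation": "standard"}
```

Running the check at deployment time, rather than relying on reviewers to remember the rule, is what turns the policy from a convention into an enforced invariant.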

Layer 4: Sealed Audit Logs

Every data access event, attestation result, and inference job is logged to a tamper-evident, sealed audit log that is itself stored inside a TEE-protected storage layer. This was not strictly required for confidential computing to function, but the team added it after their regulatory review highlighted that they could not prove, after the breach, exactly what data had been accessed and when. The sealed audit log closes that gap going forward: every event is cryptographically signed by the enclave that produced it, making the log verifiable and tamper-evident.
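The tamper-evidence property can be sketched as a hash-chained, signed log. This is a simplified illustration: the signing key, which in a real deployment would be held inside the enclave, is a fixed value here, and the class name and event fields are hypothetical.

```python
import hashlib
import hmac
import json

# The signing key would be held inside the enclave; fixed here for the sketch.
ENCLAVE_SIGNING_KEY = b"enclave-held-signing-key"

class SealedAuditLog:
    """Tamper-evident log: each entry is HMAC-signed and chained to its
    predecessor's hash, so editing or deleting any entry breaks verification."""

    def __init__(self):
        self.entries = []
        self._prev_hash = "0" * 64  # genesis value

    def append(self, event: dict) -> None:
        body = json.dumps({"event": event, "prev": self._prev_hash}, sort_keys=True)
        sig = hmac.new(ENCLAVE_SIGNING_KEY, body.encode(), hashlib.sha256).hexdigest()
        self.entries.append({"body": body, "sig": sig})
        self._prev_hash = hashlib.sha256(body.encode()).hexdigest()

    def verify(self) -> bool:
        prev = "0" * 64
        for entry in self.entries:
            expected = hmac.new(ENCLAVE_SIGNING_KEY, entry["body"].encode(),
                                hashlib.sha256).hexdigest()
            if not hmac.compare_digest(entry["sig"], expected):
                return False  # entry was modified after signing
            if json.loads(entry["body"])["prev"] != prev:
                return False  # chain broken: an entry was removed or reordered
            prev = hashlib.sha256(entry["body"].encode()).hexdigest()
        return True

log = SealedAuditLog()
log.append({"type": "attestation", "result": "pass", "job": "fraud-score-batch-7"})
log.append({"type": "inference", "records": 1200})
```

An auditor holding the verification key can replay the chain and confirm that nothing was altered, inserted, or dropped since the enclave wrote it.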

The Vendor Contract Rewrite

In parallel with the technical rebuild, Arcveil's legal and engineering teams collaborated on a new vendor contract framework they now call their Cryptographic Trust Addendum. This document, which they require all AI and data processing vendors to sign, specifies technical requirements rather than intent-based language.

Key clauses include explicit requirements for TEE-based processing with named hardware specifications, mandatory support for remote attestation with defined protocols, model version pinning with cryptographic hashes, and a right-to-audit clause that allows Arcveil to perform attestation checks at any time without prior notice. The addendum also prohibits vendors from running inference workloads on hardware that does not support memory encryption, regardless of other security controls in place.

Their general counsel noted that this addendum initially met resistance from two vendors who argued the technical requirements were "overly prescriptive." Both vendors were replaced. The lesson, as Arcveil's CISO put it: "If a vendor can't meet a technical requirement, that's a data point about their security posture, not a negotiation."

Results: Six Months After the Rebuild

Six months after completing the new architecture, Arcveil's engineering team shared internal metrics that illustrated both the costs and the benefits of the transition.

  • Inference latency: Average inference latency increased by approximately 18 milliseconds per request due to attestation overhead. This was within acceptable bounds for their fraud detection use case, where the existing latency budget had comfortable headroom.
  • Infrastructure cost: Confidential VM instances run at a roughly 15 to 20 percent cost premium over standard instances on their cloud provider. This was partially offset by consolidating from five AI vendors to three, reducing per-request API costs.
  • Vendor footprint: They reduced their third-party AI vendor count from five to three, eliminating vendors who could not meet their new technical standards.
  • Regulatory posture: Both state regulators who had opened inquiries following the breach closed their reviews after Arcveil demonstrated the new architecture. One regulator specifically cited the attestation-based trust model as a "meaningful technical control" in their closure letter.
  • Security incident rate: No data-in-use exposure incidents since the rebuild. The team acknowledges this is partly a function of time, but the architectural surface area for such incidents has been fundamentally reduced.

What Every Fintech Engineering Team Should Take Away

Arcveil's story is not unique. The specific breach mechanism may differ from what your team will encounter, but the underlying vulnerability is universal: every multi-tenant AI inference contract you have signed contains an implicit trust assumption that your vendor's operational security is sufficient to protect your data in memory. That assumption is not cryptographically guaranteed. It is not technically enforced. It is a contractual hope.

The confidential computing ecosystem has matured significantly by 2026. Intel TDX and AMD SEV-SNP are production-ready and available on all major cloud platforms. The Confidential Computing Consortium, now with broad industry membership, has published reference architectures specifically for financial services AI workloads. The tooling for attestation, while still requiring engineering investment, is no longer experimental.

The question for your engineering team is not whether confidential computing is technically feasible. It is whether you are willing to wait for a breach to make the business case for you.

Where to Start: A Practical Checklist

If you are an engineering lead or architect at a fintech reading this and wondering where to begin, here is a prioritized starting point based on Arcveil's experience:

  • Audit your trust boundaries first. Map every point in your pipeline where sensitive data is decrypted and processed by a system you do not fully control. This alone will surface risks you did not know existed.
  • Review your vendor contracts for data-in-use language. If your contracts only address encryption at rest and in transit, you have a gap. Most contracts do.
  • Identify your highest-risk AI workloads. Not every workload needs TEE protection immediately. Prioritize workloads that process raw PII, behavioral data, or financial features in third-party environments.
  • Evaluate your vendors' TEE roadmaps. Ask your current AI vendors directly whether they support or plan to support attested inference. Their answer will tell you a great deal about their security maturity.
  • Run a proof of concept on one workload. Pick a single inference pipeline, rebuild it with attestation, and measure the latency and cost impact. The numbers are almost always more manageable than teams expect.
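For the proof-of-concept step, the latency measurement can be as simple as timing the same payload through both paths. The sketch below uses stand-in functions (a sleep in place of a real attestation handshake) purely to show the measurement harness; the functions and payload shape are hypothetical.

```python
import statistics
import time

def timed_ms(fn, *args):
    """Run fn and return (result, elapsed milliseconds)."""
    start = time.perf_counter()
    result = fn(*args)
    return result, (time.perf_counter() - start) * 1000

def plain_inference(payload):
    return f"scored {len(payload)} features"  # stand-in for a real inference call

def attested_inference(payload):
    time.sleep(0.001)  # stand-in for the attestation handshake round trip
    return plain_inference(payload)

def measure(fn, payload, runs=50):
    """Median latency in ms over several runs, to dampen scheduler noise."""
    return statistics.median(timed_ms(fn, payload)[1] for _ in range(runs))

features = [0.1] * 64
baseline = measure(plain_inference, features)
with_attestation = measure(attested_inference, features)
overhead_ms = with_attestation - baseline
```

Reporting the overhead as a concrete number against the workload's existing latency budget, as Arcveil did with their 18 ms figure, is usually what moves the business conversation forward.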

Conclusion: Trust Is Not a Contract Clause

The most important thing Arcveil's engineering team learned from their breach was not a technical lesson. It was a philosophical one. They had confused legal accountability with technical security. Their vendors were accountable, in a legal sense, for protecting their data. But accountability after a breach is not the same as prevention before one.

Confidential computing does not eliminate trust entirely. You still have to trust the hardware manufacturer's implementation of the TEE, and you still have to trust the attestation infrastructure. But it moves the trust to a layer that is cryptographically verifiable and hardware-enforced, rather than a layer that is contractually promised and operationally maintained by a party whose security posture you cannot directly observe.

In a world where AI inference is embedded in every critical financial workflow, and where the breach surface of any given fintech now extends across five, ten, or twenty third-party AI vendors, that distinction is not academic. It is the difference between a trust model that holds under pressure and one that collapses the moment a hypervisor vulnerability lands on the wrong host.

Arcveil rebuilt. The question is whether your team will do it proactively, or reactively.