Enterprise AI is no longer a sandbox—it's a battlefield where the stakes are measured in latency, compliance, and return on investment.

A recent survey of more than 2,050 senior executives in major enterprises revealed that 95 % plan to build their own AI and data platforms within the next 780 working days. Yet only 13 % have actually achieved that milestone, and the ones that have report almost five times the ROI of firms still struggling to operationalize AI.

The gap isn’t in the quality of the models; it’s in the infrastructure strategy. Successful organizations adopt a sovereign‑by‑design approach, spreading workloads across multiple clouds and on‑premises environments instead of locking into a single hyperscaler. They tailor AI to their own business, regulatory, and operational needs rather than forcing those needs to fit a vendor’s architecture.

Training and inference are distinct beasts. Training is a discrete, compute‑dense event that prioritizes GPU availability. Inference, by contrast, is an ongoing business process that can involve millions of calls per day—for fraud checks, claim reviews, customer interactions, medical recommendations, sanctions checks, or predictive maintenance. Inference workloads demand low latency, governance, reliability, and cost control, and they must run where the business data lives and where compliance rules can be enforced.

In heavily regulated sectors—financial services, healthcare, telecommunications, energy, and the public sector—data sovereignty, audit obligations, and security mandates dictate the exact location of inference workloads. The challenge, therefore, is larger than AI itself: it’s about creating an operating model that unites compute, data, and governance without sacrificing flexibility.

Neoclouds have emerged as a critical bridge. Unlike traditional hyperscalers, neoclouds are purpose‑built around AI infrastructure, focusing almost entirely on renting high‑end GPUs for AI work and on optimizing GPU access, AI performance, and flexible consumption models. For many enterprises, neoclouds provide access to the latest accelerator technologies while allowing workloads to scale without the complexity of large cloud environments.

However, neoclouds address only one part of the equation. AI models need context—customer records, transaction histories, operational workflows, policy documents, supply‑chain information, and enterprise knowledge. Moving these assets into separate AI environments creates duplication, latency, and governance challenges. The emerging consensus is that the future of AI architecture depends on bringing models closer to data rather than moving data closer to models.

PostgreSQL has become the natural foundation for this approach. It already serves as the operational backbone for many critical applications, combining transactional reliability, extensibility, and scalability with an open‑source model that enterprises increasingly demand. More than 70 % of AI‑related application development occurs on Postgres.

Postgres can act as a governed memory layer for AI systems, integrating operational data, application context, permissions, observability, and retrieval capabilities into a single architecture. This consolidation reduces the need for separate infrastructures for transactional systems, vector stores, AI memory layers, and governance frameworks.

Enterprises that need to keep data and intellectual property within designated environments—banks protecting financial data, hospitals safeguarding patient records, governments ensuring national data remains under jurisdictional control—are turning to sovereign AI architectures that operate across clouds, private infrastructure, and on‑premises environments.

Enter EDB Postgres AI. The platform connects operational Postgres, AI capabilities, and hybrid infrastructure management into a unified solution. It enables organizations to deploy AI where their data already resides, providing inference close to operational data, reducing data movement, improving performance, and strengthening compliance posture. According to Nancy Hensley, CPO of EDB, “By enabling inference close to operational data, organizations reduce data movement, improve performance, and strengthen their compliance posture.”

The new enterprise AI stack, as described by industry analysts, consists of five layers: 1. Neoclouds – specialized AI compute and GPU infrastructure. 2. Public hyperscalers – broad cloud services and global reach. 3. Postgres – operational data foundation. 4. EDB Postgres AI – sovereign AI and hybrid management layer. 5. Enterprise governance – security, compliance, and policy controls.

Together, these layers support the full AI lifecycle—from experimentation and model training through production inference and continuous optimization.

CIOs are increasingly prioritising multi‑cloud and hybrid strategies, sovereign architectures, and open operational foundations. The focus is on operationalising AI while maintaining control over cost, governance, and risk. The next era of enterprise AI will reward those who can keep intelligence close to where data already lives, governed, trusted, and ready to act.

At present, the industry is still in the process of adopting these models. No major public announcements of new product launches or regulatory actions have been reported, but the trend toward sovereign AI and neocloud‑based inference is clear. Companies that have already integrated Postgres and EDB Postgres AI report measurable improvements in inference latency and compliance audit readiness, while others are evaluating the cost and complexity of moving to a sovereign‑by‑design architecture.

The industry will continue to monitor how these emerging stacks perform at scale, how they interact with existing hyperscaler services, and how regulatory bodies respond to the growing need for data sovereignty in AI deployment.