Site icon Digital Thought Disruption

VCF 9.0 GA Mental Model Part 5: Topology Patterns for Single Site, Two Sites, and Multi-Region

TL;DR

If you want architects, operators, and leadership aligned, you need a topology mental model that starts with VCF objects and only then maps to your physical sites.

Architecture Diagram

Table of Contents

Scenario

You are about to deploy VCF 9.0 GA greenfield and you need a shared language for:

Scope and version alignment

This post assumes:

Version compatibility matrix

Use this as your “what are we talking about” anchor in architecture reviews and CAB meetings.

ComponentVersionBuild
SDDC Manager9.0.0.024703748
vCenter9.0.0.024755230
ESX9.0.0.024755229
NSX9.0.0.024733065
VCF Operations9.0.0.024695812
VCF Automation9.0.0.024701403
VCF Identity Broker9.0.0.024695128
VCF Installer9.0.1.024962180

Core concepts: mapping physical topology to fleets and instances

The physical words that matter

You will hear these terms used casually. Align them to your constraints:

The VCF objects you should use in every design discussion

Practical rule:

Decision criteria you should agree on up front

These are design-time decisions that are expensive to reverse later.

Design-time decision criteria

Day-2 reality check questions

Challenge: choose your deployment posture

You need a deployment posture that matches your physical topology without creating a governance model you cannot operate.

Solution A: Single site

This is your default starting posture unless you have a clear availability driver.

What it looks like

Design intent

Operational implications

Solution B: Two sites in one region

This is the “site resilience” posture. It usually assumes stretched constructs and tighter network constraints.

What it looks like

Design intent

Hard constraints you must respect

Operational implications

Solution C: Multi-region

This is the “DR-oriented operating model” posture. You typically deploy multiple instances, each aligned to a region.

What it looks like

Design intent

Operational implications

Architecture tradeoff matrix

Use this in design boards to stop circular debates.

AttributeSingle siteTwo sites in one regionMulti-region
Primary goalSimplicitySite resilienceRegion separation and DR posture
Typical instance count112+
Network complexityLowHighMedium to high
Latency sensitivityModerateHighMedium
Fleet service dependencyLocalLocal but stretchedCross-region dependency
Operational overheadLowHighHigh
Cost driversHost count, storageStretched fabric, witness, failover capacityDuplicate capacity, replication, bandwidth

Cost model snapshot

This is not pricing. It is what actually moves your bill of materials.

One private cloud vs multiple fleets

Treat “private cloud” as your organizational wrapper. VCF objects start at fleet.

When one fleet is enough

Choose one fleet when:

When you should operate multiple fleets

Choose multiple fleets when you need:

Practical framing:

Identity and SSO boundary patterns

Identity is not a “later” decision. It is a boundary decision.

Challenge: unify access or isolate tenants

You need a model that matches your org and compliance posture.

Solution A: Fleet-wide SSO

Use this when:

Operational reality:

Solution B: Cross-instance SSO

Use this when:

Solution C: Single instance SSO boundaries

Use this when:

Embedded vs appliance identity broker

Treat this as a scaling and availability decision:

Failure domain analysis

This is where topology turns into real operational outcomes.

Failure domains you should model

FailureWhat breaksWhat keeps runningTypical owner response
Fleet services unavailableCentral observability, centralized automation, fleet management workflows, and optionally SSO experienceExisting workloads and instance-level management planes continue to runPlatform team restores fleet services, validates integrations
Instance management domain impairedDomain lifecycle actions, some instance operationsWorkloads may still run, but you lose safe lifecycle control and may lose some vCenter or NSX management functions depending on the failureVI admin + platform team coordinate recovery
Workload domain impairedWorkloads in that domainOther domains and instances continueVI admin + app teams execute workload recovery runbooks

Practical RTO and RPO examples you can use as starting targets

These are real-world starting points, not vendor commitments:

If you cannot state these targets, you should at least agree on the priority order:

Day-0, day-1, day-2 map by topology

Day-0: decisions you should lock

These apply to all topologies:

Day-1: bring-up sequence that fits the object model

A typical greenfield flow looks like:

Day-2: operations you should operationalize early

Who owns what

Use this to prevent “everyone owns it, so no one owns it.”

CapabilityPlatform teamVI adminApp and platform teams
Fleet services lifecycleOwnConsultInformed
VCF Operations configuration and alertsOwnConsultInformed
VCF Automation provider setupOwnConsultInformed
Identity broker and SSO modelOwnConsultInformed
Instance bring-up and healthOwnOwnInformed
SDDC Manager operationsConsultOwnInformed
vCenter and NSX in management domainConsultOwnInformed
Workload domain creation and lifecycleConsultOwnInformed
Workload provisioning via automationOwn the platformConsultOwn consumption
Application deployment and runtimeInformedConsultOwn

Operational runbook snapshot

Keep this as a living page in your ops wiki.

Weekly

Monthly

Quarterly

Validation checklist

Use UI and workflow validation before you declare success:

Troubleshooting workflow

When something breaks, troubleshoot by boundary.

Anti-patterns

Avoid these early and you remove a lot of future toil.

Best practices

Summary and takeaways

Conclusion

VCF 9.0 GA becomes easier to design and operate when you treat fleet, instance, and domain as explicit boundaries and then map them to site and region realities. Pick the simplest topology that meets your availability and isolation goals, and invest early in day-2 practices for identity, backups, lifecycle, and change control.

Sources

VMware Cloud Foundation 9.0 and later Documentation: https://techdocs.broadcom.com/us/en/vmware-cis/vcf/vcf-9-0-and-later/9-0.html

Exit mobile version