errorbudget

Posted on Jun 21 • Originally published at errorbudget.io

vSAN for Mixed Workloads: Policy Design and the OSA-to-ESA Transition

#infrastructure #devops #storage #vmware

💡 What is vSAN OSA & ESA?
VMware vSAN Original Storage Architecture (OSA) is the legacy disk-group-based architecture designed for mixed magnetic/SSD environments. The newer Express Storage Architecture (ESA) is optimized entirely for high-performance NVMe drives, eliminating disk groups in favor of a single Storage Pool.

Enterprise architecture often requires running highly divergent workloads on shared physical infrastructure. Balancing the deterministic latency needed for transactional processing with the massive sequential throughput demanded by AI training profiles represents a classic infrastructure tightrope.

When the underlying platform relies on VMware vSAN, achieving this balance comes down to granular Storage Policy-Based Management (SPBM).

📊 The Sizing Reality: Workload Profiles Compared

Designing storage policies without analyzing telemetry data under peak load is a recipe for silent performance degradation. Transactional banking cores require minimal write amplification and predictable sub-millisecond read times. Conversely, AI clusters executing large-batch training loops care about raw block delivery and streaming data pipes.

Below is the operational baseline measured across configurations during peak cluster strain:

Workload Type	IOPS Range (Peak)	Latency Profile	Focus Metric
Core Transactional	25K - 40K (80K peak)	0.8ms - 1.2ms	P99 Read Latency
AI Workload Training	80K - 120K random	3 - 5 GB/s seq	Throughput Density
AI Model Inference	Steady demand stream	0.4ms - 0.7ms	Deterministic P50

💡 Calculate vSAN ESA Capacity Instantly: > Storage overhead can quickly break infrastructure budgets if left unmodeled. Instead of wrestling with complex sizing spreadsheets for your new NVMe pools, you can instantly model your RAID-5 overhead and FTT constraints using this live, browser-based vSAN ESA Capacity Calculator. It runs completely offline and requires zero signup.

⚙️ The Three-Policy Architecture Matrix

To survive rigorous third-party infrastructure audits while maintaining high availability, relying on a default "Cluster-Wide" policy is insufficient. The architecture must partition workloads logically via explicit SPBM profiles.

1. The Transactional Core Policy

Architecture: vSAN OSA (Legacy stability path)
Structure: RAID-1 (Mirroring), Failure to Tolerate (FTT) = 2
Rationale: For system-of-record databases, write amplification must be kept to an absolute minimum. Dual-mirroring guarantees availability even during concurrent dual-drive failures within a legacy disk group. Space efficiency is sacrificed for predictable sub-millisecond P99 latency.

2. The AI Training Pipeline Policy

Architecture: vSAN ESA (Next-gen NVMe pool)
Structure: RAID-5 (Erasure Coding), FTT = 1
Rationale: Legacy vSAN OSA suffered a severe write penalty when executing RAID-5. Thanks to the ESA Log-Structured File Service, RAID-5 erasure coding runs at near-mirroring speeds. This allows massive datasets to sit on cost-efficient 4+1 structures without bottlenecking the GPU ingestion path.

3. The AI Inference Service Policy

Architecture: vSAN ESA
Structure: RAID-1 (Mirroring), FTT = 1
Rationale: Real-time risk modeling and fraud scoring require deterministic response lines. RAID-1 on ESA eliminates parity calculation overhead entirely, utilizing pure NVMe speed to feed inference endpoints under tight SLAs.

🔒 Encryption Scope: Cluster vs. Policy Level

Data protection requirements in regulated fields often mandate full encryption of sensitive datasets at rest. However, implementation details differ wildly between the two architectures:

vSAN OSA Encryption: Operates strictly at the cluster level. If you turn on encryption, every single host and drive in that cluster is encrypted using the same external Key Management Server (KMS). This introduces uniform CPU overhead across all workloads.
vSAN ESA Encryption: Introduces policy-driven encryption. Data can be encrypted selectively by attaching the control to specific SPBM rules. Non-sensitive training files can skip the crypto-wrapper, saving precious CPU cycles, while production transactional pipelines remain fully wrapped and compliant.

🎯 Architectural Lessons from the Field

Avoid Spot-Fixing Thresholds: Setting generic thresholds on mixed clusters leads to alert fatigue. Always context-map your metrics based on the active storage profile.
Account for Replication Lag: During high-volume transactional spikes, background data replication can saturate backplane links. Ensure your network topology explicitly isolates vSAN traffic from telemetry and client data lines.
Validate Your Blast Radius: Erasure coding saves space, but a single host failure in a tight 4-node ESA cluster triggers high-utilization rebuild loops. Design clusters with a minimum $N+1$ node overhead allowance.

❓ Frequently Asked Questions

Is vSAN ESA worth the hardware investment for smaller clusters?

Yes, provided the hardware stack utilizes certified all-NVMe drives. ESA eliminates the concept of dedicated cache drives, turning every installed drive into a contributing member of both the capacity and performance pools.

How does RAID-5 performance on ESA compare to legacy OSA?

Legacy OSA required a multi-step write-modify-write cycle for parity, introducing heavy latency. ESA writes data sequentially into a log append log before structuring it into parity, offering RAID-1 speeds at RAID-5 capacity savings.

Does policy-level encryption satisfy standard regulatory frameworks?

Yes. Because the encryption occurs before data hits the physical block layer and keys are managed via industry-standard external KMS protocols, it satisfies strict data isolation and segmentation validation criteria.

DEV Community