DEV Community

James Lee
James Lee

Posted on

Environment Classification & Architecture: How to Design Your Offline Environment Strategy

One of the most overlooked aspects of DevOps is environment design. Teams often end up with either too many environments (expensive, hard to maintain) or too few (everyone steps on each other).

This guide walks through a practical three-tier offline environment strategy, and how it maps to a complete environment chain from local development to production.


1. The Full Environment Chain

Local (developer machine)
     │
     ▼
Dev/Debug Environment      ← developer daily use
     │  (depends on)
     ▼
Integration Test (SIT)     ← central node; closest to production
     │
     ▼
UAT Environment            ← business acceptance
     │
     ▼
Pre-Production             ← final gate; production mirror
     │
     ▼
Production
Enter fullscreen mode Exit fullscreen mode

Each environment serves a distinct purpose. The key design principle:

Build each environment to the minimum required for its purpose — no more, no less.


2. The Three Offline Environment Tiers

Tier 1: Integration Test Environment (SIT)

Primary users: QA / Test Engineers

Purpose: The central hub of all offline environments. This is where full functional, regression, and business flow validation happens before any release goes to UAT or production.

Design Principles:
├── Maximum parity with production
│     ├── Same application versions
│     ├── Same middleware configuration
│     └── Same infrastructure topology (smaller scale)
├── Stability is non-negotiable
│     ├── Strict release standards — no ad-hoc deployments
│     ├── Change freeze during active test cycles
│     └── All changes go through formal release process
└── Acts as dependency provider
      ├── Dev/Debug environment depends on SIT for shared services
      └── Project environments depend on SIT for non-project services
Enter fullscreen mode Exit fullscreen mode

What runs here:

Component Requirement
All business applications ✅ Full set, same version as production
Databases ✅ Independent instance, production-like schema
Message queues ✅ Independent instance
Cache (Redis) ✅ Independent instance
Third-party service mocks ✅ Stubbed where real services unavailable
Cluster scale Reduced (e.g. 1–2 replicas vs production's N)

Why stability matters so much:

Unstable SIT environment:
├── Test results become unreliable
├── Testers can't distinguish: app bug or env issue?
├── Regression cycles get longer
└── Release confidence drops → more production incidents
Enter fullscreen mode Exit fullscreen mode

Tier 2: Dev/Debug Environment

Primary users: Developers

Purpose: Daily development, local integration, and feature verification. Built on the minimum viable principle — only deploy what's being actively changed.

Design Principles:
├── Minimum footprint
│     ├── Only deploy services under active development
│     └── No redundant replicas
├── Resources are NOT reclaimed between iterations
│     └── Persistent environment; no lifecycle management needed
└── Dependencies resolved via SIT
      └── If a service isn't deployed here, route to SIT
Enter fullscreen mode Exit fullscreen mode

Dependency routing strategy:

Developer's service (deployed in Dev env)
     │
     ├── Calls Service A (also in Dev env) ──▶ Dev env
     │
     └── Calls Service B (NOT in Dev env)  ──▶ SIT env
Enter fullscreen mode Exit fullscreen mode

This avoids the cost of duplicating the entire service mesh in every dev environment, while still giving developers a realistic integration context.

What runs here:

Component Requirement
Services under development ✅ Latest dev branch
Shared infrastructure (DB, MQ, Cache) Shared with or pointed at SIT
Full application stack ❌ Not required

Tier 3: Project Environment

Primary users: Developers + QA (collaborative, multi-team)

Purpose: Supports large, multi-team project initiatives where multiple services change simultaneously and need to be validated together in isolation from the main SIT environment.

Design Principles:
├── Minimum footprint (same as Dev env)
│     └── Only deploy services with project-scope changes
├── Has a defined lifecycle
│     ├── Created when project kicks off
│     └── Resources reclaimed when project ships
├── Multiple project envs can coexist
│     └── One per active project (e.g. project-a, project-b)
└── Dependencies resolved via SIT
      └── Non-project services route to SIT
Enter fullscreen mode Exit fullscreen mode

Lifecycle management:

Project kickoff
     │
     ▼
Environment provisioned (namespace / VM / resources allocated)
     │
     ▼
Development + integration + QA pre-validation
     │
     ▼
Project ships to SIT → UAT → Production
     │
     ▼
Environment decommissioned → resources reclaimed
Enter fullscreen mode Exit fullscreen mode

Without lifecycle enforcement, project environments accumulate indefinitely — wasting compute, creating confusion about which env is "current," and making infra costs unpredictable.

What runs here:

Component Requirement
Services changed by this project ✅ Project branch
Services NOT changed by this project ❌ Route to SIT instead
Independent DB / MQ / Cache Optional — depends on project scope

3. Dependency Routing Architecture

The key insight behind this three-tier design is selective dependency routing:

┌─────────────────────────────────────────────────────────────┐
│  Project Environment                                        │
│  ┌──────────┐  ┌──────────┐                                │
│  │Service A │  │Service B │  (project-scoped changes)      │
│  └────┬─────┘  └────┬─────┘                                │
│       │              │                                      │
│       │ calls C      │ calls D (not in project env)        │
└───────┼──────────────┼──────────────────────────────────────┘
        │              │
        ▼              ▼
┌─────────────────────────────────────────────────────────────┐
│  Integration Test (SIT) Environment                         │
│  ┌──────────┐  ┌──────────┐  ┌──────────┐  ┌──────────┐  │
│  │Service C │  │Service D │  │Service E │  │Service F │  │
│  └──────────┘  └──────────┘  └──────────┘  └──────────┘  │
└─────────────────────────────────────────────────────────────┘
Enter fullscreen mode Exit fullscreen mode

This pattern means:

  • Project and Dev environments stay lightweight
  • SIT remains the single source of truth for shared services
  • No need to replicate the entire service mesh per team

4. Environment Comparison

Dimension SIT Dev/Debug Project
Primary users QA Developers Dev + QA
Build scope Full stack Changed services only Project services only
Stability requirement High Low Medium
Resource lifecycle Persistent Persistent Project-scoped
Change control Strict Flexible Moderate
Dependency source Self-contained SIT fallback SIT fallback
Parallel instances 1 1 per team 1 per project

5. Resource Sizing Guidelines

Production:
├── Full cluster (N replicas per service)
├── Multi-AZ / multi-region
└── Full observability stack

Pre-Production:
├── Production mirror (same topology, smaller data)
└── Used for final performance / load testing

SIT:
├── 1–2 replicas per service
├── Single-AZ
└── Full service coverage, reduced scale

Dev/Debug:
├── 1 replica per changed service
├── Shared infra (DB/MQ/Cache) pointed at SIT
└── No HA required

Project:
├── 1 replica per project-scoped service
├── Shared infra pointed at SIT
└── Auto-decommission after project ships
Enter fullscreen mode Exit fullscreen mode

6. Kubernetes Namespace Mapping

In a Kubernetes-based infrastructure, each environment tier maps naturally to namespaces:

Cluster
├── namespace: production
├── namespace: pre-production
├── namespace: sit
├── namespace: dev-team-a          ← Dev/Debug env per team
├── namespace: dev-team-b
├── namespace: project-checkout    ← Project env (lifecycle-managed)
├── namespace: project-payments
└── namespace: project-notifications
Enter fullscreen mode Exit fullscreen mode

Namespace-level controls:

# Resource quota per project namespace
apiVersion: v1
kind: ResourceQuota
metadata:
  name: project-quota
  namespace: project-checkout
spec:
  hard:
    requests.cpu: "4"
    requests.memory: 8Gi
    limits.cpu: "8"
    limits.memory: 16Gi
    pods: "20"
Enter fullscreen mode Exit fullscreen mode

Automatic namespace cleanup (project lifecycle):

# CI/CD pipeline: decommission project env after merge to SIT
kubectl delete namespace project-checkout
# or use TTL-based controllers (e.g. Janitor / kube-janitor)
Enter fullscreen mode Exit fullscreen mode

Cross-namespace service routing (SIT fallback):

# ExternalName service: route to SIT when service not deployed locally
apiVersion: v1
kind: Service
metadata:
  name: payment-service
  namespace: project-checkout
spec:
  type: ExternalName
  externalName: payment-service.sit.svc.cluster.local
Enter fullscreen mode Exit fullscreen mode

This means project-checkout's pods call payment-service normally — but the request is transparently routed to the SIT namespace if no local deployment exists.

Top comments (0)