DEV Community

Cover image for Profiling a Large-Scale 4X Strategy Game: FPS, Memory, GPU and Long-Session Performance Analysis
GameOptim
GameOptim

Posted on

Profiling a Large-Scale 4X Strategy Game: FPS, Memory, GPU and Long-Session Performance Analysis

In the world of 4X strategy games, performance optimization is rarely about a single bottleneck.

Massive battles, persistent world simulations, complex UI systems, large maps, and long play sessions create a combination of CPU, memory, rendering, networking, and power-management challenges that continue to accumulate over time. Unlike many action titles, performance issues in 4X games often emerge gradually as session length, game state complexity, and resource usage increase.

To explore how these challenges manifest in practice, we analyzed Total Battle, a long-running mobile strategy title that has remained active in a highly competitive SLG market for nearly a decade. This article demonstrates how GameOptim Gears can be used to profile a complex 4X game, identify key bottlenecks, and uncover potential optimization opportunities.

Test Environment

To better represent real-world player conditions, testing was performed on a Realme 8, a mid-range Android device.
All data was collected using GameOptim Gears, which captures performance metrics without modifying game code or affecting normal gameplay behavior.

The analysis covers:

  • Frame rate and frame pacing
  • Memory consumption
  • GPU workload
  • Network activity
  • Power consumption

The goal is not simply to monitor performance, but to connect symptoms with potential root causes and optimization priorities.

1. Frame Rate Performance

FPS remains the most visible performance metric from a player's perspective.

During testing, the game generally maintained a target of around 60 FPS, although frame-time spikes were observed throughout the session.


A closer examination showed that most spikes were associated with UI transitions and interface interactions, with frame times occasionally reaching approximately 100 ms.

While these spikes are relatively short and may be acceptable during menu navigation, similar stalls occurring during combat or intensive gameplay scenarios could negatively affect responsiveness and player decision-making.


Key Observation

  • Average FPS remained stable.
  • Frame pacing was less consistent than frame rate.
  • UI-related operations appeared to be the primary source of frame-time spikes.

2. Memory Usage

Memory management plays a critical role in long-session stability, loading performance, and background app recovery, especially on lower-end devices.

At the beginning of testing, total PSS memory increased rapidly to approximately 1.77 GB before dropping significantly after a major scene transition.

This behavior suggests that resource cleanup mechanisms are functioning correctly and that large portions of memory can be reclaimed when changing game states.

However, detailed memory metrics reveal additional optimization opportunities.

Graphics Memory
The GL mtrack curve closely followed overall PSS growth, indicating that graphical assets such as:

  • Textures
  • Meshes
  • Rendering resources are major contributors to memory consumption.

This highlights the importance of asset loading and unloading strategies.

Native Memory
The Native Heap increased rapidly after launch, reaching roughly 900 MB and remaining at that level throughout the test.
Since little memory reclamation was observed, deeper investigation using tools such as Perfetto may be worthwhile to identify long-lived native allocations.

Key Observation
Resource cleanup appears functional.
Graphics assets account for a significant portion of memory growth.
Native memory persistence deserves further investigation.

3. GPU Performance

GPU utilization directly impacts frame stability, thermal behavior, and battery life.

Overall GPU performance remained healthy throughout testing.

The game's main city scene showed higher GPU clock activity than other gameplay areas, but still remained within a reasonable range.

Bandwidth Peaks
Several scenes exhibited temporary GPU bandwidth spikes.
One possible explanation is the use of volumetric visual effects such as clouds, fog, or atmospheric rendering.

Potential optimization directions include:

  • LOD adjustments
  • Effect quality scaling
  • Dynamic graphics settings
  • Additional Rendering Analysis

For more complex scenes, GPU Primitive metrics can provide additional visibility into geometry processing and rendering pipeline efficiency.

GameOptim Gears also supports deeper GPU analysis through metrics such as:

  • GPU Shaded
  • GPU Shader Cycles
  • GPU Shader Instructions allowing developers to investigate rendering bottlenecks beyond basic utilization percentages.

Key Observation

  • GPU workload remained generally balanced.
  • Certain effects may create avoidable bandwidth pressure.
  • More detailed shader-level analysis can reveal additional opportunities.

4. Beyond FPS, Memory, and GPU

Many performance issues originate outside traditional rendering metrics.

Network Activity
Network throughput peaked during initial resource downloads and login operations.

Although expected, these spikes can contribute to temporary increases in device power consumption and heat generation, particularly on lower-end devices.

Power Consumption
Average device power usage remained relatively stable throughout testing.

Battery drain curves showed no unusual behavior, suggesting that the title maintains reasonable energy efficiency during extended play sessions.

Key Observation

  • Startup downloads generate the highest network activity.
  • Power efficiency remained consistent during testing.
  • Long-session thermal behavior appears well controlled.

Why Profiling Matters

One of the most difficult aspects of optimizing 4X games is that performance problems are often interconnected.

A memory issue may trigger additional loading stalls.
A rendering bottleneck may increase power consumption.
A UI system may introduce frame-time spikes that players perceive as lag despite stable FPS.

The value of a profiling workflow is not just collecting metrics, but understanding how these systems interact.

In this case study, GameOptim Gears helped transform raw telemetry into actionable insights by:

  • Capturing performance data without code modification
  • Monitoring FPS, memory, GPU, network, power, and thermal behavior in one workflow
  • Visualizing trends and anomalies for faster investigation
  • Supporting optimization decisions based on real-device data

Notes

All results were collected from a publicly available version of Total Battle under controlled test conditions. Data reflects the behavior of a single device and test scenario and should be considered a reference point rather than a definitive benchmark for all hardware configurations.

Top comments (0)