When disk usage on cluster nodes pushes past 95%, immediate action is required to prevent outages. GBase 8a, the China-domestically developed MPP cluster database from GBASE, offers two practical paths: leveraging its columnar-engine high-compression capability to rebuild tables, or scaling out with new hardware.
Here's a detailed breakdown of both approaches, including ready-to-run SQL and a comparison to help you choose the right strategy for your gbase database.
Solution 1: Table Rebuild with High Compression
This zero-hardware-cost approach takes advantage of GBase 8a's compression engine to reclaim significant disk space.
How It Works
- GBase 8a defaults to conservative compression. If
gbase_compress_levelis 0 (lightweight), there's substantial room for improvement. - Rebuilding a historical table with
COMPRESS('HighZ', 0)can reclaim approximately 50% of disk space.
Performance note: Higher compression levels consume more CPU during reads and writes. Run this operation during off-peak hours to minimize impact[reference:9].
Step-by-Step
Step 1: Create a high-compression backup table
Clone the original table structure with a high-compression clause.
CREATE TABLE "t1_bak" (
"a" int(11) DEFAULT NULL,
"b" varchar(10) DEFAULT NULL
) COMPRESS('HighZ', 0) ENGINE=EXPRESS DEFAULT CHARSET=utf8 TABLESPACE='sys_tablespace';
[reference:10]
Step 2: Migrate data
Inserting all rows triggers compression and reorganization under the hood.
INSERT INTO "t1_bak" SELECT * FROM "t1";
[reference:11]
Step 3: Atomic swap
After verifying data integrity, drop the original table and rename the backup for a seamless cutover.
DROP TABLE "t1";
RENAME TABLE "t1_bak" TO "t1";
[reference:12]
Step 4: Verify savings
Query information_schema.tables to compare data_length before and after, or run df -h to confirm the space reclaimed[reference:13].
Solution 2: Horizontal Server Expansion
For latency-sensitive production workloads, adding hardware is the more robust long-term solution.
- Resource planning: Submit expansion requests specifying CPU, memory, disk type, and capacity requirements[reference:14].
- Environment setup: Deploy GBase 8a on new servers, configure cluster parameters, and establish network connectivity with the existing cluster[reference:15].
- Data rebalancing: Use database backup/restore or
LOAD DATAto migrate a portion of data to new nodes, evening out storage utilization across the cluster[reference:16].
Comparison and Recommendations
| Approach | Advantages | Disadvantages | Best For |
|---|---|---|---|
| Table Rebuild with Compression | No extra hardware cost; fast results | Higher CPU overhead; multi-step process | Short-term relief; read-only or low-write historical data |
| Server Expansion | Definitive storage fix; performance-friendly | Requires hardware budget and deployment time | Long-term growth; latency-sensitive, high-concurrency workloads |
Which One Should You Choose?
The decision boils down to your workload profile and timeline. If historical data is queried infrequently and you need immediate relief, start with the table rebuild approach—it costs nothing and can be completed within a maintenance window. If your business is performance-sensitive and data volume is growing steadily, go straight for server expansion. In practice, many teams combine both: compression for cold archives, expansion for hot production data.
GBASE's GBase 8a MPP Cluster gives DBAs flexible tools to manage storage at scale. By understanding both the software-level compression levers and the hardware scale-out path, you can keep your China-domestically developed, independently controlled database platform running smoothly even as data volumes climb.
Looking ahead, as teams adopt tiered storage strategies within gbase database environments, these two techniques form the foundation of a mature capacity planning playbook.
Top comments (0)