Muhammad H.M. Alvi

Posted on Jun 14 • Originally published at insights.aethonautomation.com

Soon, Your Country Will Rent Its Own Mind

#sovereignai #nationalllm #aipolicy #aiinfrastructure

There will be three tiers of nations: those who build AI, those who rent it, and those who beg for it. Every state will want its own large language model — and most will not get one. National LLMs are moving from prestige projects to instruments of statecraft — used not just to chat, but to draft policy, model economic scenarios, triage public services, run intelligence and propaganda analysis, and estimate probabilities for decisions. Once a model sits inside the decision loop of a state, the question "whose model is it, running on whose chips, powered by whose grid, paid for in whose currency?" becomes a sovereignty question — and that is where a new tiering of nations emerges.

Why governments will run their own models

The pull is not vanity; it's function. A government-controlled LLM offers four things a foreign API cannot:

Data control — citizen records, tax, health, security, and legal corpora cannot be sent to a US or Chinese endpoint without surrendering both privacy and leverage. Bangladesh's own draft AI Policy already encodes "data localization for sensitive data" and "train abroad, infer locally" for exactly this reason.
Decision support and probability estimation — forecasting revenue, modeling flood/cyclone risk, war-gaming, epidemiology, election and unrest prediction, fraud scoring. A state that can run these in-house, on its own language and its own data, gains an analytic edge; one that rents it abroad exposes its priors to the landlord.
Narrative and linguistic sovereignty — a model fluent in Bangla, trained on local history and values, versus a model whose defaults were set in California or Beijing. Whoever trains the model sets its refusals, its framing, and its silences.
Continuity — an API can be rate-limited, price-hiked, sanctioned, or switched off. A sovereign model on sovereign hardware cannot be revoked by a foreign vendor or government.

So the demand for national LLMs is near-universal. The ability to supply one domestically is not — and that gap is the whole story.

The dependency cascade: a three-tier world

The report's numbers imply a stratification that will harden over the next 5–10 years:

Tier 1 — Compute sovereigns (build frontier models): US and China, with a short tail (EU collectively, possibly India, the Gulf states buying their way in). They own the foundation models, the leading-edge fabs' output, and the GPU supply. They export intelligence as a service.
Tier 2 — Adaptation sovereigns (fine-tune and host, but cannot train frontier): This is Bangladesh's realistic ceiling and that of most of the Global South — take an open-weights model (Llama, Qwen, Mistral, DeepSeek), continue-pretrain it on local data, host inference domestically. Genuine partial sovereignty, but structurally downstream: every base model, architecture, and capability frontier is inherited from Tier 1, and the open-weights spigot can be narrowed at any time (more restrictive licenses, capability gating, or simply the best models going closed).
Tier 3 — Dependents (consume foreign APIs): Countries that can afford neither GPUs nor power nor the dollars, running their governance on metered foreign endpoints. Their citizens' queries, their states' analytic questions, and their data flow through infrastructure they neither own nor can audit. This is cognitive dependency — a deeper kind than the cloud dependency of the 2010s, because it touches the reasoning layer of the state itself.

The cruelty of the cascade is that it compounds: Tier 1 models improve fastest (more compute, more data, more talent, more revenue to reinvest), so the capability gap widens even as Tier 2/3 work hard to keep up. Catch-up requires running uphill on a treadmill someone else controls the speed of.

The new monopoly: why it's LLM × Energy × Dollar, not just chips

The chokepoint is a stack, not a single resource. To field frontier intelligence you need all three of these at once, and each is concentrated:

Compute (the GPUs): a near-monopoly — NVIDIA designs, TSMC fabricates, ASML supplies the lithography, and US export controls govern who may buy. Four companies and one government effectively gate the world's AI accelerators.
Energy (the power): training and serving frontier models is an industrial-scale electricity problem. A single frontier training run and its serving cluster can demand hundreds of MW of firm, uninterrupted power. As the report shows, Bangladesh load-sheds 2,000+ MW; it cannot spare a dedicated, rock-stable 100+ MW for a GPU campus. Whoever has cheap, abundant, reliable power (US gas/nuclear, Gulf solar+gas, China's overbuilt grid) can host compute; whoever doesn't, can't — regardless of how many chips they're allowed to buy.
Dollars (the capital): GPUs are priced in USD, depreciate fast, and require billions in sustained capex. Foundation-model training runs cost $78–125M+ each and recur. A country with five months' import cover cannot redirect a meaningful slice of reserves to a depreciating, dollar-denominated compute fleet without IMF-relevant consequences. The reserve currency is the AI currency.

A nation must clear all three gates simultaneously. Most of the world fails on at least two. That intersection — chips you're allowed to buy, power you can actually deliver, and dollars you can actually spend — is the genuinely scarce thing, and the entities (states and a handful of hyperscalers) sitting at that intersection form the compute–energy–dollar oligopoly that will rent intelligence to everyone else. Sovereign-wealth-backed Gulf states are essentially trying to buy their way into Tier 1 by converting oil dollars and cheap energy directly into GPU fleets — a strategy available to almost no one else, and a preview of how the gate gets priced.

Counterforces (why the monopoly may be softer than it looks)

Determinism is the wrong conclusion. Several forces cut against a permanent lock-in, and a smart Tier-2 state plays for them:

Open weights keep leaking the frontier downhill. DeepSeek, Llama, Qwen, and Mistral have repeatedly put near-frontier capability into anyone's hands months after release. As long as a competitive open model exists, Tier 2 is viable. The risk is that this is a policy choice by Tier 1 actors, not a law of nature — it can stop.
Efficiency is collapsing the cost of "good enough." Distillation, quantization, MoE, and small-model gains mean the compute needed to serve a useful national model keeps falling. The 70B-on-two-cards reality in the report would have been a data-center job three years ago. The serving frontier is democratizing even as the training frontier concentrates.
Regional pooling. A single small state can't justify a frontier cluster; a bloc can. Shared sovereign compute (an ASEAN, SAARC-successor, OIC, or BRICS pool) is the plausible escape hatch from Tier 3 — and the cheapest one.
The gap that matters narrows. For most government tasks — citizen services, document drafting, local-language Q&A, routine analysis — a well-fine-tuned open model is already sufficient. Frontier supremacy matters for a narrow band of hardest problems. A country doesn't need GPT-5-class power to run its bureaucracy in Bangla; it needs reliable, sovereign, good-enough intelligence — which Tier 2 can deliver.

What this means for Bangladesh specifically

The honest strategic read: Bangladesh will be a Tier-2 adaptation sovereign for the foreseeable future, and that is a defensible, achievable position — not a failure. The danger is not failing to reach Tier 1 (essentially no one outside the US/China will). The danger is sliding to Tier 3 — building nothing, and metering the state's reasoning through foreign APIs by default.

The hybrid roadmap in the report (train abroad, own the weights, host and fine-tune at home, pool regionally for anything bigger) is precisely the play that secures Tier 2 and hedges against dependency. The three gates — chips, power, dollars — are also Bangladesh's three-item national to-do list: secure a licensing pathway for accelerators, build firm dedicated power for a compute campus, and ring-fence a stable capital line so AI capex doesn't compete with the import bill. Clear those, and "sovereign enough" is reachable. Miss them, and the cognitive-dependency tier is the default outcome.

Can Bangladesh Host Its Own National LLM? A Compute & Infrastructure Reality Check

TL;DR

Bangladesh today can comfortably fine-tune and serve (host inference of) a national Bangla LLM built on an open model like Llama or Mistral, but it cannot train a frontier model from scratch domestically — its entire public-sector AI compute is roughly "over 20" NVIDIA Volta GPUs (~2,240 teraFLOPS) at the National Data Center, orders of magnitude short of the thousands of H100-class GPUs that frontier training requires.
The binding constraints are not data-center shells but GPUs, reliable power, and dollars: the country has a certified Tier-IV national data center (plus a larger 4,800-rack/28.8 MW design at Kaliakair) and a growing private DC market, but almost no high-end AI accelerators, a grid still load-shedding 2,000+ MW, and US export-control friction plus a recovering-but-tight forex position that make bulk H100/B200 procurement slow and expensive.
The realistic path is hybrid: train/continue-pretrain on foreign cloud GPUs (as every existing Bangla model — BongLLaMA, TituLLMs, TigerLLM — has done), then host fine-tuning and inference domestically on a modest sovereign GPU cluster. This is exactly what the draft National AI Policy 2026–2030 implicitly concedes by allowing models to be "trained abroad" but "inference tested locally."

Key Findings

1. Data-center shells exist; AI-grade compute inside them does not. Bangladesh operates a Uptime-Institute-certified Tier-IV National Data Center at Bangabandhu Hi-Tech City, Kaliakair (built by China's ZTE with EXIM Bank of China financing; government share ~Tk317.55 crore), with 604 racks at 4–10 kW each, 2 PB storage (expandable to 200 PB), and 744 physical servers. A separate, larger IV-Tier facility is designed for 4,800 racks / 28.8 MW. But these are engineered around 4–10 kW racks for general government workloads — not the 40–130 kW high-density, liquid-cooled racks that modern GPU clusters (DGX H100 ≈ 10.2 kW/server; GB200 NVL72 ≈ 120–140 kW/rack) require.

2. The entire public-sector GPU fleet is tiny and made of reactivated, previous-generation cards. On January 14, 2026, the National Data Center / BCC launched the first public-sector shared "GPU Cloud," inaugurated by Chief Adviser's Special Assistant Faiz Ahmad Taiyeb, integrating "over 20 NVIDIA Volta architecture GPUs" (~2,240 teraFLOPS total) — and these were dormant GPUs reactivated under the BDSAT project, not a new purchase. Volta (V100-class) is roughly two GPU generations behind H100. This is adequate for student/research workloads and small fine-tunes, but it is not an LLM-training cluster.

3. Private and telecom operators are moving faster than the state. The private DC market is ~23.55 MW of IT load in 2025, forecast to grow ~45% CAGR to ~150 MW by 2030. BDx (regional operator) secured NVIDIA DGX-Ready certification in April 2025; Meghna Cloud switched on phase 1 of a dedicated cloud DC backed by a USD 500 million pledge; and Summit Group — which already controls ~7% of national power generation and a fiber network serving ~half of internet demand — announced plans to enter the DC market, pitching its under-utilized gas plants as 24/7 baseload for AI.

4. Every existing Bangla LLM was trained abroad/on cloud, cheaply, using open models. TituLLMs (1B/3B, by Hishab, derived from Llama-3.2) consumed 1,750 H100-GPU-hours in total over ~37B tokens; BongLLaMA fine-tuned 7B/8B models in ~40 A100-GPU-hours/epoch using LoRA on a single 80 GB A100; BengaliLlama fine-tuned a 7B in ~4 days on one A100. None required a domestic supercomputer. TigerLLM, BanglaBERT (BUET), and BanglaByT5 round out a research-grade — not production — ecosystem.

5. Government LLM ambition is real but embryonic. The draft National AI Policy 2026–2030 names a sovereign Bangla LLM and a "National AI Compute Strategy" with "phased GPU cluster upgrades" as cornerstones, but contains no funded procurement, dollar figure, or unit count. BCC's Research & Innovation Center has issued an open call for a "Large Bangla Generative Model" (no specs); the EBLICT project's "Brain Lab" promises LLM-training GPU access; and a Bangla AI platform "Kagoj.ai" launched with ~4,000 trial users and a stated plan to develop a Bengali LLM. The funding vehicle — the World Bank EDGE Project (originally $295M) — has been cut by $175M to ~$120M and rated "moderately unsatisfactory," ending ~2026.

Details

Data-center infrastructure

National Data Center (BDCCL/BCC), Kaliakair: Tier-IV (Uptime Institute certified), marketed as "world's 7th largest," 200,000 sq ft, 604 racks (42U, 4 kW & 10 kW), 152 racks for cloud, 2 PB storage (expandable to 200 PB), 744 physical servers, up to 40 Gbps connectivity, 99.995% uptime / 2N+1. Built by ZTE (China) with Chinese EXIM Bank financing. A Jashore backup site provides disaster recovery.
A larger IV-Tier facility referenced at 4,800 racks / 28.8 MW IT power across two buildings (first building targeted Q3 2024).
Private market: ~13 facilities / 24 operators in Dhaka. Tier III dominates (~56% share in 2024). Operators include Summit, Felicity IDC (Tier III, 500 racks, up to 7 kW/rack), aamra, Dhaka Colo, XeonBD, Coloasia, Gotipath. PUE generally <1.8 (national DC), with operators targeting <1.5.
AI-readiness: BDx DGX-Ready (April 2025); Meghna Cloud ($500M pledge, phase 1 March 2025). Industry analysis notes new builds being designed for "100-kW racks and liquid-cooling loops" for GPU inference clusters — i.e., AI-grade density is only now arriving.

GPU availability and AI compute

Public sector: "over 20" NVIDIA Volta GPUs (~2,240 TFLOPS), reactivated under BDSAT, on Nutanix + CNCF-certified Kubernetes PaaS; access by email request to BCC. Cited use cases: ML dataset training, threat-intel, geoscience modelling.
No A100/H100/B200 clusters of any scale are publicly documented in Bangladesh. DGX systems (DGX Spark workstation ~৳640,000; DGX H100/A100) are sold by local resellers (PCB Store, Potaka IT, Star Tech) for individual labs — not national-scale clusters.
Universities (BUET, BRAC, etc.) have modest GPU servers used for NLP research; standard practice is renting cloud A100/H100 or using Colab/Kaggle/TPU Research Cloud credits.

Export controls and procurement constraints

US BIS rules: H100/H200/A100/B200 are controlled. The Jan 2025 "AI Diffusion Rule" placed most countries (including South Asia) in Tier 2 (≈50,000 H100-equivalent country cap 2025–27; ≤1,700 H100-equiv per company/year license-free), then was rescinded in mid-2025 by the Trump administration, with a replacement rule pending and enforcement tightened mainly against China. Net effect for Bangladesh: advanced GPUs are legally obtainable (Bangladesh is not Tier 3 / embargoed), but subject to licensing friction, the pending replacement framework, and high cost — and VEU/licensing pathways favor large, vetted buyers.
Forex: Reserves fell from ~$48B (Aug 2021) to below $20B (2024) during the dollar crisis (LC-opening difficulties, ~30% taka depreciation to ~Tk122/USD), then recovered to >$33B gross / ~$28.5B on IMF BPM6 by end-Dec 2025 on record remittances (>$30B in FY25). Imports are easier than in 2023–24, but ~5 months' cover is "comfortable, not comfortable enough," and large dollar capex on depreciating GPUs remains a hard sell.

Power and reliability

Installed capacity ~29,000 MW, but usable output often only ~12,000–14,000 MW due to fuel shortages, unpaid IPP/Adani bills, and plant outages — producing load-shedding of 2,000–2,500+ MW in April 2026, with rural outages of 8–12 hours. Record generation hit ~17,200 MW (2025), yet still with load-shedding.
Only ~2% of electricity was renewable in 2024 (target 25% by 2035). Grid instability and high industrial tariffs (~9–12 BDT/kWh) push DC operators toward captive generation and UPS — Summit's pitch is precisely to co-locate DCs at its gas plants.
Cooling/climate: Dhaka's 25–35°C ambient means ~1.4–1.6× power overhead for cooling; high-density GPU racks need liquid cooling that most existing BD facilities lack.

What a national LLM actually requires (the core technical comparison)

Training a frontier model from scratch: thousands of GPUs for weeks–months. Llama-3.1-405B used 24,576 H100s (~$125M); GPT-class runs cost $78–100M+ (Stanford AI Index 2025) — for scale context, GPT-4-class training has been characterized as on the order of ~25,000 GPUs. Not feasible domestically and not necessary.
Continued pre-training / building a small sovereign model (1–8B): TituLLMs-scale ≈ 1,750 H100-hours total — achievable in days on a rented 64–256 GPU cloud cluster for low-to-mid five figures of dollars; doable on the BCC Volta cloud only very slowly.
Fine-tuning an open model (LoRA/QLoRA): a 7–8B model fine-tunes on a single 24–48 GB GPU; a 70B with QLoRA on ~2× A100 or even a single 48 GB card in ~12–24 hrs, often <$50–$5,000. Fully within reach today on local or rented hardware.
Hosting/serving inference: an 8B model serves on one consumer GPU (4-bit ≈ 6 GB); a 70B (4-bit ≈ 40 GB) on 2× 24 GB cards. A national chatbot serving millions needs a cluster of dozens–low-hundreds of inference GPUs — large but financeable, and the natural sovereign use case for a domestic GPU cloud.

Policy and partnerships

Strategy stack: Smart Bangladesh 2041 / ICT Master Plan 2041 (Universal Digital ID, Government Cloud); National Strategy for AI; draft National AI Policy 2026–2030 (risk-based EU-style tiers, data-localization for sensitive data, "train abroad, infer locally," National AI Compute Strategy). A UNESCO/UNDP/ICT-Division AI Readiness (RAM) report released Dec 2025 flagged "GPU scarcity," fragmented data systems, and largely absent Bangla-language AI infrastructure among 15 priority gaps. (On readiness rankings, treat single figures cautiously: Bangladesh sits at 113th (score 0.38) on the IMF's 174-economy AI Preparedness Index of June 2024 — behind India (72nd) and Sri Lanka (92nd); a separately cited "75th on the Oxford Insights Government AI Readiness Index" could not be confirmed against Oxford's 2024/2025 editions and should be verified before use.)
Partnerships: China (ZTE) built the national DC; Oracle Sovereign/G-Cloud was deployed in 2024; the World Bank funds the EDGE Project and the GPU-cloud initiative. Regional context: IndiaAI has empanelled 18,693 GPUs (incl. 12,896 H100s and 1,480 H200s), since reported to have crossed 34,000–38,000 units — a scale Bangladesh has no equivalent to. India ties have cooled since Aug 2024.

Recommendations

Stage 0 (now, 0–6 months) — Serve and fine-tune; don't build from scratch.

Stand up a national Bangla LLM by continued-pretraining + instruction-tuning an open model (Llama 3.x 8B/70B, Mistral, Qwen, or Gemma) on rented foreign cloud H100/H200 (a few hundred to a few thousand GPU-hours; low-five-figures of dollars). Host the resulting weights and inference domestically. Benchmark against TituLLM/TigerLLM on BLUB.
Use the BCC Volta GPU cloud + university clusters for experimentation, data curation, and small LoRA runs.

Stage 1 (6–18 months) — Build a modest sovereign inference + fine-tuning cluster.

Procure on the order of 1–4 DGX H100/H200 nodes (8–32 GPUs) or equivalent, sited at the Kaliakair Tier-IV DC or a DGX-Ready private facility (BDx/Meghna/Summit) with dedicated power + liquid cooling. This is enough to fine-tune up to 70B models and serve national inference, and is financeable even under forex constraints.
Pursue a VEU/licensing pathway with the US early (and/or evaluate compliant alternatives), given the pending replacement export rule.

Stage 2 (18–36 months) — Conditional scale-up.

Only invest in a 100+ GPU training cluster if (a) the grid can reliably deliver an additional dedicated 5–15 MW with <1% interruption, (b) forex reserves hold above ~5 months' import cover, and (c) a funded line item (not draft-policy language) and an export license are secured. Co-locate at a captive-power site (the Summit model).

Benchmarks that would change the plan:

Greenlight domestic training if Bangladesh secures a >1,000-H100-equivalent allocation (own or via a hyperscaler local zone) and dedicated firm power.
Stay cloud-first as long as load-shedding exceeds ~1,000 MW or a frontier-scale procurement would consume a meaningful share of reserves.

Caveats

Draft vs. enacted: Much of the government ambition (National AI Compute Strategy, sovereign LLM, GPU procurement) lives in draft policy with no funded budget or tender; treat "shall" language as aspiration, not capability.
Numbers in flux: Power, forex, and US export-control figures move monthly (the replacement export rule is pending; reserves are recovering). The load-shedding and reserve figures here are early-to-mid-2026 snapshots.
Source quality: Some DC superlatives ("world's 7th largest") are government/marketing claims; the EDGE Project's troubled rating and the reactivated-GPU detail come from named local reporting and should be weighted accordingly. The AI-readiness ranking figure is unresolved (see Policy section).
"National LLM" is ambiguous: hosting/serving and fine-tuning are clearly feasible today; training a competitive frontier model domestically is not, and pursuing it now would likely be a poor use of scarce capital versus a hybrid cloud-train / host-locally approach.

Originally published on Aethon Insights

DEV Community