Most coverage of "AI for India" treats the subject the way Silicon Valley treats emerging markets — translate the product, localize the UI, and you're done. Six months of production deployment of 7S Samiti, our AI tutor for rural Indian students, has taught us that this framing is almost completely wrong.
This is a piece about what we actually learned building an LLM-powered education product for students in low-connectivity, low-literacy-adjacent environments in rural India. It's not a case study about an AI that worked. It's a case study about the specific, surprising ways it broke, and what we did about it.
The Problem We Were Solving
7S Samiti is a mission-driven edtech platform. The goal: deliver personalized, adaptive learning to rural Indian students at a price point that doesn't exclude them. The stack:
- Mobile app (Flutter, offline-first) deployed to entry-level Android phones.
- AI tutor that generates quizzes, assignments, and study notes on demand, in the student's preferred language.
- Local caching + selective sync for areas with 2G-only connectivity.
- Parent/teacher dashboard for progress tracking.
We deployed to pilot schools in three states (Uttar Pradesh, Maharashtra, Rajasthan). 2,400 students in the initial rollout.
Failure #1: The Tokenization Problem
Every multilingual LLM paper talks about "parameter efficiency across languages." What they don't talk about: Hindi and Marathi have 2-4x worse tokenization efficiency than English in most off-the-shelf models.
A 200-word Hindi paragraph consumes ~600 tokens where the English equivalent consumes ~150. Latency is higher, cost is higher, and generation quality is often lower because you're burning context budget just encoding the same content.
Three fixes that worked:
1. Route by language. English queries to GPT-4. Hindi/Marathi queries through a dedicated pathway with tokenizer-aware prompts.
2. Translate-generate-translate for complex content. For long-form study notes, we generate in English and translate to Hindi/Marathi as a post-process. Three model calls instead of one. Surprisingly produces higher-quality Hindi than direct generation, because the model's reasoning in English is stronger.
3. Pre-generate common content. 80% of what students request is predictable. We batch-generate overnight, cache it, and serve from cache for 90% of requests.
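Fixes 1 and 3 can be sketched as a cache-first serving function sitting in front of a script-based router. This is a minimal illustration, not our production code: the route names (`indic-pathway`, `gpt-4-pathway`) and the dict-shaped cache are placeholders, and Hindi/Marathi detection is reduced to a Devanagari Unicode-range check.

```python
import re

# Hindi and Marathi are both written in Devanagari (U+0900-U+097F),
# so a single Unicode-range check is enough to split traffic.
DEVANAGARI = re.compile(r"[\u0900-\u097F]")

def route(query: str) -> str:
    """Fix 1: pick a generation pathway by script. Names are placeholders."""
    return "indic-pathway" if DEVANAGARI.search(query) else "gpt-4-pathway"

def serve(query: str, cache: dict) -> tuple:
    """Fix 3: serve pre-generated content from cache before touching a model."""
    if query in cache:
        return ("cache", cache[query])
    # In production this would dispatch to the chosen pathway; here we
    # just report which route would have been taken.
    return (route(query), "<generated response>")
```

The cache-first check is what turns the overnight batch generation into the ~800ms path; only cache misses ever pay model latency.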
Result: latency dropped from ~4s to ~800ms for cached content. API cost dropped by 70%.
Failure #2: Voice vs. Text
Our initial app was text-first. Deployment showed us that rural Indian students use voice input 8x more often than text.
Many students are the first generation in their family learning to read and write fluently in their regional language. Typing in Devanagari or Marathi script on a tiny phone keyboard is slow and intimidating. Speaking is natural.
What we did:
- Voice input became the default UX. App usage doubled within two weeks.
- Audio output for all AI responses. Students listen 3x longer than they read.
- Multimodal interaction for math. Students photograph a math problem; we OCR it, solve it, and explain the solution by voice. This drove 40% of daily active usage in month one.
Failure #3: Connectivity Reality
We designed for "low bandwidth." Reality: students use the app during a 15-minute window when they have signal, then go offline for hours.
The naive implementation — real-time cloud LLM calls — doesn't work. Students tap "solve," wait 8 seconds, then lose signal mid-request.
What we shipped:
- Queue-and-sync model. Students ask questions offline. App queues, syncs when signal arrives, pushes responses back.
- On-device inference for basic queries. Distilled quantized model (~4B params) runs locally for ~30% of common requests. Zero connectivity required.
- Selective sync with priority. Prioritize unanswered questions > content updates > analytics when the 15-minute window arrives.
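The queue-and-sync model with priority tiers can be sketched as a small priority queue drained during the connectivity window. This is an illustrative skeleton under assumed names (`SyncQueue`, the three tier labels); the real implementation also handles persistence across app restarts and pushing responses back to the student.

```python
import heapq
from itertools import count

# Priority tiers from the sync policy above: unanswered questions first,
# then content updates, then analytics. Lower number = synced first.
PRIORITY = {"question": 0, "content_update": 1, "analytics": 2}

class SyncQueue:
    """Queues items while offline; drains by priority when signal arrives."""

    def __init__(self):
        self._heap = []
        self._seq = count()  # tie-breaker keeps FIFO order within a tier

    def enqueue(self, kind: str, payload: str) -> None:
        heapq.heappush(self._heap, (PRIORITY[kind], next(self._seq), payload))

    def drain(self, budget: int) -> list:
        """Sync up to `budget` items during the 15-minute window."""
        sent = []
        while self._heap and len(sent) < budget:
            _, _, payload = heapq.heappop(self._heap)
            sent.append(payload)
        return sent
```

The `budget` parameter is the key design choice: the window may close at any moment, so the queue must always send the highest-value items first rather than oldest-first.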
Impact: session completion rate went from 34% to 88%.
Failure #4: What the AI Didn't Know About Students
Our first prompts were generic, and the output was technically correct but culturally irrelevant. We'd explain algebra using apples and oranges to a student who'd never seen an orange, or chemistry with lab metaphors to students who'd never seen a Bunsen burner.
Fixed at the prompt layer:
- Regional context injection. Every prompt includes the student's state, language, and region-appropriate analogies.
- Textbook alignment. State boards use different textbooks. We pre-ingested Maharashtra Board, CBSE, UP Board syllabi.
- Humility prompts. "If outside the standard textbook for the grade, say so and offer the closest related question." Reduced confidently-wrong answers by 80%.
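A minimal sketch of how these three prompt-layer fixes compose into one prompt builder. The analogy table here is hypothetical (the real one is curated per state and far larger), and the exact wording of the humility clause is paraphrased from the quote above.

```python
# Hypothetical per-state analogy hints; the production table is curated.
REGION_ANALOGIES = {
    "Maharashtra": "sugarcane fields, local bazaars, ST buses",
    "Uttar Pradesh": "wheat harvests, village wells, railway junctions",
}

HUMILITY_CLAUSE = (
    "If this question falls outside the standard {board} textbook for "
    "grade {grade}, say so and offer the closest related question."
)

def build_prompt(question, state, board, grade, language):
    """Inject regional context, textbook alignment, and a humility clause."""
    analogies = REGION_ANALOGIES.get(state, "everyday village life")
    return (
        f"You are a tutor for a grade-{grade} student in {state}, "
        f"studying the {board} syllabus. Answer in {language}.\n"
        f"Use analogies drawn from: {analogies}.\n"
        + HUMILITY_CLAUSE.format(board=board, grade=grade)
        + f"\n\nQuestion: {question}"
    )
```

Because the context is injected per request, the same base model serves all three state boards without fine-tuning.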
Deployment Data
After 6 months in production:
- 2,400 students across 3 states.
- ~38K AI interactions per day at peak (mid-exam season).
- ~₹0.02 per interaction (after optimization).
- Session completion rate: 88% (up from 34% at launch).
- Voice input share: 76% of total queries.
- Cached content hit rate: 89% during exam-prep weeks.
Monthly AI-serving cost per student: ~₹285, against a target price of ₹99/month. We lose money today, but the unit economics improve once we pass 10K+ students.
Five Lessons
- Vernacular LLMs are a tokenization problem before they're a model problem. Fix tokens + prompts before picking a model.
- Voice-first changes everything. If building for rural India, voice IS the interface.
- Design for bursty connectivity. 15-minute-window sessions are the real use case.
- Cultural context in prompts is not optional. Analogies matter more than raw model quality.
- On-device + cloud hybrid is the only viable architecture. Neither alone works.
Where This Goes
Building AI for Bharat is not a translation problem. It's a systems problem involving tokenization, connectivity, UX modality, and cultural context — all of which need to be solved in concert.
At Xenotix Labs we've shipped 7S Samiti, Growara (WhatsApp AI automation), and Alcedo (AI-powered education discovery) — three different LLM-powered products for Indian users. Each taught us something that contradicted what the AI literature said would happen. If you're building AI solutions for startups in the Indian context, these failure modes are probably in your future. Getting ahead of them saves months.
Ujjawal Tyagi is the founder of Xenotix Labs, a product engineering studio that's shipped 30+ production apps including 7S Samiti (AI tutor for rural India), Growara (AI WhatsApp automation), and Cricket Winner (real-time cricket trading).