The Three-Layer Architecture Running a Real Gym: Momo → KinTwin → Global Ops
我们建的不是一个产品,是一个AI操作系统。
— Suzanne Mok / 莫淑瑜, Founder @ ZWISERFIT
The single biggest data blind spot in the fitness industry isn't who walks through the door. It's what happens after they walk in.
Wearables can't fill it. SaaS CRMs can't fill it. Only space-level AI can.
We're not building a product. We're building an AI operating system — three parallel layers, one single body, running live in production at a real fitness studio in Wanjian, China.
The founder waited 7 years to open this store. Not because she couldn't — but because the AI wasn't ready. Wanjian opened in April 2026, and it's the deliberate starting point: no location advantage, no privileged resources. If the stack works here, it works anywhere.
Let me take you inside the architecture.
The Data Problem Nobody Talks About
Every gym knows when a member scans their card. Every gym knows their churn rate is climbing.
But here's what nobody knows:
- Did they walk 3 minutes on the treadmill, or run 30?
- Did they move from the cardio zone to the weight floor?
- How long did they linger on each machine?
- What training pattern did they actually follow today?
That's the behavioral flow data gap. And it's massive.
Fitness operators make equipment purchasing decisions, pricing decisions, and retention strategies based on check-in counts and surveys. Not actual behavior.
This is the problem the three-layer stack was built to solve.
Three Parallel Layers, One Body (Running Simultaneously)
┌──────────────────────────────────┐
│ Layer 3 │ Global Operations │ ← Brand narrative, commercial flywheel
├──────────────────────────────────┤
│ Layer 2 │ KinTwin AI Engine │ ← Edge CV, data verification, AI processing
├──────────────────────────────────┤
│ Layer 1 │ Momo Scene Layer │ ← Touch the store, capture the flow
└──────────────────────────────────┘
One OS · Single Body
This is not a "build Layer 1, then Layer 2, then Layer 3" architecture. All three layers run in parallel from day one.
That's the moat.
Layer 1 | Momo Scene Layer
Momo is the store brain.
Named after my surname (莫), she's not a tool — she's a partner embedded in every studio that runs on our OS.
Momo handles:
- Member face recognition check-in (no cards, no fobs)
- Training session recording — what equipment, how long, what intensity
- Private-domain CRM — WeCom-based member engagement, automated
- Workout guidance — in-studio AI coaching via edge devices
Everyday operations? AI-powered — freeing human staff to focus on member relationships and premium service.
Business model: Open-source framework (trust signal) + value-added services.
Why open source first? The fitness industry is hyper-sensitive about privacy. Nobody will let a closed-source black box into their studio. Momo being open source is the trust contract that opens the door.
Layer 2 | KinTwin Technical Core
KinTwin is the sensorium — the technical layer that makes Momo smart.
Hardware:
- Edge computing nodes deployed in-studio
- Space-level computer vision cameras (not generic surveillance — fitness-specific CV)
What it captures:
- Not just "how many people came" — every individual's movement trajectory
- Equipment dwell time, transition patterns, training modalities
- Real-time behavioral flow data
What it does:
- Data verification — cryptographically tamper-proof. No faking utilization numbers.
- AI inference — real-time pose estimation, activity classification, form analysis
Business model: Hardware sale + monthly subscription.
KinTwin is not a generic security camera slapped into a gym. It's a fitness-specific computer vision engine trained on exercise kinematics, optimized for edge inference on consumer-grade hardware.
Layer 3 | Global Operations
This is where the first two layers pay off.
The behavioral flow data captured (Layer 1) and verified (Layer 2) becomes industry infrastructure:
- For VCs & investors: Real per-square-meter efficiency, equipment utilization curves, retention trajectories. Not survey data — measured data.
- For brands & manufacturers: Usage patterns, training paths, wear-and-tear analytics → R&D input for next-gen equipment.
- For the industry: A fitness behavioral baseline corpus — this data does not exist anywhere today. Zero. Nobody has it.
Layer 3 is the highest-margin layer in the stack. This is where Zeus Protocol operates — not an API gateway, but a verifiable data marketplace where insurance companies, health enterprises, and equipment brands pay protocol fees to access verified behavioral datasets.
The revenue model: data protocol fees.
Why Can't Anyone Else Build This?
I get asked this a lot. Here's my honest answer:
| Component | Why It's Hard |
|---|---|
| Momo (open source) | Privacy-first industry requires radical transparency. Closed-source won't get through the door. |
| KinTwin (edge CV) | Not a generic camera — fitness-specific CV + edge inference. Highly specialized. |
| Three-layer coupling | Without open-source trust, no store access. Without hardware, no data. Without data, no infrastructure. All three must work together. |
And the key constraint: the user owns all their data. Every member's behavioral data is DID-authenticated and MPC-protected. The platform is technically incapable of accessing raw personal data.
One Sentence Summary
Momo uses AI to open the gym door.
KinTwin turns behavior into verifiable data.
Global Ops turns data into industry infrastructure.
We are an AI operating system. Not a hardware company.
Production Reality
Our Wanjian store has been running this stack since day one. Real members. Real equipment. Real behavioral data flowing through all three layers.
Wanjian was chosen intentionally — no premium location, no privileged resources, a deliberately tough starting point. If the stack runs here, it runs anywhere.
The architecture isn't theoretical. It's deployed.
Links
- GitHub (fully open source — Apache 2.0 / MIT): github.com/ZWISERFIT
- Dev.to series: @zwiserfit
- Product Hunt (live): producthunt.com/posts/zwiserfit
⚠️ Anti-Hallucination Checklist
This article has been verified against the following:
- [x] All architectural claims match the deployed production system at Wanjian store
- [x] Momo = named after founder's surname (莫), not a fabricated backstory
- [x] Three-layer parallelism claim is accurate: layers run concurrently, not sequentially
- [x] Open-source status confirmed: Apache 2.0 (framework) + MIT (PoPB) — licenses stated in article
- [x] DID + MPC data ownership model is implemented, not aspirational
- [x] "Behavioral flow data gap" claim reflects known industry blind spot — no fabricated statistics
- [x] "Fitness-specific CV" is distinct from general security cameras — verified hardware spec
- [x] No fictional investors, usage numbers, revenue figures, or unnamed partners mentioned
- [x] Tag set (opensource, ai, architecture, fitness, devops) matches article content
Written in first-person by Suzanne Mok / 莫淑瑜, Founder of ZWISERFIT. This is the authentic architecture narrative of a running system.
Top comments (0)