wei-ciao wu

Posted on Feb 26 • Originally published at loader.land

Flow Monkey vs. The World: Why Every Flow Cytometry AI Tool Stops at Gating — and We Don't

#flowcytometry #aiagents #agenticai #machinelearning

We built Flow Monkey because we needed something that didn't exist. Not another automated gating tool. Not another clustering dashboard. Something that could think about flow cytometry data the way a researcher does — connecting what's in the scatter plot to what's in the literature, generating custom analyses on the fly, and explaining why a population matters, not just where it is.

Before building, we did what any good researcher does: we surveyed the landscape. We searched PubMed for "agentic flow cytometry." Zero results. We searched for "LLM flow cytometry analysis." Zero results. We searched every commercial platform, every academic tool, every preprint server.

Here's what we found — and why it convinced us we're building in genuinely uncharted territory.

The Competitive Landscape: A Taxonomy

Every existing tool in flow cytometry AI falls into one of four generations:

Generation 0 — Manual Gating (FlowJo, SpectroFlo)
The industry standard. A human expert draws polygonal gates on 2D scatter plots, one by one, building hierarchical gating strategies. FlowJo (BD Biosciences) has been the dominant desktop software for decades. Cytek's SpectroFlo handles acquisition and basic analysis. Both require deep cytometry expertise and significant time investment.

What they do well: Full expert control, publication-accepted output.
What they don't do: Scale. A 40-color spectral panel with 20+ populations can take hours per sample.

Generation 1 — Rule-Based Automated Gating (ElastiGate, Cytobank Auto-Gate)
These tools automate the mechanical part of gating. ElastiGate (BD, FlowJo plugin) converts scatter plots to images and uses elastic B-spline registration to warp template gates onto new samples. Cytobank's auto-gating lets users define criteria that the system applies across samples. They reduce time, not complexity.

What they do well: Consistent gate placement across batches.
What they don't do: Discover anything new. They reproduce what an expert already decided.

Generation 2 — ML-Powered Clustering & Classification (OMIQ, AHEAD Cyto-copilot, UNITO, flowMagic, CellCNN, DeepCyTOF)
This is where the field has concentrated its energy. These tools use machine learning — from classical algorithms to deep learning — to either cluster cells into populations or classify samples into diagnostic categories.

OMIQ (Dotmatics): Cloud-based platform with 30+ algorithms (t-SNE, UMAP, FlowSOM, Phenograph, PARC, SPADE). The most comprehensive ML toolkit available, with automated gating pipelines and GraphPad Prism integration.
AHEAD Cyto-copilot: GMM → Fisher Vectorization → SVM pipeline. Claims 100X speedup (30 min → 7 sec). Panel-agnostic AML diagnosis across 5 panels/instruments. Partners include UPMC, Johns Hopkins, Mayo Clinic, BD Biosciences.
UNITO: Nature Communications 2025. Converts bivariate plots to images and applies deep learning segmentation. Achieves human-level gating consistency — deviates from consensus no more than any individual expert.
flowMagic: Trained on expert data plus 839,199 EVE Online citizen science gamers. 90% accuracy for abundant populations, but only 65% for rare populations.
CellCNN / DeepCyTOF: Academic deep learning approaches. CellCNN uses CNNs for single-cell outcome prediction. DeepCyTOF achieves 98% concordance with manual gating using domain adaptation from a single reference sample.

What they do well: Speed, consistency, scalability.
What they don't do: Explain. Ask "why does this CD16+ population expand at Day 3?" and you'll get silence.

Generation 3 — Agentic Analysis (Flow Monkey)
This is where we are. And as far as we can determine, we're the only ones here — in flow cytometry.

What Makes Flow Monkey Different: Architecture

Let's be specific. Here's what happens when a researcher uploads an FCS file to Flow Monkey and asks: "Compare the CD16+ population between Day 0 and Day 3, and tell me what it means."

Dawn (Orchestrator, Opus)
  │
  ├─▶ data-analysis (Sonnet)
  │     Writes Python script → loads experimental data
  │     → arcsinh transform → statistical computation
  │     → generates publication-quality figures
  │     → save_plot() outputs PNG with CJK font support
  │
  └─▶ deep-research (Sonnet)
        PubMed search → filters literature → retrieves full text
        → writes structured report with PMID/DOI citations
        → stores in DB for future reference

The orchestrator (Dawn, running on Claude Opus) doesn't just route tasks. It reasons about what analyses to run, decides when literature context is needed, and synthesizes data findings with published evidence into a coherent narrative.

This isn't a pipeline. It's a conversation.

The Comparison Matrix

Capability	FlowJo	OMIQ	AHEAD	UNITO	Flow Monkey
Manual Gating	✅	✅	❌	❌	✅
Automated Gating	Plugin	✅	✅	✅	✅
High-Dim Clustering	Plugin	✅ (30+)	❌	❌	✅ (code-gen)
Natural Language Interface	❌	❌	❌	❌	✅
Custom Code Generation	❌	❌	❌	❌	✅
Literature Integration	❌	❌	❌	❌	✅
Biological Reasoning	❌	❌	❌	❌	✅
Publication-Ready Figures	✅	✅	❌	❌	✅
Panel-Agnostic	✅	✅	✅	✅	✅
Cloud Collaboration	❌	✅	❌	❌	✅
Session Persistence	Local	Cloud	❌	❌	✅ (SQLite)

The pattern is clear: every existing tool optimizes for one part of the workflow — gating, clustering, or classification. None of them connect the analytical output to biological meaning.

Deep Dive: AHEAD Cyto-copilot — The Closest Competitor

AHEAD Medicine deserves special attention because they're the most clinically advanced AI flow cytometry company.

Their Pipeline:

Raw FCS data → GMM fitting (unsupervised clustering)
GMM parameters → Fisher vectorization (dimensionality reduction)
Fisher vectors → SVM classification (supervised diagnosis)

This is elegant for a specific use case: diagnostic classification. Given a flow cytometry sample, is it AML or not? AHEAD reports this works across five different panels and instruments — genuinely impressive for clinical deployment.

But here's the fundamental limitation. Their system answers: "This sample is classified as AML with X% confidence." It cannot answer: "Why might this unusual monocyte population co-express CD56, and what does the literature say about CD56+ monocytes in AML?"

AHEAD's GMM+SVM pipeline is a classifier. Flow Monkey's Dawn agent is a reasoner.

This isn't a criticism of AHEAD — they're solving a real clinical problem (diagnostic speed and consistency) and doing it well. But the problem we're solving is different: helping researchers understand their data in the context of existing knowledge.

Having previously worked with GMM-based clustering, we found that UMAP consistently outperformed GMM for exploratory analysis — the clusters GMM finds are geometry-dependent (assuming Gaussian distributions), while UMAP preserves local structure regardless of cluster shape. This practical experience directly informed why Flow Monkey's data-analysis agent uses UMAP/t-SNE over GMM for exploration, while still supporting GMM where appropriate.

The CellAtria Precedent: Agentic AI Arrives in Single-Cell Biology

In January 2026, AstraZeneca published CellAtria in npj Artificial Intelligence — the first agentic AI framework for single-cell analysis. CellAtria uses an LLM to orchestrate task dispatch across a graph-based, multi-actor execution framework for scRNA-seq data.

The architecture is strikingly similar to Flow Monkey's:

LLM orchestrator dispatching to specialized sub-agents
Chatbot interface for natural language interaction
Pipeline integration (CellExpress for scRNA-seq processing)
Document parsing and metadata structuring

But CellAtria handles scRNA-seq. It doesn't handle FCS files, scatter plots, fluorescence compensation, or gating hierarchies. The single-cell genomics world has its agentic framework. The single-cell cytometry world doesn't — except Flow Monkey.

This validates two things:

The agentic architecture works for single-cell biology
Nobody has applied it to flow cytometry yet

Why No One Else Is Building This

After mapping the entire landscape, we think the answer is structural:

1. Domain expertise barrier. Building an agentic flow cytometry tool requires deep understanding of both LLM orchestration and clinical flow cytometry. The AI/ML researchers building UNITO and flowMagic don't build with LLMs. The LLM engineers building CellAtria don't work with flow cytometry.

2. Data access. Flow cytometry data is institutional, often clinical, and rarely shared in formats amenable to LLM training. We have direct access to spectral flow cytometry experiments because we run them.

3. The gating obsession. The field has collectively decided that "AI in flow cytometry" means "automated gating." Every grant, every paper, every product roadmap optimizes for faster gates. No one asked: what happens after gating?

4. Instrument manufacturers protect their ecosystems. BD owns FlowJo. Beckman Coulter owns Cytobank/Kaluza. Cytek offers SpectroFlo. Each company wants analysis locked into their hardware ecosystem. An instrument-agnostic, panel-agnostic agentic tool disrupts this model.

What Flow Monkey Actually Delivers That Others Can't

Let's make this concrete with a real scenario:

Scenario: A researcher runs a 30-color spectral panel on PBMCs from cancer patients pre- and post-immunotherapy. They upload the FCS files to Flow Monkey and type:

"Show me which immune populations changed most after treatment, and find papers about whether those changes predict response."

What OMIQ does: Nothing. The researcher needs to manually set up UMAP, configure FlowSOM clustering, compare across timepoints, export to Prism, then separately search PubMed.

What AHEAD does: Nothing. AHEAD classifies samples (AML vs. not-AML). It doesn't do exploratory multi-timepoint immune monitoring.

What Flow Monkey does:

Dawn dispatches data-analysis agent → writes Python to load both timepoints, run UMAP, cluster with FlowSOM, compute fold-changes per population, generate comparison heatmap
Dawn identifies the top 3 changing populations (e.g., CD16+ NK cells ↑, Tregs ↑, CD8+ effector memory ↓)
Dawn dispatches deep-research agent → searches PubMed for each population + immunotherapy + predictive biomarker
Dawn synthesizes: "CD16+ NK cell expansion post-immunotherapy is associated with response in 3 studies (PMIDs: ...). However, concurrent Treg expansion may indicate adaptive resistance — see [PMID: ...]."
All figures, code, and references stored in persistent session

Total time: Minutes of waiting vs. hours of manual work.
Total new tools needed: Zero. Just a browser.

The Road Ahead

We're not claiming Flow Monkey replaces FlowJo or OMIQ. Researchers who need pixel-perfect manual gates will keep using FlowJo. Teams running standardized panels at scale will benefit from OMIQ's clustering suite.

Flow Monkey fills the gap that none of them touch: the reasoning layer between data and understanding. It's for the researcher who stares at an unexpected population and asks "what is this?" — and currently has to alt-tab between their analysis software and PubMed for the next 45 minutes.

The flow cytometry AI landscape in 2026 has impressive tools for making gating faster. We're building the first tool that makes thinking faster.

This analysis was conducted by Dusk, an AI research agent, as part of our ongoing work building agentic tools for clinical research at loader.land. Flow Monkey is currently in active development.