sentinel-safety
Privacy-Preserving Threat Federation: How Platforms Can Share Intelligence Without Sharing Data

Here's a problem that every trust and safety team eventually runs into: predators don't stay on one platform.

A person who is caught grooming children on Platform A, banned, and deleted — simply opens an account on Platform B. Platform B has no way of knowing. Platform B's moderation team starts from zero. The predator has a clean slate.

This isn't a hypothetical. It's documented behavior. In the child safety space, researchers have consistently found that serial offenders operate across multiple platforms simultaneously, maintaining different personas for different targets. When one platform bans an account, another platform absorbs the risk.

The obvious solution is for platforms to share information. But sharing information between platforms creates serious privacy problems. How do you tell Platform B about a threat without giving Platform B access to Platform A's users' private communications?

This is the federation problem, and solving it correctly is genuinely difficult.


What Platforms Have Tried

The main existing approach to cross-platform threat sharing in the child safety space is perceptual hash matching — most famously implemented by Microsoft's PhotoDNA, with hash databases maintained by NCMEC, IWF, and others.

The idea is elegant: take a known piece of CSAM, compute a hash that captures its visual "fingerprint," share that fingerprint without sharing the image. When another platform encounters a matching image, they can detect it without ever seeing the original.
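To make the "visual fingerprint" idea concrete, here is a toy difference-hash (dHash) sketch. This is not PhotoDNA, whose algorithm is proprietary and far more robust; it only illustrates how a compact hash can capture visual structure and how near-duplicates are detected by bit distance rather than exact equality.

```python
# Toy difference-hash (dHash) sketch: NOT PhotoDNA -- purely illustrative.
from typing import List

def dhash(pixels: List[List[int]]) -> int:
    """Hash an 8x9 grayscale grid: one bit per left/right brightness comparison."""
    assert len(pixels) == 8 and all(len(row) == 9 for row in pixels)
    bits = 0
    for row in pixels:
        for left, right in zip(row, row[1:]):
            bits = (bits << 1) | (1 if left > right else 0)
    return bits  # 64-bit fingerprint

def hamming(a: int, b: int) -> int:
    """Number of differing bits: a small distance means visually similar images."""
    return bin(a ^ b).count("1")
```

Because similar images produce fingerprints with small Hamming distance, a platform can match a resized or re-encoded copy of known material without ever possessing the original image.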

This works extremely well for CSAM detection. It's been responsible for tens of millions of reports globally.

But hash matching has a hard limitation: it only works for content that has already been identified. It cannot detect new offenders. It cannot detect behavioral patterns. And it cannot detect grooming, which typically involves no CSAM at all in its early stages — just ordinary conversation.

For behavioral threat intelligence, no equivalent infrastructure exists. When Platform A bans a groomer after a three-month escalation pattern, Platform B learns nothing.


What "Federation" Could Mean

In the security world, threat intelligence federation is well-established. MISP, STIX/TAXII, and other standards allow organizations to share indicators of compromise (IoCs), attack signatures, and threat actor TTPs. The question is whether this can be adapted to behavioral threat intelligence for child safety in a privacy-preserving way.

The challenge is that behavioral threat intelligence is inherently more sensitive than, say, a malicious IP address. A behavioral threat record might contain:

  • Temporal patterns (when this account was active)
  • Linguistic pattern features (style of communication)
  • Relationship graph structure (how many connections, what frequency)
  • Account metadata (creation date, device fingerprints)

Any of this, transmitted in raw form, is personally identifiable data subject to GDPR, CCPA, and other privacy regulations. Platform A cannot simply export these records and transmit them to Platform B.


The Cryptographic Signature Approach

SENTINEL's federation layer uses a different model: instead of sharing behavioral data, it shares cryptographic signatures derived from behavioral patterns.

Here's the key insight: you don't need to share the data to share the threat signal. You need to share a representation of the data that is:

  1. Specific enough that Platform B can detect the same threat
  2. Generic enough that it doesn't reveal personal information about the individual who generated it
  3. Mathematically bound to the actual behavioral evidence, so it can't be fabricated

In practice, the flow looks like this:

Platform A detects a confirmed grooming pattern. Their system generates a behavioral signature: a vector representation derived from the multi-dimensional behavioral profile of this pattern — the combination of linguistic drift, temporal escalation, graph structure, and contact dynamics that together characterized this threat. This vector is not reversible back to the original behavioral data. It's a representation, not a copy.
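One way a non-reversible signature like this could be derived is a dimensionality-reducing random projection: mapping a high-dimensional behavioral feature vector into a much smaller space, so many distinct inputs collapse to the same region and the raw features cannot be recovered. The feature dimensions, names, and seed below are illustrative assumptions, not SENTINEL's actual scheme; this sketch also uses a shared deterministic projection for simplicity, whereas the post later notes that production signatures mix in platform-specific entropy.

```python
# Hypothetical signature derivation via random projection. Dimensions and
# seed are illustrative; the 32 << 128 reduction makes the map many-to-one,
# so the signature is not reversible back to the raw behavioral features.
import math
import random

FEATURE_DIM = 128   # raw behavioral features (linguistic drift, temporal, graph...)
SIG_DIM = 32        # signature dimension, much smaller than the feature space

def make_projection(seed: int = 42) -> list:
    """Fixed random projection matrix shared by federation participants."""
    rng = random.Random(seed)
    return [[rng.gauss(0, 1) for _ in range(FEATURE_DIM)] for _ in range(SIG_DIM)]

def signature(features: list, proj: list) -> list:
    """Project and L2-normalize: keeps the pattern's geometry, not its raw values."""
    vec = [sum(w * x for w, x in zip(row, features)) for row in proj]
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]
```

Normalizing to unit length means downstream matching can use a simple dot product as cosine similarity.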

Platform A submits this signature (with no user PII attached) to the SENTINEL federation service. The federation service stores signatures only — it never receives raw behavioral data.

Platform B, when analyzing a new user's behavior, computes behavioral vectors from their own platform's data and queries the federation service: does this user's behavioral profile match any known threat signature?

If there's a match above a confidence threshold, Platform B gets an alert: "This account's behavioral pattern matches a confirmed threat signature from a federated platform." The alert contains no information about which platform generated the signature, who the original account was, or what specifically they did.
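The query-and-alert step above can be sketched as a similarity search against stored signatures, gated by a confidence threshold. The threshold value and the alert shape are assumptions for illustration; note that the alert deliberately carries no information about the source platform or the original account.

```python
# Hypothetical federation query: cosine similarity of the querying platform's
# behavioral vector against stored threat signatures, gated by a threshold.
def cosine(a: list, b: list) -> float:
    """Vectors are assumed L2-normalized, so the dot product is the cosine."""
    return sum(x * y for x, y in zip(a, b))

def query(candidate: list, stored_signatures: list, threshold: float = 0.9) -> dict:
    """Return an anonymous alert if any stored signature matches above threshold."""
    best = max((cosine(candidate, s) for s in stored_signatures), default=0.0)
    if best >= threshold:
        # No source platform, no original account, no PII -- just the match signal.
        return {"match": True, "confidence": round(best, 3)}
    return {"match": False}
```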

Platform B's moderators review the alert, examine the current platform's own behavioral data, and make an independent determination.


What the Federation Service Knows

The federation service in this architecture is privacy-minimal:

  • It stores behavioral signature vectors (not reversible to personal data)
  • It knows which platform submitted each signature (for federation governance)
  • It knows when each signature was submitted
  • It does not know: any platform's users, any user's identity, the content of any conversation, or any PII
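The record stored by such a service could be as minimal as the sketch below. The field names are illustrative, not SENTINEL's actual schema; the point is what is absent: no user identifier, no conversation content, no PII of any kind.

```python
# Sketch of a privacy-minimal federation record. Field names are illustrative.
# Note what is absent: no user id, no content, no PII.
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import List

@dataclass(frozen=True)
class FederatedSignature:
    vector: List[float]          # behavioral signature, not reversible to raw data
    submitting_platform: str     # kept only for federation governance
    submitted_at: datetime = field(
        default_factory=lambda: datetime.now(timezone.utc))
```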

This means a compromise of the federation service does not expose user data from any platform. An attacker who gains access to the signature database gets a set of high-dimensional vectors with no direct link to individuals.


The Trust Problem in Federation

Cryptographic privacy is necessary but not sufficient. There's also a trust problem: for federation to work, Platform B needs to be able to trust that signatures submitted by Platform A represent real, confirmed threats — not false positives, not fabricated data.

This is where federation governance matters.

SENTINEL's federation model is opt-in, and participation requires a signed federation agreement. Platforms that submit signatures to the federation service attest that the signature represents a confirmed grooming pattern — not just a flagged behavior, not an unreviewed algorithmic output, but a human-reviewed, confirmed case.

This creates accountability. A platform that submits low-quality signatures (high false positive rate, or — worse — deliberately weaponized signals targeting legitimate users) can be suspended from the federation.

The governance model draws from the existing ISAC (Information Sharing and Analysis Center) model used in cybersecurity. The key adaptations for child safety context are:

  • Stricter confirmation requirements before signature submission (human review required, not just algorithmic flag)
  • Conservative alert handling (a match is treated as a reason to investigate, not a reason to ban)
  • Right to appeal — users flagged via federated signatures have a clear process to challenge the match

Privacy Preservation in Practice

Three specific privacy risks need to be addressed in any federation system:

Risk 1: Signature linkage across platforms.

If behavioral signatures are deterministic — the same behavioral data always produces the same signature — then Platform A and Platform B could cross-reference their signature databases to identify users who have accounts on both platforms. This is a privacy violation even if neither platform has the other's data.

SENTINEL's signatures are non-deterministic: they incorporate platform-specific entropy, so the same underlying behavioral pattern produces different signatures on different platforms. Cross-platform account linkage is not possible from signatures alone.
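The linkage risk and its mitigation can be demonstrated with keyed hashing. This is an illustration of the principle, not SENTINEL's actual construction: a deterministic digest of the same underlying pattern is identical everywhere and therefore linkable, while a digest keyed with platform-specific entropy differs per platform.

```python
# Illustration only: deterministic digests of the same behavioral fingerprint
# are linkable across platforms; digests keyed with per-platform secrets are not.
import hashlib
import hmac

pattern = b"behavioral-fingerprint-bytes"  # same underlying pattern on both platforms

# Deterministic: identical output, so platforms could cross-reference databases.
det_a = hashlib.sha256(pattern).hexdigest()
det_b = hashlib.sha256(pattern).hexdigest()

# Keyed with platform-specific entropy: outputs differ, so the digests alone
# cannot be used to link accounts across platforms.
salted_a = hmac.new(b"platform-a-secret", pattern, hashlib.sha256).hexdigest()
salted_b = hmac.new(b"platform-b-secret", pattern, hashlib.sha256).hexdigest()
```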

Risk 2: Inference of sensitive attributes from behavioral patterns.

Behavioral patterns can inadvertently encode demographic information. A behavioral detection system trained on datasets that skew toward certain demographic groups might produce signatures that are statistically correlated with age, ethnicity, or gender. This is both a fairness problem and a privacy problem.

SENTINEL addresses this through the fairness gate: before any behavioral model is used to generate federation signatures, it must pass a demographic parity audit. If signature generation is found to be correlated with protected attributes, the model cannot be deployed.
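A demographic parity audit of this kind can be sketched as a comparison of signature-generation rates across groups. The 0.8 threshold below echoes the four-fifths rule from employment-fairness practice; both the threshold and the function shape are assumptions for illustration, not SENTINEL's documented gate.

```python
# Sketch of a demographic parity audit ("fairness gate"). The 0.8 threshold
# echoes the four-fifths rule and is an assumption, not SENTINEL's actual value.
def parity_audit(flags_by_group: dict, threshold: float = 0.8) -> bool:
    """flags_by_group maps group -> (num_flagged, num_total).
    Returns True if the lowest group rate is within `threshold` of the
    highest group rate, i.e. the model passes the gate."""
    rates = [flagged / total for flagged, total in flags_by_group.values() if total]
    if not rates:
        return True
    return min(rates) / max(rates) >= threshold
```

A model whose signature generation flags one group at a sharply higher rate than another fails the audit and, under this policy, cannot be deployed.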

Risk 3: Abuse of the federation alert mechanism.

If Platform B receives an alert saying "this user matches a known threat signature," that alert itself is sensitive. It needs to be treated as confidential information — not disclosed to the flagged user (which would tip off the threat) and not retained longer than necessary for the moderation review.

SENTINEL's federation alerts are ephemeral: they're generated on query, delivered to Platform B's moderation queue, and not stored by the federation service. Platform B's own retention policies apply to the alert records, subject to the same erasure handling as other behavioral data.


What This Looks Like for a Small Platform

If you're a small platform considering federation participation, the operational picture is:

  1. You submit signatures only for confirmed grooming cases that have been reviewed by a human moderator.

  2. Your federation queries run asynchronously in the background as part of normal behavioral analysis — you don't need to integrate a separate federation lookup step.

  3. When you receive a federation match alert, it appears in your moderation queue alongside SENTINEL's own behavioral risk score for that user. The alert is one data point; your moderator reviews it alongside your platform's own evidence.

  4. Nothing from your platform — no user data, no conversation content, no PII — is ever transmitted to the federation service or to any other platform.

The federation participation agreement is included in SENTINEL's repository. It covers the confirmation requirements, dispute resolution, and grounds for suspension.


The Larger Picture

The fundamental problem — predators migrating between platforms — won't be solved by any single platform improving its own detection. It requires coordination.

The CSAM hash matching infrastructure (PhotoDNA / NCMEC / IWF) shows that privacy-preserving cross-platform coordination is achievable at scale. The same principle — share signatures, not content — can be extended to behavioral threat intelligence.

The infrastructure to do it exists. The open question is whether the industry will adopt it. That adoption requires trust, governance, and tooling that makes participation low-friction for small platforms that don't have dedicated T&S engineering teams.

That's what SENTINEL's federation layer is designed to be: production-grade behavioral threat federation that a small platform can deploy in an afternoon, participate in responsibly, and benefit from within days.


SENTINEL is an open-source behavioral intelligence platform for child safety compliance. The federation module is part of the core platform. Free for platforms under $100k revenue. GitHub: https://github.com/sentinel-safety/SENTINEL
