DEV Community

Discussion on: I Asked 6 AIs to Pick a Random Number. Their Training Data Confessed Everything.

freerave

"Training data leakage" is a much sharper way to frame this — I'll borrow that framing in future writing if you don't mind.

The confidential computing angle is genuinely interesting. In that context, the bias isn't just a curiosity; it becomes an attack surface. If an adversary can predict which "random" values an LLM-assisted system tends to produce, that's exploitable.
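To put a rough number on "exploitable", here is a minimal sketch comparing an attacker's best single guess against a uniform source and a skewed one. The skewed distribution is entirely made up for illustration (the weights on 42 and 37 are hypothetical, not measurements from the article), and the function names are mine.

```python
import math


def best_guess_success(dist):
    """Probability that an adversary's single most-likely guess is correct."""
    return max(dist.values())


def min_entropy_bits(dist):
    """Min-entropy in bits: how unpredictable the source is to a best-guess attacker."""
    return -math.log2(max(dist.values()))


# Ideal source: every integer from 1 to 100 is equally likely.
uniform = {n: 1 / 100 for n in range(1, 101)}

# HYPOTHETICAL skewed source: invented weights, for illustration only.
# Half of the probability mass sits on two "favorite" numbers.
skewed = {n: 0.50 / 98 for n in range(1, 101)}
skewed[42] = 0.30
skewed[37] = 0.20

for name, dist in (("uniform", uniform), ("skewed", skewed)):
    print(
        f"{name}: best guess succeeds {best_guess_success(dist):.2%} of the time, "
        f"min-entropy {min_entropy_bits(dist):.2f} bits"
    )
```

The point is only that a skew toward a few favorites collapses the effective entropy an attacker has to beat, even though the nominal range is still 1 to 100.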

Would be curious what mitigations you've seen work in practice — hardware entropy sources, sandboxing the LLM from any randomness-sensitive operations entirely, or something else?
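For the "keep the LLM away from randomness" option, the simplest version I can picture is below: a sketch, assuming the application code (not the model) produces every security-relevant value. It uses Python's standard `secrets` module, which draws from the operating system's randomness source; whether that source is hardware-backed depends on the platform. The helper names `pick_number` and `new_token` are hypothetical.

```python
import secrets


def pick_number(low, high):
    """Return an integer in [low, high] drawn from the OS CSPRNG, never from the model."""
    if low > high:
        raise ValueError("low must not exceed high")
    # secrets.randbelow(n) returns a uniformly distributed int in [0, n).
    return low + secrets.randbelow(high - low + 1)


def new_token(nbytes=32):
    """Return a hex-encoded random token built from nbytes of OS randomness."""
    return secrets.token_hex(nbytes)


if __name__ == "__main__":
    print(pick_number(1, 100))
    print(new_token())
```

The design choice here is that the model may decide that a random value is needed, but the value itself comes from a call like these, so the model's preferences never touch the output.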