Building a Bitmask-Based LLM Security Firewall with reskSecure

#python #security #llm #opensource

Most LLM safety filters scan output after generation. We built reskSecure to stop unwanted tokens before they are sampled, using a bitmask-based firewall at the logits layer.

How It Works

Instead of regex-matching outputs, reskSecure intercepts the logits (the raw scores behind the probability distribution) right before token selection:

Block mode: Logits of forbidden tokens are set to -inf, so their probability after softmax is exactly zero and they can never be sampled
Penalty mode: Logits of unwanted tokens are lowered, so those tokens become unlikely but remain possible

In block mode, the model cannot emit a blocked token, so any sequence that requires one can never be generated.
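
To make the two modes concrete, here is a minimal sketch in plain Python. It is not reskSecure's actual API: `build_mask`, `apply_firewall`, the `penalty` parameter, and the choice to implement penalty mode by subtracting a fixed value from the logit are all assumptions made for illustration. The bitmask is a single integer whose bit `i` is set when token id `i` is restricted.

```python
import math

def build_mask(blocked_ids):
    """Pack restricted token ids into one integer bitmask (hypothetical helper)."""
    mask = 0
    for token_id in blocked_ids:
        mask |= 1 << token_id
    return mask

def apply_firewall(logits, mask, mode="block", penalty=5.0):
    """Return new logits with masked tokens blocked or penalized.

    Assumption: penalty mode subtracts a fixed value from the logit,
    which scales the token's unnormalized probability by exp(-penalty).
    """
    filtered = []
    for token_id, logit in enumerate(logits):
        if (mask >> token_id) & 1:
            filtered.append(-math.inf if mode == "block" else logit - penalty)
        else:
            filtered.append(logit)
    return filtered

def softmax(logits):
    """Numerically stable softmax; assumes at least one token is not blocked."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.5, 0.1]   # toy vocabulary of four tokens
mask = build_mask([1, 3])        # restrict token ids 1 and 3

blocked = softmax(apply_firewall(logits, mask, mode="block"))
penalized = softmax(apply_firewall(logits, mask, mode="penalty"))
```

In `blocked`, tokens 1 and 3 end up with probability exactly zero, so no sampler can pick them. In `penalized`, they keep a small nonzero probability, which matches the "unlikely but possible" behavior described above.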

Quick Start

pip install resksecure
