DEV Community

Cover image for Remove Protected Data with No-code
Vlad L
Vlad L

Posted on

Remove Protected Data with No-code

remove-pii is a no-code solution that creates shareable CSVs while removing protected data, PII and PHI with just a few clicks.

Working with third parties, contractors and partners who need access to files containing protected data means walking a tightrope between providing useful data without violating privacy, HIPAA or GDPR.

Start Simple

Let's remove protected data from a CSV without writing any code using an example file.

Remove protected data with no code

This walk-through will use Although we're working with a website, no data is transmitted to any server; all processing is handled in the browser. An offline version is also available.

Now, we'll provide our CSV to the website. This will parse the headers and allow us to replace any data within the file. Any unset columns pass through to the final file.

De-identifying a file with protected data

De-identification Examples

Let's set a data-replacement strategy for a column by clicking on "Set Strategy ...". This will replace data in our document with the function we select. We can use a text search to find data-replacers or scroll through all available approaches:

Email address:

De-identify email address

Phone number:

De-identify phone number

Credit card number:

De-identify credit card number

Bitcoin address:

De-identify credit bitcoin address

Order date:

De-identify order date

Now we have strategies to replace data in our file we can Save the updated file and view the result:

Protected data replaced in CSV

Done! With just a few clicks we've replaced all protected data within the example file. If we need to repeat this process on multiple files we can download the replacement strategy by clicking Download Strategy and use redact PII offline.

Preserve Data Relationships

What if we need to preserve the relationships within the data while replacing values? Notice that email repeats in our example file (e.g. Nora.OReilly21 and Preston1) but is replaced with new, different values in the final file. If we want to replace this data with the same new value we'll enable "Preserve" on this column. Now, when we save the updated file any repeated email addresses will be replaced with the same, new value.

Preserve relationships in replaced data

Need More?

For those who are (understandably) cautious: just the headers in a CSV can be provided to create a removal strategy which can be downloaded separately. Then the offline application can be used to remove protected data, PII or PHI. If necessary, the replacement strategy can be created manually in JSON. For those interested, a dedicated version of exclusive to your organization can be provided.

Please contact us at with any questions.

Top comments (0)