About TestPattern

The open pattern library for Data Loss Prevention.

TestPattern is a free, community-curated collection of regex patterns, keyword lists, and classification rules for detecting sensitive information in documents and data streams. Think of it as Sigma for DLP.

Why this exists

If you've ever deployed DLP, you know the drill. You need a Medicare number regex. A credit card pattern. A passport detector. So you write it yourself, test it against real documents, discover the false positives, fix them, and hope you haven't missed an edge case.

The team at the next company over is doing the exact same thing. So is every other security team in the country. Nobody shares their work.

Which is strange, because every adjacent security domain figured out pattern sharing years ago:

DLP had nothing. No shared patterns. No standard format. Every team started from zero.

TestPattern is here to fix that.

What you get

Who's behind this

TestPattern is a community project sponsored by Compl8, the compliance toolkit for Microsoft Purview.

It was started by compliance engineers who were tired of writing the same regex patterns from scratch at every engagement. The patterns are open source. The project is open source.

Get in touch