Chemical formulas

Identifies documents containing references to chemical formulas in international contexts. This information type is classified as personally identifiable information under applicable data protection regulations.

Type
regex
Engine
boost_regex
Confidence
medium
Confidence justification
category-aware structural regex with anchor and context constraints replaces phrase-only detection. Added context gating and exclusion rules improve precision and reduce incidental matches.
Detection quality
Not detected
Jurisdictions
global
Regulations
GDPR
Data categories
pii
Scope
wide
Platform compatibility
Purview: Compatible, GCP DLP: Compatible, Macie: Compatible, Zscaler: Compatible, Palo Alto: Degraded, Netskope: Unsupported

Pattern

(?is)\b(?:chemical\s+formula|molecular\s+structure|compound\s+composition|active\s+ingredient|chemical\s+compound|proprietary\s+formula|synthesis\s+process|trade\s+secret\s+formula|batch\s+record)\b

Corroborative evidence keywords

chemical formulas, chemical, formulas, intellectual, property, trade, secrets

Proximity: 300 characters

Should match

Should not match

Known false positives

References