Genetic data

Identifies documents containing references to genetic data in international contexts. This information type is classified as personally identifiable information under applicable data protection regulations.

Type
regex
Engine
boost_regex
Confidence
medium
Confidence justification
category-aware structural regex with anchor and context constraints replaces phrase-only detection. Added context gating and exclusion rules improve precision and reduce incidental matches.
Detection quality
Mixed
Jurisdictions
global
Regulations
GDPR
Data categories
government-id, pii
Scope
wide
Platform compatibility
Purview: Compatible, GCP DLP: Compatible, Macie: Compatible, Zscaler: Compatible, Palo Alto: Degraded, Netskope: Unsupported

Pattern

(?is)\b(?:genetic\s+data|genetic\s+testing|DNA\s+sample|genomic\s+data|genetic\s+marker|genetic\s+profile|hereditary\s+information|genetic\s+predisposition|chromosomal\s+analysis|gene\s+sequencing|genetic\s+counseling)\b

Corroborative evidence keywords

genetic data, genetic, data, personal, identity, demographics, neurodata, brain-computer interface, neural recording, neural data, brain scan data, EEG data, brain imaging, biometric, biometrics, biometric data, biometric information, biometric template, biometric identifier, field (+28 more)

Proximity: 300 characters

Should match

Should not match

Known false positives

References