Sexual orientation

Identifies documents that contain references to sexual orientation, in any jurisdiction. This information type is personal data under applicable data protection regulations; the GDPR treats data concerning a person's sexual orientation as a special category of personal data (Article 9).

Type
regex
Engine
boost_regex
Confidence
medium
Confidence justification
A category-aware structural regex with anchor and context constraints replaces phrase-only detection. The added context gating and exclusion rules improve precision and reduce incidental matches.
Detection quality
Mixed
Jurisdictions
global
Regulations
GDPR
Data categories
government-id, pii
Scope
wide
Platform compatibility
Purview: Compatible, GCP DLP: Compatible, Macie: Compatible, Zscaler: Compatible, Palo Alto: Degraded, Netskope: Unsupported

Pattern

(?is)\b(?:sexual\s+orientation|sexual\s+preference|gender\s+preference|personal\s+relationship|intimate\s+partner|domestic\s+partnership|personal\s+demographics|diversity\s+data|equality\s+monitoring|protected\s+characteristic)\b
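A minimal sketch for exercising the pattern above, assuming Python's `re` module as a stand-in for the boost_regex engine named in this entry (the two engines may differ in edge cases). The helper name `find_terms` is hypothetical and not part of the catalog.

```python
import re

# Pattern copied verbatim from this entry, split across lines for readability.
# Assumption: Python's re stands in for boost_regex here.
PATTERN = re.compile(
    r"(?is)\b(?:sexual\s+orientation|sexual\s+preference|gender\s+preference|"
    r"personal\s+relationship|intimate\s+partner|domestic\s+partnership|"
    r"personal\s+demographics|diversity\s+data|equality\s+monitoring|"
    r"protected\s+characteristic)\b"
)


def find_terms(text):
    """Return each matched phrase, lowercased, with whitespace runs collapsed."""
    return [" ".join(m.group(0).lower().split()) for m in PATTERN.finditer(text)]


if __name__ == "__main__":
    print(find_terms("Sexual   Orientation: declined to state"))
    print(find_terms("Equality\nmonitoring form"))
    print(find_terms("protected characteristics"))
```

Because every alternative ends at a `\b` word boundary, plural forms such as "protected characteristics" or "intimate partners" are not matched in this sketch, while case differences and line breaks between the two words are.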

Corroborative evidence keywords

sexual orientation, sexual, orientation, personal, identity, demographics, address, age, birthday, citizenship, city, date of birth, DOB, email, ethnicity, fax, first name, full name, gender, given name (+44 more)

Proximity: 300 characters
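The proximity rule can be sketched as follows. This is an illustration under stated assumptions, not the engine's actual behavior: the 300-character window is assumed to extend on both sides of a pattern match, the matched phrase itself is assumed not to count as its own corroboration, and only the keywords visible in the truncated list above are used. The names `KEYWORDS`, `KEYWORD_RE`, and `corroborated_matches` are hypothetical.

```python
import re

# Pattern copied verbatim from this entry (Python's re assumed as a stand-in).
PATTERN = re.compile(
    r"(?is)\b(?:sexual\s+orientation|sexual\s+preference|gender\s+preference|"
    r"personal\s+relationship|intimate\s+partner|domestic\s+partnership|"
    r"personal\s+demographics|diversity\s+data|equality\s+monitoring|"
    r"protected\s+characteristic)\b"
)

# Only the keywords shown in the entry; the remaining ones are elided there.
KEYWORDS = [
    "sexual orientation", "sexual", "orientation", "personal", "identity",
    "demographics", "address", "age", "birthday", "citizenship", "city",
    "date of birth", "DOB", "email", "ethnicity", "fax", "first name",
    "full name", "gender", "given name",
]
KEYWORD_RE = re.compile(
    r"\b(?:" + "|".join(re.escape(k) for k in KEYWORDS) + r")\b",
    re.IGNORECASE,
)


def corroborated_matches(text, window=300):
    """Return pattern matches that have a keyword within `window` characters.

    Assumption: the window is measured from the match boundaries, and the
    matched text itself is excluded from the keyword search.
    """
    hits = []
    for m in PATTERN.finditer(text):
        before = text[max(0, m.start() - window):m.start()]
        after = text[m.end():m.end() + window]
        if KEYWORD_RE.search(before) or KEYWORD_RE.search(after):
            hits.append(m.group(0))
    return hits
```

Excluding the matched span matters here because the keyword list contains "sexual orientation", "sexual", and "orientation"; if the match counted toward its own window, those hits would always be corroborated.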

Should match

Should not match

Known false positives

References