PESEL
Detects PESEL patterns. This pattern is based on a Microsoft Purview built-in sensitive information type. Users already running Purview may prefer to enable the built-in SIT directly, or use this version as a starting point for customisation.
- Type
- regex
- Engine
- universal
- Confidence
- medium
- Confidence justification
- Medium confidence: pattern has structural constraints but corroborative keywords are recommended to reduce false positive rates.
- Detection quality
- Verified
- Jurisdictions
- eu, pl
- Regulations
- BDSG, CNIL / LIL, GDPR
- Frameworks
- ISO 27001, ISO 27701
- Data categories
- pii, government-id
- Scope
- wide
- Risk rating
- 9
- Platform compatibility
- Purview: Compatible, GCP DLP: Compatible, Macie: Compatible, Zscaler: Compatible, Palo Alto: Compatible, Netskope: Compatible
Pattern
\b\d{11}\b
Corroborative evidence keywords
identifier, number, ID, ID number, identification, ID card, license, permit, registration, certificate
Proximity: 300 characters
Should match
85010112345— Standard PESEL number90123156789— Alternate PESEL00210198765— PESEL for 2000s-born person
Should not match
1234567890— Only 10 digits instead of 11123456789012— 12 digits instead of 111234567890A— Contains a letter instead of all digits
Known false positives
- Long numeric sequences in unrelated contexts (tracking numbers, serial codes) matching the PESEL format Mitigation: Use corroborative keywords and, where available, checksum validation to filter false matches.