เลขประจำตัวประชาชน
Detects เลขประจำตัวประชาชน patterns. This pattern is based on a Microsoft Purview built-in sensitive information type. Users already running Purview may prefer to enable the built-in SIT directly, or use this version as a starting point for customisation.
- Type
- regex
- Engine
- universal
- Confidence
- high
- Confidence justification
- High confidence: pattern has strong structural constraints (specific format, prefix, or character class restrictions) that significantly reduce false positive rates.
- Detection quality
- Verified
- Jurisdictions
- th
- Regulations
- PDPA (TH)
- Frameworks
- ISO 27001, ISO 27701
- Data categories
- pii, government-id
- Scope
- narrow
- Risk rating
- 9
- Platform compatibility
- Purview: Compatible, GCP DLP: Compatible, Macie: Compatible, Zscaler: Compatible, Palo Alto: Compatible, Netskope: Compatible
Pattern
\b\d-\d{4}-\d{5}-\d{2}-\d\b
Corroborative evidence keywords
เลขประจำตัว, population ID, citizen ID, บัตรประชาชน, ID number, identification, ID card, license, permit, registration, certificate
Proximity: 300 characters
Should match
1-1234-56789-01-2— Thai ID with standard format3-9876-54321-09-8— Another Thai ID5-4567-89012-34-5— Valid Thai ID format
Should not match
1-123-56789-01-2— Too few digits in second group1-1234-5678-01-2— Too few digits in third group
Known false positives
- The distinctive dash-separated format (X-XXXX-XXXXX-XX-X) is highly specific and rarely matches non-identity data. Mitigation: The structured format provides excellent inherent validation with very low false positive rates.
- In multiple languages, similar terminology used in formal or administrative contexts (education, professional documentation) that does not constitute sensitive data collection. Mitigation: Layer with additional contextual signals such as structured identifiers, form fields, or database column headers to distinguish sensitive records from general references.