DNI

Name: DNI
Creator: TestPattern Community
License: https://opensource.org/licenses/MIT

Detects DNI patterns. This pattern is based on a Microsoft Purview built-in sensitive information type. Users already running Purview may prefer to enable the built-in SIT directly, or use this version as a starting point for customisation.

Type: regex
Engine: universal
Confidence: medium
Confidence justification: Medium confidence: pattern has structural constraints but corroborative keywords are recommended to reduce false positive rates.
Detection quality: Verified
Jurisdictions: eu, es
Regulations: BDSG, CNIL / LIL, GDPR
Frameworks: ISO 27001, ISO 27701
Data categories: pii, government-id
Scope: narrow
Risk rating: 8
Platform compatibility: Purview: Compatible, GCP DLP: Compatible, Macie: Compatible, Zscaler: Compatible, Palo Alto: Compatible, Netskope: Compatible

Pattern

\b\d{8}[A-Z]\b

Corroborative evidence keywords

identifier, number, ID, ID number, identification, ID card, license, permit, registration, certificate

Proximity: 300 characters

Should match

12345678Z — Standard Spanish DNI
00000001R — Low-range DNI
99999999A — Upper-range DNI

Should not match

1234567Z — Only 7 digits instead of 8
123456789Z — 9 digits instead of 8
12345678 — Missing trailing letter

Known false positives

Common words and phrases related to dni appearing in policy documents, training materials, HR templates, or compliance guidelines without actual personal data. Mitigation: Require corroborative evidence keywords within the proximity window to confirm sensitive data context rather than general discussion.
In multiple EU languages, similar terminology used in formal or administrative contexts (education, professional documentation) that does not constitute sensitive data collection. Mitigation: Layer with additional contextual signals such as structured identifiers, form fields, or database column headers to distinguish sensitive records from general references.

References

Spain DNI entity definition - Microsoft Learn

DNI