ICD-10 Cm

Detects ICD-10 Cm patterns. This pattern is based on a Microsoft Purview built-in sensitive information type. Users already running Purview may prefer to enable the built-in SIT directly, or use this version as a starting point for customisation.

Type
regex
Engine
universal
Confidence
high
Confidence justification
High confidence: structurally constrained pattern with corroborative keyword support reduces false positive rates significantly. Added context gating and exclusion rules improve precision and reduce incidental matches.
Detection quality
Verified
Jurisdictions
global
Frameworks
ISO 27001, ISO 27701, SOC 2
Data categories
phi, healthcare
Scope
narrow
Risk rating
8
Platform compatibility
Purview: Compatible, GCP DLP: Compatible, Macie: Compatible, Zscaler: Compatible, Palo Alto: Compatible, Netskope: Compatible

Pattern

\b[A-TV-Z]\d[A-Z0-9](\.?[A-Z0-9]{0,4})?\b

Corroborative evidence keywords

MRN, medical record number, patient ID, NPI, DEA, medicare, medicaid, insurance ID, member ID, beneficiary, ICD-10, ICD-9, CPT, NDC, SNOMED, HCPCS, diagnosis code, procedure code, drug code, field (+28 more)

Proximity: 300 characters

Should match

Should not match

Known false positives

References

Collections