Genomic testing results
Identifies references to genomic testing results in healthcare and patient records. Such data is protected health information under applicable data protection regulations.
- Type: regex
- Engine: boost_regex
- Confidence: medium
- Confidence justification: Identifier/document-structure anchored regex with constrained context replaces phrase-only detection. Added context gating and exclusion rules improve precision and reduce incidental matches.
- Detection quality: Mixed
- Jurisdictions: global
- Regulations: GDPR
- Data categories: healthcare, phi
- Scope: wide
- Platform compatibility: Purview: Compatible, GCP DLP: Compatible, Macie: Compatible, Zscaler: Compatible, Palo Alto: Degraded, Netskope: Unsupported
Pattern
(?is)\b(?:genomic\s+testing|genetic\s+testing|genome\s+sequencing|DNA\s+analysis|genetic\s+variant|hereditary\s+risk|gene\s+mutation|whole\s+genome|genetic\s+counseling)\b
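As a rough illustration, the pattern can be exercised outside the DLP engine. The sketch below uses Python's `re` module rather than boost_regex, on the assumption that the two behave the same for this particular expression (inline `(?is)` flags, `\b`, `\s+`). The helper name `find_genomic_phrases` is hypothetical.

```python
import re

# Pattern copied from this entry; split across lines only for readability.
# Assumption: Python's re treats (?is), \b and \s+ the same way boost_regex does here.
GENOMIC_PATTERN = re.compile(
    r"(?is)\b(?:genomic\s+testing|genetic\s+testing|genome\s+sequencing|"
    r"DNA\s+analysis|genetic\s+variant|hereditary\s+risk|gene\s+mutation|"
    r"whole\s+genome|genetic\s+counseling)\b"
)


def find_genomic_phrases(text: str) -> list[str]:
    """Return every topic phrase the pattern finds in the text, in order."""
    return [m.group(0) for m in GENOMIC_PATTERN.finditer(text)]


if __name__ == "__main__":
    print(find_genomic_phrases("Genetic Testing was ordered after DNA analysis."))
    print(find_genomic_phrases("placeholder value 12345"))
```

Because the expression matches topic phrases only, any text that mentions one of them is a hit, whether or not it concerns a specific patient.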
Corroborative evidence keywords
genomic testing results, genomic, testing, results, health, biomedical, information, patient, clinical, medical, hospital, practitioner, diagnosis, treatment, prescription, physician, nurse, therapy, examination, consultation (+30 more)
Proximity: 300 characters
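One way to picture the proximity rule is as a character window around each primary match. The sketch below is an assumption about how such a check could work, not a description of any particular platform: it looks 300 characters to each side of the match, excludes the matched phrase itself, and uses only a handful of the listed keywords (the full list is elided in this entry). `corroborated_matches` is a hypothetical helper.

```python
import re

PRIMARY = re.compile(
    r"(?is)\b(?:genomic\s+testing|genetic\s+testing|genome\s+sequencing|"
    r"DNA\s+analysis|genetic\s+variant|hereditary\s+risk|gene\s+mutation|"
    r"whole\s+genome|genetic\s+counseling)\b"
)

# Illustrative subset of the corroborative keywords listed in this entry.
KEYWORDS = ["patient", "clinical", "diagnosis", "physician", "hospital"]
KEYWORD_RE = re.compile(
    r"\b(?:" + "|".join(re.escape(k) for k in KEYWORDS) + r")\b", re.IGNORECASE
)

PROXIMITY = 300  # characters, as stated in this entry


def corroborated_matches(text: str) -> list[str]:
    """Return primary matches that have a keyword within PROXIMITY characters."""
    hits = []
    for m in PRIMARY.finditer(text):
        before = text[max(0, m.start() - PROXIMITY):m.start()]
        after = text[m.end():m.end() + PROXIMITY]
        # Assumption: the matched phrase itself does not count as evidence.
        if KEYWORD_RE.search(before + " " + after):
            hits.append(m.group(0))
    return hits
```

Excluding the match from the window matters because several listed keywords ("genomic", "testing") overlap with the topic phrases and would otherwise corroborate every hit.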
Should match
- genomic testing: Primary topic phrase match
- genetic testing: Case-insensitive topic phrase match
- genome sequencing: Alternative topic phrase match
- DNA analysis: Additional topic phrase match
Should not match
- unrelated generic text without domain phrases: No relevant topic phrases present
- placeholder value 12345: Random text should not match topic-specific regex
- patient biometric: Generic word pair from old broad template should not match
Known false positives
- Medical terminology in health education materials, research publications, clinical guidelines, or public health documents without patient-specific data. Mitigation: Require corroborative evidence keywords confirming patient context. Look for co-occurrence with patient identifiers such as medical record numbers or dates of birth.
- General wellness and fitness content using medical vocabulary without constituting protected health information. Mitigation: Layer with patient identifier patterns or healthcare-specific document structure detection to distinguish clinical records from general health content.
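The mitigations above can be sketched as a simple co-occurrence check: flag a document only when a topic phrase appears alongside a patient identifier. The medical record number and date of birth patterns below are hypothetical placeholders written for illustration; real identifier formats vary by organisation and would need their own validated patterns.

```python
import re

TOPIC = re.compile(
    r"(?is)\b(?:genomic\s+testing|genetic\s+testing|genome\s+sequencing|"
    r"DNA\s+analysis|genetic\s+variant|hereditary\s+risk|gene\s+mutation|"
    r"whole\s+genome|genetic\s+counseling)\b"
)

# Hypothetical identifier patterns, not taken from this entry.
MRN_RE = re.compile(r"\bMRN[:#\s]*\d{6,10}\b", re.IGNORECASE)
DOB_RE = re.compile(
    r"\b(?:DOB|date\s+of\s+birth)[:\s]*\d{1,2}[/-]\d{1,2}[/-]\d{2,4}\b",
    re.IGNORECASE,
)


def is_likely_patient_record(text: str) -> bool:
    """True when a topic phrase co-occurs with an MRN-like or DOB-like value."""
    has_topic = TOPIC.search(text) is not None
    has_identifier = bool(MRN_RE.search(text) or DOB_RE.search(text))
    return has_topic and has_identifier
```

This check is document-wide; combining it with a proximity window would tighten it further at the cost of missing records where the identifier sits in a distant header.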
References
- https://www.legislation.gov.au/C2004A03712/latest/text
- https://www.legislation.gov.au/C2012A00063/latest/text
- https://www.oaic.gov.au/privacy/australian-privacy-principles-guidelines