AI Indirect Injection - Hidden Content

Detects hidden or encoded content carriers used for indirect prompt injection (OWASP LLM01): zero-width / bidirectional / homoglyph Unicode control characters that conceal instructions in otherwise benign prose, and abnormally long base64 strings embedded inline in text. These carriers smuggle attacker instructions into documents, emails, and web content that an LLM later ingests.

Type
regex
Engine
universal
Confidence
low
Confidence justification
Low by design. Hidden-character and long-base64 signals are necessary-not-sufficient: legitimate base64 (embedded images, certificates, code blobs) and incidental bidi text generate false positives. Confidence is only raised when AI injection-verb phrases co-occur within proximity.
Jurisdictions
global
Regulations
OWASP LLM Top 10 2025, NIST AI RMF GenAI Profile
Frameworks
ISO 27001
Data categories
emerging, security
Risk rating
7
Platform compatibility
Purview: Compatible, GCP DLP: Compatible, Macie: Compatible, Zscaler: Compatible, Palo Alto: Compatible, Netskope: Compatible

Pattern

[​-‏‪-‮⁠-⁤]

Corroborative evidence keywords

hidden, encoded, base64, unicode, ignore previous instructions, ignore all previous instructions, disregard the system prompt, disregard prior instructions, forget your instructions, override your rules, you are now, new instructions, do not follow your guidelines, bypass your restrictions

Proximity: 300 characters

Should match

Should not match

Collections