Ensures that AI systems remain secure and aligned with their intended purpose, preventing unauthorized manipulation.
Note: The following steps explain how to configure the Prompt Injection Policy within the policy workflow. This policy applies only to user-submitted prompts and cannot be applied to responses.
Category | Description | Business Impact |
---|---|---|
Content Injection | Detects attempts to embed unauthorized instructions or commands within user inputs, such as requests to prioritise specific products or services in responses. | Helps maintain the integrity of AI interactions and prevents unauthorized influence on business recommendations or decisions. |
System Override | Identifies attempts to bypass or override established system guidelines and safety measures, ensuring user inputs remain aligned with organisational policies and values. | Protects against potential misuse that could lead to reputational damage or compliance violations. |