Prompt Injection | AI Glossary

Definition

A security vulnerability where malicious inputs can override or bypass an AI system's intended behavioral constraints.

Detailed Explanation

Prompt injection occurs when carefully crafted inputs manipulate an AI model into ignoring its original instructions or security measures. This technique exploits the model's tendency to process all input as potentially valid instructions potentially leading to unauthorized behavior or information disclosure. Understanding and preventing prompt injection is crucial for AI system security.

Use Cases

Security testing system hardening safety protocol development and vulnerability assessment

Definition

Detailed Explanation

Use Cases

Related Terms

In-Context Learning

Temperature

Tool Use (LLMs)

Help

People also viewed