Guardrails

Runtime checks that constrain LLM inputs and outputs to keep behavior safe and on-spec.

Definition
Guardrails are programmatic policies wrapped around an LLM: input filters (PII, prompt-injection detection), output validators (schema, toxicity, factuality), and fallback behaviors. They turn a probabilistic model into a system you can ship to production with predictable failure modes.
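The three pieces named above (input filter, output validator, fallback) can be sketched as a thin wrapper around any model call. This is a minimal illustration under stated assumptions, not a production design: `guarded_call`, `input_ok`, `output_ok`, the phrase list, and the length cap are all hypothetical names and values chosen for the example, and a real system would replace the naive substring check with a proper detector.

```python
from typing import Callable

# Hypothetical fallback message returned whenever a guardrail trips.
FALLBACK = "Sorry, I can't help with that request."

# Hypothetical, deliberately naive phrase list; real prompt-injection
# detectors are far more robust than substring matching.
INJECTION_MARKERS = ("ignore previous instructions", "reveal your system prompt")


def input_ok(prompt: str) -> bool:
    """Input filter: reject prompts containing a known injection phrase."""
    lowered = prompt.lower()
    return not any(marker in lowered for marker in INJECTION_MARKERS)


def output_ok(reply: str) -> bool:
    """Output validator (stand-in): non-empty and under an assumed length cap."""
    return 0 < len(reply.strip()) <= 2000


def guarded_call(model: Callable[[str], str], prompt: str) -> str:
    """Run the model only if the input passes, and return its reply only
    if the output passes; otherwise return the fallback."""
    if not input_ok(prompt):
        return FALLBACK
    try:
        reply = model(prompt)
    except Exception:
        # A model error is treated as one more predictable failure mode.
        return FALLBACK
    return reply if output_ok(reply) else FALLBACK
```

Here `model` is any callable that maps a prompt string to a reply string, so the wrapper can be exercised with a stub before a real client is plugged in.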
Example
Before a support agent sends a reply, a guardrail strips customer PII from the logs, validates that the reply matches a JSON schema, and blocks any response that violates the refund policy.
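The support-agent scenario above might look roughly like the following sketch. Everything specific here is an assumption made for illustration: the reply shape (`message`, `refund_usd`), the refund cap, the email-only PII pattern, and the function names are hypothetical, and the hand-written checks stand in for a real JSON Schema validator.

```python
import json
import re

# Assumption: emails are the only PII handled in this sketch.
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")
MAX_REFUND_USD = 100.0  # hypothetical refund-policy cap


def redact_pii(text: str) -> str:
    """Replace email addresses with a placeholder before logging."""
    return EMAIL_RE.sub("[EMAIL]", text)


def validate_reply(raw: str) -> dict:
    """Parse the reply and check shape and policy; raise ValueError on failure."""
    try:
        reply = json.loads(raw)
    except json.JSONDecodeError as exc:
        raise ValueError("reply is not valid JSON") from exc
    if not isinstance(reply, dict) or not isinstance(reply.get("message"), str):
        raise ValueError("reply must be an object with a string 'message'")
    refund = reply.get("refund_usd", 0)
    # bool is a subclass of int in Python, so exclude it explicitly.
    if isinstance(refund, bool) or not isinstance(refund, (int, float)):
        raise ValueError("'refund_usd' must be a number")
    if refund < 0 or refund > MAX_REFUND_USD:
        raise ValueError("refund violates policy")
    return reply


def guard_reply(raw: str, log: list) -> dict:
    """Log a redacted copy, then return the reply or a safe fallback."""
    log.append(redact_pii(raw))
    try:
        return validate_reply(raw)
    except ValueError as exc:
        log.append(f"blocked: {exc}")
        return {"message": "Let me connect you with a human agent.", "refund_usd": 0}
```

Redaction runs before anything is written to the log, so a reply that is later blocked still never leaves raw PII behind.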