Safety

Monitor flagged conversations and evaluate how your agent handles harmful content.

Metrics

Metric	What it shows
Caller utterance risk level	How risky incoming messages are and how well the agent manages them
Total calls	Total call count during the selected period
Calls managed for risk	How often safety filters were triggered (count and percentage)
Distribution of flagged calls	Trends in flagged calls over time
Caller utterance category	Breakdown by hate, self-harm, sexual content, and violence

Configuring filters

Safety filters are configured project-wide in Behavior and overridden per channel in Voice and Messaging. See Safety filters for the full reference – categories, severity levels, language support, and how filters fit with Guardrails.

Safety filters

Configure per-channel filter categories and severity levels.

Standard dashboard

Day-to-day performance monitoring: containment, call volume, and duration.

Conversation review

Inspect individual flagged conversations in full transcript view.

Last modified on July 10, 2026

CustomEnterprise dashboards tailored to your business metrics.

⌘I

Get started

Studio Assistant

Analytics

Conversations

Custom Dashboards

Behavior

Knowledge

Flows

Tools

Extend with code

Testing

Real-time config

Voice

Messaging

Integrations

Deployments

Widgets

Account

Metrics

Configuring filters

Safety filters

Standard dashboard

Conversation review

​Metrics

​Configuring filters

​Related pages

Safety filters

Standard dashboard

Conversation review

Metrics

Configuring filters

Related pages