Real-time chat monitoring requires a fundamentally different approach than ticketing system analytics. When your AI powers live conversations on your website, Slack workspace, or WhatsApp Business, users expect immediate responses, natural dialogue flow, and seamless resolution within minutes—not hours or days. This page shows you how to measure, interpret, and optimize AI performance specifically for chat environments where every second counts and conversation quality makes or breaks user satisfaction.

Why Chat Monitoring Is Different

Chat conversations operate under unique constraints that distinguish them from traditional support tickets:
  • Immediate response expectations - Users expect replies within seconds, not minutes. A 30-second delay that’s acceptable in email feels glacial in chat.
  • Synchronous dialogue patterns - Chat conversations happen in real-time, creating natural back-and-forth exchanges. Users don’t batch all their questions into one message—they ask, wait, respond, and iterate.
  • Shorter attention spans - Users abandon chat conversations quickly if they don’t get immediate value. A slow or unhelpful initial response loses the user entirely.
  • Context within single sessions - Chat users typically resolve their entire question in one continuous session, unlike ticket workflows that span days with multiple agents.
  • Informal communication style - Chat favors conversational, friendly language over formal support terminology. Tone and brevity matter enormously.
  • Peak hour sensitivity - Chat volume spikes during business hours and specific events. Your AI must handle sudden surges without degradation.
These differences demand specialized metrics that capture response speed, conversation flow quality, and user engagement patterns specific to real-time interactions.
Chat-optimized teams monitor response times in seconds (not hours), conversation abandonment rates, and peak hour coverage as their primary success indicators—metrics that matter less in asynchronous ticketing.

Accessing Chat Performance Metrics

Navigate to Analyze → Metrics and select the General view, which provides comprehensive chat analytics across all real-time conversation channels.

Filtering for Chat Channels

Use the channel filter to focus specifically on chat-based conversations:
  • Website Chat (Web channel) - Your website chat widget conversations. Filter to “Web” to see only these interactions.
  • Slack (Slack channel) - Workspace conversations from your Slack integration. Essential for internal support or community chat.
  • WhatsApp (WhatsApp channel) - WhatsApp Business conversations. Combines both synchronous and asynchronous patterns.
To analyze only chat performance, select these channels and exclude email, ticketing, and other asynchronous channels from your view.
While WhatsApp conversations can be asynchronous, many users treat them like real-time chat. Monitor your WhatsApp response time patterns to determine which monitoring approach fits your use case.

Key Chat Performance Metrics

Response Time Analysis

Unlike ticketing systems where first response time is measured in hours, chat response time matters in seconds. What to measure:
  • Time from user message to AI’s first response
  • Average response latency across conversations
  • 95th percentile response time (catching slowest responses)
  • Response time during peak hours vs. off-peak
Why it matters:
User expectations by response time:
- 0-3 seconds: Excellent, feels instant
- 3-7 seconds: Good, acceptable wait
- 7-15 seconds: Fair, user notices delay
- 15-30 seconds: Poor, user getting impatient
- 30+ seconds: Unacceptable, high abandonment risk

Impact:
Every 5 seconds of additional delay increases abandonment by ~10-15%
How to calculate from conversation data:
  1. Navigate to Analyze → Conversations
  2. Filter to your chat channels (Web, Slack, WhatsApp)
  3. Open 20-30 conversations
  4. Manually calculate time between user message and AI response
  5. Average the results to establish your baseline
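If you export or note the timestamps while sampling, a short script can take over steps 4-5. This is a minimal sketch under assumptions: the timestamp pairs below are invented placeholders, not a platform export.

```python
# Sketch: average and 95th-percentile response latency from a manual sample.
# The (user message time, AI reply time) pairs are illustrative placeholders
# noted while reviewing conversations, not a platform export format.
from datetime import datetime
from statistics import mean, quantiles

samples = [
    ("2024-05-06 09:14:02", "2024-05-06 09:14:06"),
    ("2024-05-06 09:21:40", "2024-05-06 09:21:49"),
    ("2024-05-06 10:02:11", "2024-05-06 10:02:14"),
    ("2024-05-06 10:45:30", "2024-05-06 10:45:52"),
]

fmt = "%Y-%m-%d %H:%M:%S"
latencies = [
    (datetime.strptime(ai, fmt) - datetime.strptime(user, fmt)).total_seconds()
    for user, ai in samples
]

print(f"Average response time: {mean(latencies):.1f}s")
# quantiles(..., n=20) returns 19 cut points; the last one approximates the 95th percentile.
print(f"95th percentile:       {quantiles(latencies, n=20)[-1]:.1f}s")
```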
While the platform doesn’t currently display response time as a standalone metric card, tracking this manually for a sample of conversations provides critical insight into chat performance. Benchmarks:
  • 0-5 seconds average: Excellent chat performance
  • 5-10 seconds average: Good, room for optimization
  • 10-20 seconds average: Fair, likely impacting satisfaction
  • 20+ seconds average: Poor, investigate latency issues urgently
Response time degrades significantly when knowledge retrieval is slow or external data providers timeout. If you see response times above 15 seconds, review your data provider performance and consider caching frequently-accessed information.

Conversation Resolution Rate

What it measures: Percentage of chat conversations that successfully resolve the user’s question without escalation or abandonment.
Chat Resolution Rate = Resolved Conversations / Total Chat Conversations × 100%
Access this metric in the General view via the Resolution Rate card. Filter to your chat channels to isolate chat-specific performance. Chat-specific interpretation:
80%+ resolution = Excellent
- AI handles most common questions autonomously
- Users trust and engage with the AI
- Knowledge base covers expected topics well

60-80% resolution = Good
- Reasonable automation with clear improvement opportunities
- Some topic gaps causing escalations
- Review unresolved conversations for patterns

40-60% resolution = Fair
- Significant knowledge gaps or guidance issues
- Users frequently need human help
- May indicate wrong deployment strategy for chat

Below 40% = Poor
- Major problems preventing effective chat automation
- Fundamental knowledge gaps or technical issues
- Urgent investigation required
Chat vs. ticketing differences: In ticketing, 70% resolution might be excellent because tickets handle complex issues. In chat, 70% resolution suggests problems—chat should handle simpler, high-volume questions autonomously while escalating truly complex cases.
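To see that difference in your own numbers, a quick sketch like the one below compares channels side by side; the counts are invented stand-ins for what the Resolution Rate card shows with each channel filter applied.

```python
# Sketch: resolution rate per channel, using invented counts from the Resolution Rate card.
counts = {
    # channel: (resolved, total)
    "Web":      (410, 480),
    "Slack":    (98, 120),
    "WhatsApp": (152, 210),
    "Email":    (310, 445),  # asynchronous channel, kept for comparison
}

for channel, (resolved, total) in counts.items():
    print(f"{channel:9s} resolution rate: {resolved / total * 100:5.1f}%")
# Expect the chat channels (Web, Slack, WhatsApp) to sit noticeably above ticketing.
```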

User Satisfaction in Chat (CSAT)

What it measures: Customer Satisfaction score specific to chat conversations, calculated as percentage of 4-5 star ratings.
Chat CSAT = (Good + Amazing ratings) / Total Rated Chat Conversations × 100%
Find this in the General view CSAT Score card. Apply chat channel filters for chat-specific CSAT. Why chat CSAT matters differently:
Chat users rate based on:
- Speed of response (immediate gratification)
- Conversational flow (natural dialogue)
- Conciseness (no long-winded explanations)
- Tone (friendly, helpful attitude)

Ticket users rate based on:
- Thoroughness (complete information)
- Accuracy (correct answer)
- Follow-through (problem actually solved)
- Professionalism (formal, competent)
Chat CSAT benchmarks:
  • 85%+ = Excellent (users love the chat experience)
  • 75-85% = Good (solid chat satisfaction)
  • 65-75% = Fair (chat experience needs improvement)
  • Below 65% = Poor (fundamental chat quality issues)
Common chat CSAT issues:
Pattern 1: High resolution but low CSAT
  • AI is answering but not conversationally
  • Responses too formal or lengthy for chat
  • Tone feels robotic or unhelpful
  • Missing pleasantries (greetings, acknowledgment)
Pattern 2: Low resolution and low CSAT
  • Knowledge gaps preventing answers
  • AI escalating too frequently
  • Users frustrated by lack of help
Pattern 3: High CSAT but low resolution
  • AI is friendly but not helpful
  • Good tone but missing information
  • Users appreciate the effort even though their issue wasn’t resolved
Use the Conversation Rating Chart in General view to see your CSAT distribution. Filter to 1-2 star ratings for chat channels to identify problematic conversations.
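If you want to track this outside the dashboard, a small sketch like the one below turns rating counts (invented here) into a CSAT figure and its benchmark band.

```python
# Sketch: chat CSAT from rating counts; the star counts are invented examples.
ratings = {1: 14, 2: 21, 3: 45, 4: 160, 5: 260}  # star rating -> number of conversations

rated = sum(ratings.values())
csat = (ratings[4] + ratings[5]) / rated * 100

bands = [(85, "Excellent"), (75, "Good"), (65, "Fair"), (0, "Poor")]
band = next(label for threshold, label in bands if csat >= threshold)
print(f"Chat CSAT: {csat:.1f}% ({band}) from {rated} rated conversations")
```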

Handoff Timing and Patterns

What it measures: When and why chat conversations escalate from AI to human agents. Access via the Handoff Chart in General view. This visualization shows escalation triggers and timing patterns.
Chat-specific handoff analysis:
Immediate handoffs (within first 1-2 messages):
Causes:
- User explicitly requests human ("I need to speak to a person")
- AI detects high-stakes topic (billing, cancellation)
- Sentiment-based escalation triggers

Interpretation:
- Normal for certain topics
- Review if happening frequently for routine questions
- May indicate overly conservative escalation rules
Mid-conversation handoffs (3-5 messages in):
Causes:
- AI couldn't answer follow-up questions
- User expressed frustration with AI responses
- Complexity escalated beyond AI capability
- Knowledge gap revealed during conversation

Interpretation:
- Indicates partial AI value (handled initial exchange)
- Review these conversations for knowledge improvements
- Good handoff timing if complexity genuinely requires human
Late handoffs (6+ messages in):
Causes:
- AI struggled through long conversation before giving up
- User patience exhausted, explicitly requested human
- Circular conversation pattern (AI repeating itself)

Interpretation:
- Problematic - wasted user time before escalating
- Should have escalated earlier if unable to help
- Review for AI confusion or inadequate knowledge
Peak handoff times: Use date range filtering in Handoff Chart to identify when escalations spike:
  • Business hours (9 AM - 5 PM): Expected for complex questions
  • After hours/weekends: May indicate AI confidence issues when team isn’t available
  • Specific days: Monday mornings, Friday afternoons may have patterns
Action items from handoff analysis:
  1. Filter conversations by escalation trigger
  2. Identify most common escalation reasons
  3. For each reason, determine if escalation was necessary or avoidable
  4. Add knowledge or refine guidance to reduce avoidable escalations
  5. Adjust escalation rules if escalating too early or too late
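To quantify the immediate/mid/late split described above, tally how many messages each escalated conversation exchanged before handoff. The sketch below assumes you noted those counts by hand while reviewing; there is no platform export with this exact field.

```python
# Sketch: bucket escalated conversations by messages exchanged before handoff.
messages_before_handoff = [1, 2, 2, 4, 3, 5, 7, 2, 9, 4, 6, 1, 3, 8]  # hand-collected sample

buckets = {"Immediate (1-2)": 0, "Mid-conversation (3-5)": 0, "Late (6+)": 0}
for n in messages_before_handoff:
    if n <= 2:
        buckets["Immediate (1-2)"] += 1
    elif n <= 5:
        buckets["Mid-conversation (3-5)"] += 1
    else:
        buckets["Late (6+)"] += 1

total = len(messages_before_handoff)
for label, count in buckets.items():
    print(f"{label:22s}: {count:2d} ({count / total:.0%})")
# A large "Late" share means the AI is burning user patience before escalating.
```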

Chat Abandonment Rate

What it measures: Percentage of chat conversations where the user leaves before their question is resolved or escalated. How to calculate: Navigate to Analyze → Conversations, filter to:
  • Channels: Web, Slack, WhatsApp (your chat channels)
  • Status: Unresolved
  • Date range: Last 7-30 days
Abandonment Rate = Unresolved Chat Conversations / Total Chat Conversations × 100%
Chat abandonment benchmarks:
  • 0-10%: Excellent (minimal abandonment)
  • 10-20%: Good (reasonable abandonment for difficult questions)
  • 20-35%: Fair (significant abandonment, investigate causes)
  • 35%+: Poor (losing too many users mid-conversation)
Common abandonment patterns:
Single-message abandonment:
Pattern: User asks question, never responds to AI's answer

Causes:
- AI response took too long to arrive (user left)
- AI answer didn't address the question
- User found answer elsewhere (help docs, Google)
- User got distracted or lost interest

Action:
- Check response time for these conversations
- Review answer relevance
- Improve initial response quality
Multi-message abandonment:
Pattern: Back-and-forth exchange, then user stops responding

Causes:
- AI couldn't answer follow-up questions
- Conversation becoming circular (repeating information)
- User frustrated with lack of progress
- AI provided partial solution, "good enough" for user

Action:
- Read 10-20 of these conversations
- Identify where conversations break down
- Add missing knowledge for common follow-ups
- Improve guidance for handling multi-turn conversations
Strategic abandonment analysis: Use the Conversation Length Chart to see message distribution:
If many 1-message conversations are unresolved:
- Response time or initial answer quality issues

If many 3-5 message conversations are unresolved:
- Knowledge gaps for follow-up questions
- AI not handling multi-turn dialogue well

If many 6+ message conversations are unresolved:
- AI getting lost in long conversations
- Should escalate earlier if unable to resolve
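A small sketch can combine the abandonment formula with this length breakdown. The (message count, resolved) pairs below are invented; in practice you would note them while reviewing unresolved conversations.

```python
# Sketch: abandonment rate plus where abandoned conversations stop.
from collections import Counter

conversations = [  # (total messages, resolved?) - illustrative sample
    (1, False), (2, True), (4, True), (1, False), (3, True),
    (5, False), (2, True), (7, False), (1, True), (4, False),
]

unresolved = [n for n, resolved in conversations if not resolved]
print(f"Abandonment rate: {len(unresolved) / len(conversations):.0%}")

def length_bucket(n):
    if n == 1:
        return "1 message"
    if n == 2:
        return "2 messages"
    if n <= 5:
        return "3-5 messages"
    return "6+ messages"

for label, count in Counter(length_bucket(n) for n in unresolved).most_common():
    print(f"Unresolved after {label}: {count}")
```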

Peak Hour Coverage

What it measures: AI performance during highest-traffic periods compared to off-peak times. Why this matters for chat: Unlike ticketing where AI handles weekend overflow, chat AI must maintain quality during peak weekday hours when:
  • Traffic volume is 3-5x higher than off-peak
  • Users expect instant responses
  • Multiple concurrent conversations strain resources
  • Human agents are busy and can’t easily take handoffs
How to analyze peak hour coverage:
Step 1: Identify your peak hours. Use the Weekly Heatmap in General view (scroll to bottom of metrics page):
  • Shows conversation volume by day of week and hour
  • Darker colors indicate higher volume
  • Identify your busiest 3-4 hour blocks
Step 2: Filter metrics to peak hours. Navigate to Analyze → Conversations and use custom date filtering:
  • Select specific date ranges during peak hours
  • Note CSAT, Resolution Rate, and involvement levels
  • Export conversations for detailed analysis
Step 3: Compare peak vs. off-peak performance
Example comparison:

Peak hours (10 AM - 2 PM weekdays):
- 250 conversations per day
- 72% CSAT
- 68% resolution rate
- 15 second avg response time

Off-peak hours (8 PM - midnight):
- 40 conversations per day
- 82% CSAT
- 85% resolution rate
- 5 second avg response time

Insight: Performance degrades during peak load
Action: Investigate whether issue is response time, quality, or both
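To produce a comparison like the one above, group your conversations by start hour and summarize each group. The sketch below uses invented records; the field layout is an assumption about what you would note or export, not a platform format.

```python
# Sketch: peak vs. off-peak summary from per-conversation records.
records = [
    # (start hour, rating 1-5 or None, resolved?, response seconds)
    (10, 4, True, 14), (11, 3, True, 18), (12, None, False, 22), (13, 5, True, 12),
    (21, 5, True, 4),  (22, 4, True, 6),  (23, None, True, 5),
]

PEAK_HOURS = range(10, 14)  # 10 AM - 2 PM, read off your Weekly Heatmap

def summarize(rows):
    rated = [rating for _, rating, _, _ in rows if rating is not None]
    csat = 100 * sum(r >= 4 for r in rated) / len(rated) if rated else 0.0
    resolution = 100 * sum(resolved for _, _, resolved, _ in rows) / len(rows)
    latency = sum(seconds for *_, seconds in rows) / len(rows)
    return csat, resolution, latency

for label, rows in [
    ("Peak",     [r for r in records if r[0] in PEAK_HOURS]),
    ("Off-peak", [r for r in records if r[0] not in PEAK_HOURS]),
]:
    csat, resolution, latency = summarize(rows)
    print(f"{label:8s}: CSAT {csat:.0f}%, resolution {resolution:.0f}%, avg response {latency:.0f}s")
```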
Peak hour performance issues:
Response time degradation:
  • AI latency increases during high traffic
  • May indicate infrastructure scaling issues
  • Consider optimizing data provider queries
  • Review caching strategy for common questions
Quality degradation:
  • Resolution rate drops during peaks
  • More escalations during busy periods
  • May indicate human agents too busy to take handoffs gracefully
  • Could suggest AI hesitates to handle questions during peaks
Coverage gaps:
  • Peak times see more “Not Involved” conversations
  • Agents handling conversations manually without AI
  • May indicate team doesn’t trust AI during critical periods
  • Review guidance for peak hour scenarios
The best chat AI deployments maintain consistent quality regardless of traffic volume. If your metrics vary significantly between peak and off-peak, investigate whether the issue is technical (infrastructure) or operational (team behavior).

Using the General View for Chat Analysis

The General view dashboard provides several charts specifically valuable for chat monitoring:

Conversation Status Chart

What it shows: Stacked area chart of resolved, unresolved, and escalated conversations over time. Chat-specific usage:
  • Track daily resolution trends for chat channels
  • Identify days when resolution rate dips (investigate those days)
  • Correlate status changes with deployments or knowledge updates
  • Monitor whether unresolved conversations (abandonment) are growing
How to use:
  1. Apply chat channel filters (Web, Slack, WhatsApp)
  2. Set date range to last 30 days
  3. Look for trends: Is green (resolved) growing? Is yellow (unresolved) shrinking?
  4. Click specific dates to view conversations from that day

AI Involvement Rate Chart

What it shows: Pie chart showing autonomous, public, private, and not-involved distribution. Ideal chat involvement distribution:
Mature chat deployment:
- 70-80% Fully Autonomous (green)
- 10-15% Public Involvement (blue)
- 5-10% Private Involvement (purple)
- 5-10% Not Involved (gray)

Chat differs from ticketing:
- Higher autonomous rate expected (chat handles simpler questions)
- Lower private involvement (less time for copilot review in real-time)
- Lower public involvement (either AI handles or escalates quickly)
If your chat distribution looks different:
Low autonomous rate (40-50%):
  • Chat is deployed for questions too complex for full automation
  • Knowledge gaps preventing autonomous handling
  • Guidance too conservative, escalating unnecessarily
  • Review public involvement conversations for improvement opportunities
High not-involved (30%+):
  • Team handling many chats manually without AI assistance
  • Integration may not be triggering AI for all conversations
  • Agents may be disabling AI during busy periods
  • Review deployment settings and team training

Message Volume Chart

What it shows: Area chart with message count and conversation count over time. Chat-specific metrics:
Messages per Conversation = Total Messages / Total Conversations

Ideal chat conversation length:
- 2-4 messages: Quick questions, efficient resolution
- 5-6 messages: Moderate complexity, good dialogue
- 7-10 messages: Complex but manageable
- 10+ messages: Potentially problematic, investigate

Long chat conversations indicate:
- AI not answering concisely
- Knowledge gaps requiring many clarifying questions
- Circular conversations (AI repeating itself)
- Complex issues that should escalate earlier
Use this chart to identify whether your chat conversations are becoming longer over time (potential quality issue) or shorter (potential improvement or users giving up quickly).
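A quick way to watch this trend is to compute messages per conversation and a length histogram from a sample; the counts below are invented placeholders.

```python
# Sketch: messages per conversation and a length histogram from a sample.
from collections import Counter

message_counts = [2, 3, 4, 2, 6, 3, 11, 4, 5, 2, 8, 3, 4, 13, 2]  # illustrative

print(f"Messages per conversation: {sum(message_counts) / len(message_counts):.1f}")

def bucket(n):
    if n <= 4:
        return "2-4 (quick)"
    if n <= 6:
        return "5-6 (moderate)"
    if n <= 10:
        return "7-10 (complex)"
    return "10+ (investigate)"

for label, count in Counter(bucket(n) for n in message_counts).most_common():
    print(f"{label:18s}: {count}")
```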

User Sentiment Chart

What it shows: Distribution of positive, neutral, and negative sentiment in user messages. Why sentiment matters more in chat: Real-time chat captures emotional reactions immediately:
  • Users express frustration faster in chat than email
  • Positive sentiment validates good chat experience
  • Sentiment shift mid-conversation indicates AI performance
How to use for chat optimization:
  1. Filter to chat channels
  2. Note baseline sentiment distribution
  3. Filter to negative sentiment conversations
  4. Review whether negative sentiment correlates with:
    • Long response times
    • Unresolved status
    • Many-message conversations
    • Specific topics or times
Healthy chat sentiment distribution:
60-70% Neutral: Normal for factual questions
20-30% Positive: Good experiences, satisfied users
5-15% Negative: Acceptable rate, often reflects problem frustration, not AI frustration
If negative sentiment exceeds 20%, investigate whether users are frustrated with the AI’s responses or with their underlying problem.

Real-Time Performance Monitoring

Unlike ticketing where daily review suffices, chat performance benefits from more frequent monitoring:

Daily Quick Check (5 minutes)

Every morning:
  1. Open Analyze → Metrics, General view
  2. Filter to chat channels (Web, Slack, WhatsApp)
  3. Review yesterday’s performance:
    • CSAT score: Did it drop from baseline?
    • Resolution rate: Any significant change?
    • Total conversations: Traffic volume normal?
  4. Check Conversation Status Chart: Any unusual spikes in escalations or unresolved?
What to look for:
  • Sudden CSAT drops (5+ percentage points): Investigate immediately
  • Resolution rate degradation: May indicate knowledge issue or technical problem
  • Volume spikes: Ensure AI is handling load without quality loss

Weekly Deep Dive (30-45 minutes)

Every Monday or Tuesday:
  1. Review previous week’s chat metrics:
    • Filter date range to past 7 days
    • Note CSAT, resolution rate, involvement rate
    • Compare to previous week’s performance
  2. Analyze low-rated conversations:
    • Navigate to Analyze → Conversations
    • Filter: Chat channels, Rating 1-2 stars, Last 7 days
    • Read 10-15 conversations
    • Document common issues (response quality, speed, knowledge gaps)
  3. Check abandonment patterns:
    • Filter: Chat channels, Status: Unresolved, Last 7 days
    • Review 10-15 abandoned conversations
    • Note where users are leaving (after how many messages?)
    • Identify topics with high abandonment
  4. Review escalation triggers:
    • Filter: Chat channels, Status: Escalated, Last 7 days
    • Check Handoff Chart for escalation reasons
    • Determine if escalations were necessary or avoidable
    • Plan knowledge improvements to reduce avoidable escalations
  5. Track improvements:
    • Review conversations for topics you recently improved
    • Verify that changes are having desired effect
    • Document what’s working to replicate success

Real-Time Chat Monitoring (During Critical Periods)

For product launches, marketing campaigns, or other high-stakes events: Set up active monitoring:
  1. Keep Analyze → Conversations open and filtered to chat channels
  2. Refresh every 15-30 minutes to see new conversations
  3. Spot-check recent conversations for quality
  4. Watch for unusual patterns (sudden escalation spike, low ratings)
Have escalation plan ready:
  • Define thresholds for manual intervention (e.g., 3 consecutive 1-star ratings)
  • Assign team member to monitor metrics during event
  • Prepare to adjust AI deployment if quality issues arise
  • Have backup plan to route conversations to humans if AI struggles
Never make major changes to guidance or knowledge during critical traffic periods. Test all improvements in off-peak times first. During high-traffic events, focus on monitoring and routing, not experimenting.

Conversation Flow Analysis

Chat conversations reveal how well your AI handles real-time dialogue dynamics:

Multi-Turn Dialogue Quality

What to analyze: How AI performs across multiple back-and-forth exchanges. Access the data:
  1. Navigate to Analyze → Conversations
  2. Filter to resolved chat conversations
  3. Use Conversation Length Chart to filter to 5-10 message conversations
  4. Review 20-30 examples
What good multi-turn chat looks like:
Turn 1: User asks initial question
Turn 2: AI provides direct answer with key information
Turn 3: User asks clarifying follow-up
Turn 4: AI addresses follow-up without repeating initial answer
Turn 5: User confirms understanding or thanks AI
Turn 6: AI acknowledges and offers additional help

Quality indicators:
- Each AI response adds new information
- No circular repetition
- Natural conversation flow
- Concise responses (2-4 sentences typically)
- Appropriate tone throughout
What poor multi-turn chat looks like:
Turn 1: User asks initial question
Turn 2: AI provides lengthy, comprehensive answer (8+ sentences)
Turn 3: User asks related follow-up (didn't read full answer)
Turn 4: AI repeats information from Turn 2
Turn 5: User expresses frustration
Turn 6: AI repeats again or escalates

Problems:
- Initial answer too long for chat format
- AI not tracking what it already said
- Missing follow-up knowledge
- Escalating too late after frustrating user

Context Retention Analysis

What it measures: Whether AI maintains conversation context across messages. How to test:
  1. Filter to chat conversations with 6+ messages
  2. Read user follow-up questions
  3. Check if AI responses acknowledge previous context
  4. Look for phrases that reveal whether context is maintained or lost:
    • “As I mentioned earlier…” (good, maintaining context)
    • “I don’t have information about that” (bad, after already discussing topic)
    • User repeating question multiple times (bad, AI not understanding)
Common context loss issues:
  • AI treats each user message as independent query
  • Follow-up questions reference information from earlier in conversation
  • AI doesn’t connect related questions across messages
  • Pronouns and references not resolved (the user says “What about the other option?” and the AI doesn’t know which option)
Improving context retention:
  • Ensure guidance emphasizes conversation continuity
  • Add examples of multi-turn conversations to training
  • Review knowledge structure for related topics that should link together
  • Test conversation memory by asking follow-ups in Preview Panel

Greeting and Closing Quality

First impressions matter in chat.
Opening message analysis:
  1. Filter to chat conversations
  2. Review first AI message in 30-50 conversations
  3. Evaluate:
    • Does AI greet user warmly?
    • Is tone conversational, not formal?
    • Does AI acknowledge the question clearly?
    • Is first response fast (ideally under 5 seconds)?
Good chat openings:
"Hi! I'd be happy to help you with that..."
"Hey there! Let me explain how that works..."
"Hi! Great question about [topic]..."
Poor chat openings:
"Based on your inquiry regarding..." (too formal)
"I will now retrieve information about..." (robotic)
*No greeting, jumps straight to answer* (cold)
"Please wait while I search..." (slow, lacks confidence)
Closing message analysis:
Good chat AI closes conversations naturally:
"Is there anything else I can help with?"
"Glad I could help! Let me know if you have other questions."
"Hope that clarifies it! Feel free to ask if you need anything else."
Poor chat closings:
*No closing* - AI just stops responding
"This conversation is now closed." (abrupt)
"Please rate your experience." (pushy without helpfulness check)
Review 20-30 resolved conversations to ensure your AI is opening and closing naturally.

User Engagement Metrics

Beyond success rates, engagement metrics show whether users trust and value the chat experience:

Repeat User Rate

What it measures: Percentage of chat users who return for multiple conversations. How to calculate:
  1. Navigate to Analyze → Metrics, General view
  2. Note “Unique Users” count for selected time period
  3. Note “Conversations” count for same period
  4. Calculate: Conversations per User = Conversations / Unique Users
Conversations per User:
- 1.0-1.2: Low engagement, mostly one-time users
- 1.3-1.8: Moderate engagement, some repeat users
- 1.9-2.5: Good engagement, healthy repeat rate
- 2.5+: High engagement, users trust and return frequently
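As a sketch (with invented counts read from the General view), the calculation and banding look like this:

```python
# Sketch: conversations per user, using invented counts from the General view.
unique_users = 1240
conversations = 2170

per_user = conversations / unique_users
bands = [(2.5, "High engagement"), (1.9, "Good engagement"),
         (1.3, "Moderate engagement"), (0.0, "Low engagement")]
label = next(name for threshold, name in bands if per_user >= threshold)
print(f"Conversations per user: {per_user:.2f} ({label})")
```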
Why repeat users matter:
  • Indicates users find chat valuable (come back)
  • Lower customer acquisition cost (retention vs. new users)
  • Trust in AI capability (wouldn’t return if first experience was poor)
If repeat rate is low (below 1.5):
  • First-time experience may not be compelling enough
  • Users solving their problem once and never needing help again (could be good!)
  • Users avoiding chat after poor initial experience (needs investigation)
Cross-reference with CSAT: Low CSAT + Low repeat rate = poor experience driving users away.

Follow-Up Question Rate

What it measures: Percentage of conversations where user asks follow-up questions after initial answer. How to analyze:
  1. Use Conversation Length Chart to see message distribution
  2. Calculate percentage of conversations with 3+ user messages
User message counts:
- 1 message: User asked, got answer, left (or abandoned)
- 2 messages: User asked, got answer, confirmed/thanked
- 3+ messages: User had follow-up questions or clarifications

Follow-up rate = Conversations with 3+ user messages / Total × 100%

Benchmarks:
- 30-45%: Normal, healthy engagement
- 45-60%: High engagement, complex questions or incomplete initial answers
- 15-30%: Low engagement, either very clear answers or users leaving quickly
- Below 15%: Potentially concerning, investigate abandonment
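A sketch of the calculation, assuming you have noted how many user messages each sampled conversation contained (the counts below are invented):

```python
# Sketch: follow-up question rate from user-message counts per conversation.
user_message_counts = [1, 2, 3, 1, 4, 2, 2, 5, 1, 3, 2, 6, 1, 2, 3]  # illustrative

with_follow_ups = sum(1 for n in user_message_counts if n >= 3)
rate = with_follow_ups / len(user_message_counts) * 100
print(f"Follow-up rate: {rate:.0f}%")  # compare against the 30-45% healthy range
```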
Interpreting follow-up patterns:
High follow-up rate (50%+) with high CSAT:
  • Users engaged in productive dialogue
  • AI handling complex questions through conversation
  • Good multi-turn performance
High follow-up rate (50%+) with low CSAT:
  • Users not getting clear answers initially
  • Having to ask same question multiple ways
  • AI not understanding or answering directly
  • Review these conversations for knowledge or guidance issues
Low follow-up rate (20%) with high CSAT:
  • AI providing clear, complete answers immediately
  • Users getting what they need quickly
  • Efficient resolution (ideal for chat)
Low follow-up rate (20%) with low CSAT:
  • Users abandoning after poor first answer
  • Not bothering to ask follow-ups
  • Lost confidence in AI after initial interaction

Survey Completion Rate

What it measures: Percentage of users who complete satisfaction surveys when offered. How to analyze:
  1. Review Conversation Rating Chart in General view
  2. Note “Abandoned” bar (surveys offered but not completed)
  3. Calculate: Survey Completion = Rated / (Rated + Abandoned) × 100%
Chat survey benchmarks:
60%+ completion: Excellent (users engaged enough to rate)
40-60% completion: Good (reasonable survey engagement)
25-40% completion: Fair (many users skip survey)
Below 25%: Poor (survey fatigue or poor placement)
Why chat survey completion matters: Low completion rates limit your feedback data:
  • CSAT score based on a small sample may not represent true satisfaction
  • Missing feedback from users who abandoned or had neutral experience
  • Hard to identify problems without representative ratings
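To make the small-sample risk concrete, this sketch computes survey completion and a rough 95% confidence interval on CSAT from invented counts; the normal-approximation margin of error is a simplification, not a platform feature.

```python
# Sketch: survey completion plus an approximate margin of error on CSAT.
import math

rated, abandoned_surveys = 140, 260          # invented counts from the rating chart
completion = rated / (rated + abandoned_surveys) * 100
print(f"Survey completion: {completion:.0f}%")

csat = 0.78                                  # share of rated conversations at 4-5 stars
margin = 1.96 * math.sqrt(csat * (1 - csat) / rated)  # ~95% confidence interval
print(f"CSAT {csat:.0%} +/- {margin:.1%} with only {rated} ratings")
```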
Improving survey completion:
  • Time survey appropriately (immediately after resolution, not after idle time)
  • Keep survey simple (1 question: star rating, optional comment)
  • Frame survey helpfully: “How did I do?” not “Rate your experience”
  • Don’t offer survey if conversation was very short (1-2 messages)

Chat Quality Indicators

Beyond metrics, qualitative signals reveal chat experience quality:

Response Relevance

What to check: Does the AI actually answer the question asked? How to evaluate:
  1. Read 20-30 random chat conversations
  2. For each, ask: “Did the first AI response directly address the user’s question?”
  3. Calculate: Relevance Rate = Directly Relevant Responses / Total
Target: 85%+ relevance rate (AI answers what the user actually asked)
Common relevance problems:
  • AI provides related but not directly relevant information
  • AI answers a similar question, not the one asked
  • AI gives generic response when user wants specific information
  • AI misunderstands due to typos or informal phrasing
Improving relevance:
  • Add examples of actual user questions to guidance
  • Include common phrasings and variations in knowledge
  • Adjust guidance to prioritize direct answers over comprehensive explanations
  • Test with real user questions in Preview Panel

Response Conciseness

What to check: Are AI responses appropriately brief for chat? How to evaluate:
  1. Read 30-50 chat conversations
  2. Count sentences in typical AI responses
  3. Note which responses feel too long
Chat response length guidelines:
Simple questions: 1-2 sentences ideal, 3 maximum
"What are your hours?"
Good: "We're open Monday-Friday 9am-5pm EST."
Too long: "Thank you for asking about our hours. Our business hours are Monday through Friday from 9:00am to 5:00pm Eastern Standard Time. We're closed on weekends and major holidays. If you need assistance outside these hours, you can email us and we'll respond the next business day."

Moderate complexity: 3-4 sentences, 5 maximum
"How do I reset my password?"
Good: "Click 'Forgot Password' on the login page. Enter your email address, and we'll send you a reset link within a few minutes. The link is valid for 24 hours."

Complex topics: 5-6 sentences, with offer to explain more
"How does your pricing work?"
Good: "We have three plans: Basic ($29/mo), Pro ($79/mo), and Enterprise (custom). Basic includes up to 100 users and core features. Pro adds advanced analytics and integrations. Want me to explain the differences in more detail?"
Signs of excessive length:
  • Users asking follow-ups that were already answered (didn’t read full response)
  • Users saying “too much info” or “just tell me X”
  • High abandonment after AI’s first long response
  • Many follow-up questions because initial answer buried key info
Fixing verbosity:
  • Add guidance: “Keep responses to 2-3 sentences for simple questions”
  • Show examples of concise vs. verbose answers
  • Instruct AI to offer more detail rather than providing it upfront
  • Remove unnecessary preambles (“Thank you for contacting us today…”)

Tone Appropriateness

What to check: Does AI match the conversational, friendly tone users expect from chat? How to evaluate:
  1. Read 20-30 chat conversations
  2. Note formal or robotic phrasing
  3. Check if tone matches your brand voice
Good chat tone characteristics:
- Conversational: "Let me help you with that" not "I shall assist you"
- Friendly: "Happy to explain!" not "I will now provide information"
- Natural: "That's a great question" not "Your inquiry has been noted"
- Brief: "Sure!" not "Certainly, I would be happy to assist"
- Human: "I understand that's frustrating" not "Your frustration is acknowledged"
Red flags in chat tone:
"Per your request..." (too formal)
"I will now retrieve..." (robotic)
"Thank you for contacting..." (call center script)
"Your inquiry regarding..." (ticket language)
"Please be advised..." (legal tone)
"As previously stated..." (condescending)
Refining tone:
  • Add tone guidance: “Respond conversationally, like a helpful colleague”
  • Provide good/bad example pairs in guidance
  • Test tone by reading responses out loud—if you wouldn’t say it to a friend, it’s too formal
  • Review low-rated conversations for tone complaints

Best Practices for Chat Monitoring

Establish Chat-Specific Baselines

Don’t compare chat to ticketing performance: Track chat metrics separately:
Your baselines (example):
- Chat CSAT: 78% (vs 85% for tickets - acceptable difference)
- Chat resolution: 82% (vs 70% for tickets - expected difference)
- Chat autonomous rate: 75% (vs 65% for tickets)
- Avg chat length: 4.2 messages (vs 8.5 for tickets)

Track changes in these baselines over time:
Week 1: 78% CSAT baseline
Week 4: 81% CSAT (+3pp improvement)
Week 8: 80% CSAT (maintaining improvement)
Document your benchmarks: Create a simple spreadsheet or document:
  • Week of [date]
  • Chat CSAT: X%
  • Chat resolution: Y%
  • Avg messages per conversation: Z
  • Notes: What changed this week?
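If a spreadsheet feels heavier than needed, even a tiny script appending to a CSV works; the file name and columns below are just one possible layout, not a platform export.

```python
# Sketch: append this week's chat baselines to a simple CSV log.
import csv
import os
from datetime import date

path = "chat_baselines.csv"  # hypothetical local file
row = {
    "week_of": date.today().isoformat(),
    "chat_csat_pct": 78,
    "chat_resolution_pct": 82,
    "avg_messages_per_conversation": 4.2,
    "notes": "Added 3 billing snippets; watching Refund Policy abandonment",
}

is_new = not os.path.exists(path) or os.path.getsize(path) == 0
with open(path, "a", newline="") as f:
    writer = csv.DictWriter(f, fieldnames=list(row))
    if is_new:
        writer.writeheader()  # header only for a brand-new log
    writer.writerow(row)
```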

Weekly Chat Review Ritual

Monday morning routine (15-20 minutes):
  1. Check weekend chat performance:
    • Filter metrics to Saturday-Sunday
    • Note CSAT and resolution rate
    • Compare to weekday performance
    • Check if AI maintained quality without team available
  2. Review last week’s improvements:
    • Did knowledge additions improve specific topics?
    • Did guidance changes affect tone or conciseness?
    • Track one improvement at a time to measure impact
  3. Identify this week’s focus:
    • Pick 1-2 specific issues to address
    • Set measurable goal (e.g., “Reduce ‘Billing’ topic abandonment from 25% to 18%”)
    • Plan specific actions (add 3 billing snippets, update guidance)
Friday afternoon wrap-up (10 minutes):
  1. Quick metrics check for the week
  2. Note any significant changes or incidents
  3. Plan weekend monitoring if needed
  4. Document week’s performance for next Monday review

Conversation Sampling Strategy

You can’t read every conversation. Use strategic sampling:
Daily samples (5-10 conversations):
  • 3 most recent conversations (spot-check quality)
  • 2 lowest-rated from today (identify acute issues)
  • 2-3 random conversations (avoid bias)
Weekly samples (30-50 conversations):
  • 10 lowest-rated (understand dissatisfaction)
  • 10 escalated (identify avoidable escalations)
  • 10 unresolved/abandoned (understand where users leave)
  • 10 highly-rated (learn what’s working well)
  • 10 random (representative sample)
Monthly samples (100-150 conversations):
  • Deep dive across all rating levels
  • Segment by topic (30 conversations per top topic)
  • Segment by channel if using multiple chat platforms
  • Track improvement in previously problematic areas
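If you can list conversations with their rating and status, the weekly sample above can be assembled automatically. This is a sketch under assumptions: the dictionary fields (id, rating, status) describe data you would collect yourself, not a documented export format.

```python
# Sketch: build a stratified weekly review sample from a list of conversations.
import random

def weekly_sample(conversations, per_bucket=10):
    rated = [c for c in conversations if c.get("rating") is not None]
    sample = []
    sample += sorted(rated, key=lambda c: c["rating"])[:per_bucket]                # lowest-rated
    sample += [c for c in conversations if c["status"] == "escalated"][:per_bucket]
    sample += [c for c in conversations if c["status"] == "unresolved"][:per_bucket]
    sample += sorted(rated, key=lambda c: c["rating"], reverse=True)[:per_bucket]  # highest-rated
    sample += random.sample(conversations, min(per_bucket, len(conversations)))    # random slice
    # Deduplicate while preserving order, since one conversation may fit several buckets.
    seen, unique = set(), []
    for c in sample:
        if c["id"] not in seen:
            seen.add(c["id"])
            unique.append(c)
    return unique

# Example: weekly_sample([{"id": 1, "rating": 2, "status": "resolved"}, ...])
```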

Correlate Chat Metrics with Business Outcomes

Connect chat performance to business goals:
Customer acquisition:
Track: What percentage of new users start chat conversations?
Insight: If 60% of new users chat, chat is critical to onboarding
Action: Optimize AI for common new user questions
Conversion rate:
Track: Do users who chat convert at higher rates?
Calculate: Conversion rate for users with chat vs. without
Insight: If chat users convert 2x more, chat directly drives revenue
Action: Proactively offer chat to users showing buying signals
Support cost reduction:
Track: How many chats would have been email or tickets?
Calculate: Chat resolutions × avg ticket handling time × agent cost
Insight: AI chat deflects 300 tickets/week = 75 hours saved = $X savings
Action: Demonstrate ROI to leadership for continued investment
Customer lifetime value:
Track: Do users who chat successfully have higher LTV?
Calculate: Average LTV of users with successful chat experience
Insight: Good chat experience correlates with retention
Action: Prioritize chat quality as retention investment
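The support cost calculation above is simple enough to keep in a small script so the assumptions stay visible; every input below is a placeholder you would replace with your own numbers.

```python
# Sketch: rough weekly savings from chat deflection (all inputs are assumptions).
chat_resolutions_per_week = 300   # conversations the AI resolved without handoff
avg_ticket_minutes = 15           # agent time an equivalent ticket would have taken
agent_hourly_cost = 40.0          # fully loaded cost per agent hour

hours_saved = chat_resolutions_per_week * avg_ticket_minutes / 60
print(f"~{hours_saved:.0f} agent hours, ~${hours_saved * agent_hourly_cost:,.0f} saved per week")
```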

Optimization Strategies for Chat

Speed Optimization

If response times exceed 10 seconds: Investigate causes:
  1. Check data provider sync status (slow providers delay responses)
  2. Review knowledge retrieval performance (too many sources slow search)
  3. Test during peak vs. off-peak (infrastructure scaling issue?)
  4. Check external API timeouts (integration dependencies)
Optimization tactics:
Strategy 1: Prioritize fast data providers
- Snippets respond instantly (direct knowledge)
- Tables respond quickly (structured data)
- Large PDFs respond slowly (full-text search)
- External APIs vary (depends on their performance)

Action: Move frequently-accessed info to snippets

Strategy 2: Optimize knowledge scope
- Reduce total knowledge volume if extremely large (100k+ documents)
- Remove outdated or rarely-used data providers
- Use audience targeting to limit knowledge per conversation

Strategy 3: Implement caching
- Common questions should hit cached responses
- Review top 20 questions and ensure answers are readily available

Abandonment Reduction

If abandonment rate exceeds 25%:
Step 1: Identify abandonment points. Filter to unresolved conversations and read 30-50:
  • Do users leave after 1st AI response? (Answer quality issue)
  • Do users leave after 3-4 messages? (Follow-up knowledge gaps)
  • Do users leave after long delays? (Response time issue)
Step 2: Segment by topic. Use the Topics page to identify which topics have the highest abandonment:
Example:
- "Refund Policy": 40% unresolved (high abandonment)
- "Product Features": 15% unresolved (normal)
- "Billing": 35% unresolved (high abandonment)

Focus: Improve Refund Policy and Billing knowledge first
Step 3: Improve the highest-impact areas. For each high-abandonment topic:
  1. Read 20 conversations about that topic
  2. Note what information users needed but didn’t get
  3. Add that information to knowledge base
  4. Update guidance for better handling of that topic
  5. Monitor next week’s abandonment rate for that topic

CSAT Improvement

If chat CSAT is below 70%, use this diagnosis process:
  1. Filter to low ratings:
    • Conversations with 1-2 star ratings
    • Last 30 days
    • Chat channels only
  2. Categorize complaints:
    • Speed issues: “Too slow” “Took forever” “Waited too long”
    • Accuracy issues: “Wrong answer” “Didn’t help” “Incorrect information”
    • Tone issues: “Rude” “Unhelpful” “Robotic” “Formal”
    • Completeness issues: “Not enough detail” “Vague” “Didn’t answer my question”
  3. Quantify each category:
    From 50 low-rated conversations:
    - 22 (44%) cited accuracy/completeness issues
    - 12 (24%) cited tone/helpfulness issues
    - 10 (20%) cited speed issues
    - 6 (12%) other/unclear
    
    Focus: Fix accuracy/completeness first (highest impact)
    
  4. Address root causes:
    • Accuracy issues → Add knowledge, improve data provider quality
    • Tone issues → Refine guidance, add tone examples
    • Speed issues → Optimize response time (see Speed Optimization above)
    • Completeness issues → Ensure answers directly address questions
  5. Measure improvement:
    • Track CSAT weekly after changes
    • Read follow-up low-rated conversations to verify issues resolving
    • Target 3-5 percentage point improvement per month
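To make step 2’s categorization repeatable across dozens of comments, a simple keyword tagger is often enough. The categories mirror the ones above; the keyword lists and sample comments are illustrative, and anything they miss falls into “other/unclear” for manual review.

```python
# Sketch: tag low-rated conversation comments into complaint categories by keyword.
from collections import Counter

CATEGORIES = {
    "accuracy/completeness": ["wrong", "incorrect", "didn't help", "vague", "didn't answer"],
    "tone/helpfulness":      ["rude", "robotic", "formal", "unhelpful"],
    "speed":                 ["slow", "forever", "waited", "too long"],
}

def categorize(comment):
    text = comment.lower()
    for category, keywords in CATEGORIES.items():
        if any(keyword in text for keyword in keywords):
            return category
    return "other/unclear"

comments = [  # illustrative survey comments
    "Took forever to get a reply",
    "Gave me the wrong answer about refunds",
    "Felt robotic and way too formal",
    "The answer was vague and didn't answer my question",
]

print(Counter(categorize(c) for c in comments))
```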

Escalation Optimization

If escalation rate exceeds 20% for chat, determine whether the escalations are appropriate by reading 30-50 escalated conversations.
Appropriate escalations (don’t try to eliminate):
- Account-specific information (billing history, private data)
- Complex troubleshooting (requires technical expertise)
- Emotional situations (angry customer needs human empathy)
- Policy exceptions (requires human judgment)
- Sales negotiations (pricing, contracts)
Avoidable escalations (optimize these):
- Simple questions AI should know ("What are your hours?")
- Follow-up questions after partial answer (missing knowledge)
- User requested human because AI was unhelpful (quality issue)
- AI gave up too easily ("I don't have that information" without trying)
- Escalation triggered by keyword but not actually complex
Reduce avoidable escalations:
  1. Add missing knowledge:
    • Create snippets for questions AI incorrectly escalates
    • Ensure common questions have direct answers
  2. Adjust escalation rules:
    • Review trigger keywords (too sensitive?)
    • Increase AI confidence threshold for autonomous handling
    • Add more specific escalation criteria (not just keywords)
  3. Improve guidance:
    • Instruct AI to attempt answering before escalating
    • Provide examples of when to escalate vs. when to try
    • Add guidance for handling follow-up questions
Target: Reduce avoidable escalations by 30-50% while maintaining appropriate escalations for complex cases.

Troubleshooting Common Chat Issues

Response Time Suddenly Increased

Symptoms: Average response time jumped from 5 seconds to 20+ seconds. Diagnosis steps:
  1. Check deployment status (recent changes to profile or knowledge?)
  2. Review data provider sync health (any providers failing or timing out?)
  3. Test in Preview Panel (is delay affecting all questions or specific topics?)
  4. Check for infrastructure issues (external APIs down?)
  5. Review traffic volume (sudden spike causing congestion?)
Solutions:
  • Roll back recent profile changes if timing correlates
  • Disable slow or failing data providers temporarily
  • Contact support if infrastructure issue
  • Scale up resources if traffic spike is sustained

CSAT Dropped Suddenly

Symptoms: CSAT was 80%, now 65% with no obvious change. Diagnosis steps:
  1. Compare date ranges: When exactly did drop occur?
  2. Check for deployments: Did profile change around that time?
  3. Read recent low-rated conversations: What specific complaints?
  4. Check if drop is topic-specific: Filter by topic, see if isolated
  5. Review team changes: New agents affecting handoffs?
Solutions:
  • If after deployment: Review changes, roll back if necessary
  • If topic-specific: Add knowledge for that topic urgently
  • If tone-related: Adjust guidance for better tone
  • If unclear: Keep monitoring, may be temporary anomaly

High Abandonment Rate

Symptoms: 40%+ of conversations ending unresolved. Diagnosis steps:
  1. Calculate abandonment by message count (where are users leaving?)
  2. Check response times (slow responses causing abandonment?)
  3. Read abandoned conversations (what questions not being answered?)
  4. Filter by topic (specific topics causing abandonment?)
  5. Compare to previous period (sudden change or gradual trend?)
Solutions:
  • Immediate triage: Add knowledge for top 3 abandonment topics
  • Speed fixes: Optimize slow data providers
  • Quality fixes: Improve initial response relevance
  • Follow-up fixes: Add knowledge for common follow-up questions

Users Repeatedly Asking Same Question

Symptoms: Multi-message conversations where user asks same question 2-3 times. Diagnosis:
  • AI not directly answering the question
  • AI providing related info instead of specific answer
  • AI’s answer buried in long response (user didn’t see it)
  • AI misunderstanding question due to phrasing
Solutions:
  1. Improve answer directness:
    • Update guidance: “Answer the specific question asked in first sentence”
    • Add examples of direct vs. indirect answers
  2. Add knowledge variations:
    • Create snippets with multiple phrasings of same question
    • Include common follow-up formulations
  3. Improve conciseness:
    • Shorten responses so key info isn’t buried
    • Lead with direct answer, offer more detail after

Chat Works Well in Testing, Poorly in Production

Symptoms: Preview Panel conversations look great, but real user CSAT is low. Common causes:
  1. Test questions don’t match real user questions:
    • Testing with well-formed questions
    • Real users ask with typos, informal language, context
    • Solution: Test with exact user questions from conversation history
  2. Testing during off-peak, production during peak:
    • Response time fine in testing, slow during traffic
    • Solution: Test during actual peak hours
  3. Testing individual messages, not full conversations:
    • Single-message test looks good
    • Multi-turn dialogue breaks down
    • Solution: Test full conversation flows, not isolated Q&A
  4. Different audiences or channels:
    • Testing Web channel, production includes Slack with different expectations
    • Solution: Test across all active channels

Next Steps

Now that you understand chat performance monitoring, remember that optimization is continuous. Monitor daily, review weekly, and make focused improvements monthly. Small, consistent enhancements to response time, answer quality, and conversation flow compound into dramatically better chat experiences that drive user satisfaction, engagement, and business outcomes.