Vision

botBrains Vision transforms you AI agent from a text-only chatbot into a multimodal support system that can see and understand images. When customers share screenshots, product photos, error messages, or visual issues, your AI can analyze the content and provide intelligent, context-aware assistance - just like a human support agent would.

Why Vision Matters

Visual communication is essential for effective customer support. Many issues are easier to show than describe, and images provide context that text alone cannot capture. Vision enables your AI to: Resolve Issues Faster - Customers can share screenshots of error messages, product defects, or installation problems instead of trying to describe them in words. Your AI sees exactly what they see. Reduce Customer Effort - Instead of asking customers to transcribe error codes, describe visual problems, or navigate complex menus to find information, let them simply take a photo. Handle Complex Support Scenarios - Troubleshoot visual issues like UI problems, physical product defects, receipt verification, installation guidance, and document processing. Improve First-Contact Resolution - With visual context, your AI can diagnose problems more accurately and provide solutions without multiple back-and-forth exchanges. Scale Visual Support - Provide the same level of visual analysis your best support agents offer, without hiring more staff.

Customers using image upload resolve issues 40% faster on average compared to text-only conversations. Visual context eliminates ambiguity and speeds up problem diagnosis.

How Vision Works

When a customer uploads an image during a conversation, botBrains processes it through a sophisticated multimodal AI system:

1. Image Upload

Customers can upload images directly in the chat widget (if enabled). The system accepts common image formats and validates them for security and compatibility.

2. Secure Processing

Uploaded images are:

Scanned for security threats and malicious content
Validated for format and file size
Stored securely with encryption at rest
Processed using privacy-preserving techniques

3. Visual Understanding

The AI analyzes the image using advanced computer vision capabilities:

Text extraction: Reads error messages, labels, serial numbers, and any visible text
Object recognition: Identifies products, components, UI elements, and physical objects
Scene understanding: Comprehends context like settings, environments, and visual relationships
Problem detection: Recognizes errors, defects, damage, or incorrect configurations

4. Contextual Response

The AI synthesizes visual information with:

Conversation history and context
Your knowledge base and documentation
Customer information and past interactions
Guidance instructions for handling specific scenarios

5. Intelligent Action

Based on visual analysis, the AI can:

Answer questions about what it sees
Provide troubleshooting steps for visible errors
Guide customers through processes using visual references
Escalate to human agents when needed, with image context preserved

Vision uses the same multimodal AI models that power your text conversations, ensuring consistent intelligence and brand voice across all interactions.

Supported Image Formats

botBrains Vision supports all common image formats:

Standard Formats

JPEG (.jpg, .jpeg) - Most common photograph format
PNG (.png) - Screenshots and graphics with transparency
WebP (.webp) - Modern efficient format

Advanced Formats

HEIC (.heic) - iPhone and modern iOS camera format
HEIF (.heif) - High-efficiency image format
GIF (.gif) - Animated or static images

File Size and Quality

Maximum file size: Determined by your project settings (typically 10-25 MB)
Automatic optimization: Large images are compressed to balance quality and processing speed
Resolution: Higher resolution images provide more detail but take longer to process
Best practice: 1920x1080 or lower for screenshots; original resolution for detailed product photos

For screenshots, use PNG format for crisp text readability. For photographs, JPEG provides the best balance of quality and file size.

Common Use Cases

Product Defect Identification

Scenario: Customer receives a damaged product or notices a manufacturing defect. How Vision Helps:

Customer photographs the defect
AI analyzes the image to assess severity and type of damage
AI determines if it’s a manufacturing defect, shipping damage, or normal wear
AI can initiate return, replacement, or refund workflows automatically

Example Interaction:

Customer: "The corner of my laptop is dented"
[Uploads photo showing dent]

AI: "I can see the dent on the bottom-left corner of your laptop. This
appears to be shipping damage rather than a manufacturing defect. I'll
process a replacement shipment right away. You can keep the current
device or return it - whichever you prefer. Your replacement will ship
within 24 hours."

Error Screenshot Analysis

Scenario: Customer encounters an error message they don’t understand. How Vision Helps:

Customer shares screenshot of the error
AI reads error codes, messages, and visible context
AI matches error to known issues in documentation
AI provides specific troubleshooting steps

Example Interaction:

Customer: "I'm getting an error when I try to log in"
[Uploads screenshot of error: "Connection timeout - Error 504"]

AI: "The error code 504 indicates a temporary server timeout. This
usually resolves within a few minutes. Try these steps:
1. Wait 2-3 minutes and try again
2. Clear your browser cache
3. Try a different browser
If the issue persists after 10 minutes, our servers may be experiencing
higher-than-normal load. I can notify you when service is restored."

Document and Receipt Processing

Scenario: Customer needs to submit receipts, invoices, or proof of purchase. How Vision Helps:

Customer uploads receipt photo
AI extracts purchase details, amounts, dates, and item information
AI verifies against order history
AI processes warranty claims, returns, or reimbursements

Example Interaction:

Customer: "I need to return this item but I lost the order number"
[Uploads photo of receipt]

AI: "I found your order from the receipt. You purchased the Premium
Widget on March 15, 2024 for $89.99 (Order #ORD-12345). You're within
the 30-day return window. I've created a prepaid return label and
emailed it to you. Once we receive the item, your refund will process
within 5-7 business days."

Installation Verification

Scenario: Customer installing hardware or software needs verification they did it correctly. How Vision Helps:

Customer shares photo of their setup
AI verifies correct installation, configuration, or assembly
AI identifies mistakes or missing steps
AI provides corrective instructions with visual references

Example Interaction:

Customer: "I think I installed the router correctly but internet isn't working"
[Uploads photo of router with cables]

AI: "I can see your router setup. The issue is that the WAN cable (blue)
is plugged into LAN port 2 instead of the WAN port. Here's what to do:
1. Unplug the blue cable from LAN port 2
2. Plug it into the WAN port (the one separated from the others, usually yellow)
3. Wait 30 seconds for the router to restart
The WAN light should turn solid green when connected properly."

Visual Troubleshooting

Scenario: Customer experiencing visual or UI problems with software. How Vision Helps:

Customer screenshots the problem
AI identifies UI elements, states, and potential causes
AI guides customer through fixes or workarounds
AI can escalate with full visual context if needed

Example Interaction:

Customer: "The upload button disappeared"
[Uploads screenshot showing file manager interface]

AI: "I can see the issue - you're in 'View Only' mode. The upload button
is hidden when you don't have edit permissions. To fix this:
1. Click on your username in the top-right corner
2. Select 'Switch to Edit Mode'
3. The upload button will appear in the top toolbar
If you don't see 'Switch to Edit Mode', you may need to request editor
access from your team administrator."

Serial Number and Model Identification

Scenario: Customer needs support but doesn’t know their product model or version. How Vision Helps:

Customer photographs serial number sticker or product label
AI reads model number, serial number, and manufacturing information
AI looks up specific documentation and support for that exact model
AI provides personalized troubleshooting

Example Interaction:

Customer: "How do I update the firmware?"
[Uploads photo of product label showing model]

AI: "You have Model X-2000-B (serial: SN-12345). This model requires
the 2.4 firmware update process:
1. Download firmware v2.4.1 for the X-2000-B series: [link]
2. Connect via USB (not WiFi for this model)
3. Run the firmware installer
Note: The X-2000-B has different update steps than the X-2000-A, so
generic instructions won't work. Follow the model-specific guide here: [link]"

Enabling Vision

Vision capabilities are controlled at the frame (widget) level, allowing you to enable or disable image upload for different deployments.

Enable Image Upload in Frames

Navigate to Frames in your botBrains project
Select the frame you want to configure
Go to the General Settings tab
Toggle Allow Image Upload to enabled
Save your frame configuration

Changes to frame settings take effect immediately. Customers will see the image upload button in the chat widget after you save.

Configure in Guidance

While image upload is a frame-level setting, you can add guidance instructions to help your AI understand and respond to specific types of images:

When a customer uploads an image of a product defect:
Analyze the image carefully to identify the type and severity of the defect
Determine if it's a manufacturing defect, shipping damage, or user damage
For manufacturing defects: offer immediate replacement
For shipping damage: apologize and process replacement with expedited shipping
For user damage: explain it's not covered by warranty but offer repair options
Always be empathetic and focused on resolving the customer's problem

Model Compatibility

Vision capabilities are available with multimodal AI models:

Claude 3.5 Sonnet (recommended for best vision performance)
Claude 3 Opus
Claude 3 Haiku (faster, good for simple image tasks)
GPT-4o and GPT-4 Turbo with Vision

Claude 3.5 Sonnet provides the best balance of speed, cost, and vision accuracy for most customer support scenarios.

Best Practices

When to Use Vision

Excellent Use Cases
Poor Use Cases

Error messages and screenshots
Product defects or damage
Installation verification
Document and receipt processing
Serial number identification
UI/UX troubleshooting
Visual configuration issues
Physical setup verification

Guidance for Handling Images

Set Clear Expectations Tell customers what types of images are helpful:

If you're experiencing an error, please share a screenshot showing:
- The complete error message
- The URL or page you're on
- Any error codes or numbers visible

Guide Image Quality Help customers provide useful images:

When photographing a product defect:
- Use good lighting
- Get close enough to show detail
- Include the full product in the frame
- Take multiple angles if helpful

Acknowledge Visual Analysis Let customers know you’ve analyzed their image:

"Looking at your screenshot, I can see the error code 404 on the checkout
page. This indicates..."

"In the photo you shared, I notice the connector cable is plugged into
the wrong port..."

Be Specific About What You See Reference specific visual elements:

Good: "The red error icon in the top-right corner indicates..."
Poor: "I see there's an error..."

Privacy and Sensitivity

Don’t Request Sensitive Visual Information Avoid asking for images of:

Credit cards or payment information
Personal identification documents (unless absolutely necessary and encrypted)
Medical conditions or private health information
Passwords or security codes visible on screen

Redaction Guidance When sensitive information might be visible, instruct customers to redact:

"If you're sharing a screenshot of your account page, please blur or
cover any personal information like your email address or account number
before uploading."

Auto-Delete After Resolution Consider automatically deleting images after:

Issue is resolved
A defined retention period (30-90 days)
Customer requests deletion

Performance Optimization

Encourage Appropriate File Sizes

Screenshots: PNG at screen resolution (typically 1-3 MB)
Photos: JPEG compressed to 2-5 MB for sufficient quality
Avoid unnecessarily large files that slow processing

Set Upload Limits Configure reasonable limits:

Maximum 5-10 images per conversation
Maximum file size of 10-25 MB per image
Automatic compression for oversized files

Process Efficiently

Analyze images only when relevant to the conversation
Don’t ask for images if text would suffice
Cache common visual patterns (error screenshots, product photos)

Security and Privacy

Image Security

All uploaded images are:

Scanned on upload for malware, viruses, and malicious content
Validated for format to prevent exploit attempts
Encrypted at rest using AES-256 encryption
Transmitted securely over HTTPS/TLS
Access-controlled with role-based permissions

Data Privacy

Storage and Retention

Images are stored in secure, encrypted cloud storage
Access is logged and auditable
Retention policies configurable per project
Automatic deletion after retention period expires

Processing Privacy

Images are processed by AI models without human review (unless escalated)
No image data is used to train AI models
Processing is ephemeral - images are analyzed in-memory and not cached
Compliance with GDPR, CCPA, and other privacy regulations

Customer Rights Customers can:

Request deletion of their uploaded images
Access copies of images they’ve uploaded
Opt out of image upload feature

Images uploaded by customers may contain sensitive information. Implement appropriate guidance to prevent collection of regulated data types (PHI, PII, payment card data) unless your project is specifically configured for compliance.

Compliance Considerations

GDPR Compliance

Images are personal data and subject to GDPR
Provide clear privacy notices about image processing
Honor data subject rights (access, deletion, portability)
Implement data retention limits

HIPAA Compliance

Do not enable image upload for healthcare applications unless:
- You have a BAA (Business Associate Agreement) with botBrains
- Images are encrypted end-to-end
- Access is strictly controlled and audited

PCI DSS Compliance

Never request or accept images containing:
- Credit card numbers visible in screenshots
- CVV codes or security codes
- Full payment card images (front or back)

Troubleshooting Vision

Image Upload Not Available

Check frame configuration:

Navigate to Frames > General Settings
Verify “Allow Image Upload” is enabled
Save and refresh the chat widget

Check file size limits:

Ensure images are under maximum file size
Try compressing large images before upload

Check browser compatibility:

Image upload requires modern browsers (Chrome 90+, Safari 14+, Firefox 88+)
Mobile browsers may have different file size limits

AI Not Understanding Images

Improve image quality:

Use higher resolution for detailed analysis
Ensure good lighting for photographs
Avoid blurry or low-quality images

Provide context in text:

Instead of: [uploads image]
Better: "Here's a screenshot of the error I'm seeing" [uploads image]

Use appropriate image types:

Screenshots for digital errors (PNG format)
Photographs for physical products (JPEG format)
Avoid highly compressed or low-resolution images

Vision Analysis Errors

Model limitations:

Some images may be too complex or ambiguous
Very small text may be difficult to read
Extreme lighting conditions can affect recognition

Fallback to human support: Configure guidance to escalate when:

If you cannot confidently analyze the image or the issue requires
human judgment, say: "I'm having trouble analyzing this image clearly.
Let me connect you with a specialist who can review this in detail."
Then use `escalate_to_human` with the image attached.

Performance Issues

Slow processing:

Large images (>10 MB) take longer to process
Complex images with many elements require more analysis time
Multiple concurrent image uploads may be queued

Optimize performance:

Encourage customers to compress screenshots
Set reasonable file size limits (5-10 MB)
Use faster models (Claude 3 Haiku) for simple image tasks

Examples of Vision in Action

Example 1: E-commerce Returns

Customer uploads photo of damaged box

Customer: "My order arrived damaged"
[Uploads photo showing crushed shipping box]

AI: "I'm sorry your order arrived in that condition. I can see significant
damage to the shipping box. Let's get this resolved:

Based on the damage visible in your photo, I'm processing:
✓ Full refund: $156.78 (credited within 3-5 business days)
✓ Replacement shipment: Expedited 2-day delivery at no charge
✓ Return label: No need to return the damaged item

Your replacement order is #ORD-67890. You'll receive tracking within
2 hours. Is there anything else I can help with?"

Example 2: Technical Support

Customer uploads screenshot of software error

Customer: "Getting this error when I export"
[Uploads screenshot: "Export failed: Invalid format specified (ERR-2045)"]

AI: "I can see error code ERR-2045 in your screenshot. This happens when
the export format doesn't match your file type. Looking at your screenshot,
you're trying to export a video file (.mp4) as an audio-only format (.mp3).

Here's how to fix it:
1. In the Export dialog, change 'Format' from 'MP3' to 'MP4'
2. Or, if you want just the audio, check 'Extract Audio' first
3. Then select MP3 format

The option you need is in the dropdown menu visible in your screenshot,
just to the right of where the error appeared. Try that and let me know
if you still have issues!"

Example 3: Assembly Verification

Customer uploads photo of partially assembled furniture

Customer: "Is this right? The instructions are confusing"
[Uploads photo of desk with parts laid out]

AI: "Looking at your photo, you're on the right track! You have all the
parts correctly identified and the frame pieces are in the right position.

However, I notice one issue: the support bracket (the L-shaped metal piece
on the left) needs to be rotated 90 degrees. The long side should face up,
not forward.

Once you rotate that bracket:
1. Attach it to the frame using the 4 long screws (visible in your photo)
2. Then attach the desktop (the large piece in the background)
3. Finally, add the drawer assembly

You're actually further along than the instructions show - good work!
Need any other guidance?"

Now that you understand Vision, explore related features:

Guidance - Configure how your AI responds to visual queries
Actions - Enable tools for processing visual information
Frames - Configure image upload in your chat widget
Knowledge - Combine visual analysis with documentation
Conversations - Review how Vision is used in customer interactions

Enable Image Upload

Configure Vision capabilities in your chat widget settings

Getting Started

Core Concepts

Train

Deploy

Analyze

Guides

Security

Other

More

​Vision

​Why Vision Matters

​How Vision Works

​1. Image Upload

​2. Secure Processing

​3. Visual Understanding

​4. Contextual Response

​5. Intelligent Action

​Supported Image Formats

​Standard Formats

​Advanced Formats

​File Size and Quality

​Common Use Cases

​Product Defect Identification

​Error Screenshot Analysis

​Document and Receipt Processing

​Installation Verification

​Visual Troubleshooting

​Serial Number and Model Identification

​Enabling Vision

​Enable Image Upload in Frames

​Configure in Guidance

​Model Compatibility

​Best Practices

​When to Use Vision

​Guidance for Handling Images

​Privacy and Sensitivity

​Performance Optimization

​Security and Privacy

​Image Security

​Data Privacy

​Compliance Considerations

​Troubleshooting Vision

​Image Upload Not Available

​AI Not Understanding Images

​Vision Analysis Errors

​Performance Issues

​Examples of Vision in Action

​Example 1: E-commerce Returns

​Example 2: Technical Support

​Example 3: Assembly Verification

​Related Documentation

Enable Image Upload

Vision

Why Vision Matters

How Vision Works

1. Image Upload

2. Secure Processing

3. Visual Understanding

4. Contextual Response

5. Intelligent Action

Supported Image Formats

Standard Formats

Advanced Formats

File Size and Quality

Common Use Cases

Product Defect Identification

Error Screenshot Analysis

Document and Receipt Processing

Installation Verification

Visual Troubleshooting

Serial Number and Model Identification

Enabling Vision

Enable Image Upload in Frames

Configure in Guidance

Model Compatibility

Best Practices

When to Use Vision

Guidance for Handling Images

Privacy and Sensitivity

Performance Optimization

Security and Privacy

Image Security

Data Privacy

Compliance Considerations

Troubleshooting Vision

Image Upload Not Available

AI Not Understanding Images

Vision Analysis Errors

Performance Issues

Examples of Vision in Action

Example 1: E-commerce Returns

Example 2: Technical Support

Example 3: Assembly Verification

Related Documentation