Vision
botBrains Vision transforms you AI agent from a text-only chatbot into a multimodal support system that can see and understand images. When customers share screenshots, product photos, error messages, or visual issues, your AI can analyze the content and provide intelligent, context-aware assistance - just like a human support agent would.Why Vision Matters
Visual communication is essential for effective customer support. Many issues are easier to show than describe, and images provide context that text alone cannot capture. Vision enables your AI to: Resolve Issues Faster - Customers can share screenshots of error messages, product defects, or installation problems instead of trying to describe them in words. Your AI sees exactly what they see. Reduce Customer Effort - Instead of asking customers to transcribe error codes, describe visual problems, or navigate complex menus to find information, let them simply take a photo. Handle Complex Support Scenarios - Troubleshoot visual issues like UI problems, physical product defects, receipt verification, installation guidance, and document processing. Improve First-Contact Resolution - With visual context, your AI can diagnose problems more accurately and provide solutions without multiple back-and-forth exchanges. Scale Visual Support - Provide the same level of visual analysis your best support agents offer, without hiring more staff.How Vision Works
When a customer uploads an image during a conversation, botBrains processes it through a sophisticated multimodal AI system:1. Image Upload
Customers can upload images directly in the chat widget (if enabled). The system accepts common image formats and validates them for security and compatibility.2. Secure Processing
Uploaded images are:- Scanned for security threats and malicious content
- Validated for format and file size
- Stored securely with encryption at rest
- Processed using privacy-preserving techniques
3. Visual Understanding
The AI analyzes the image using advanced computer vision capabilities:- Text extraction: Reads error messages, labels, serial numbers, and any visible text
- Object recognition: Identifies products, components, UI elements, and physical objects
- Scene understanding: Comprehends context like settings, environments, and visual relationships
- Problem detection: Recognizes errors, defects, damage, or incorrect configurations
4. Contextual Response
The AI synthesizes visual information with:- Conversation history and context
- Your knowledge base and documentation
- Customer information and past interactions
- Guidance instructions for handling specific scenarios
5. Intelligent Action
Based on visual analysis, the AI can:- Answer questions about what it sees
- Provide troubleshooting steps for visible errors
- Guide customers through processes using visual references
- Escalate to human agents when needed, with image context preserved
Vision uses the same multimodal AI models that power your text conversations, ensuring consistent intelligence and brand voice across all interactions.
Supported Image Formats
botBrains Vision supports all common image formats:Standard Formats
- JPEG (.jpg, .jpeg) - Most common photograph format
- PNG (.png) - Screenshots and graphics with transparency
- WebP (.webp) - Modern efficient format
Advanced Formats
- HEIC (.heic) - iPhone and modern iOS camera format
- HEIF (.heif) - High-efficiency image format
- GIF (.gif) - Animated or static images
File Size and Quality
- Maximum file size: Determined by your project settings (typically 10-25 MB)
- Automatic optimization: Large images are compressed to balance quality and processing speed
- Resolution: Higher resolution images provide more detail but take longer to process
- Best practice: 1920x1080 or lower for screenshots; original resolution for detailed product photos
Common Use Cases
Product Defect Identification
Scenario: Customer receives a damaged product or notices a manufacturing defect. How Vision Helps:- Customer photographs the defect
- AI analyzes the image to assess severity and type of damage
- AI determines if it’s a manufacturing defect, shipping damage, or normal wear
- AI can initiate return, replacement, or refund workflows automatically
Error Screenshot Analysis
Scenario: Customer encounters an error message they don’t understand. How Vision Helps:- Customer shares screenshot of the error
- AI reads error codes, messages, and visible context
- AI matches error to known issues in documentation
- AI provides specific troubleshooting steps
Document and Receipt Processing
Scenario: Customer needs to submit receipts, invoices, or proof of purchase. How Vision Helps:- Customer uploads receipt photo
- AI extracts purchase details, amounts, dates, and item information
- AI verifies against order history
- AI processes warranty claims, returns, or reimbursements
Installation Verification
Scenario: Customer installing hardware or software needs verification they did it correctly. How Vision Helps:- Customer shares photo of their setup
- AI verifies correct installation, configuration, or assembly
- AI identifies mistakes or missing steps
- AI provides corrective instructions with visual references
Visual Troubleshooting
Scenario: Customer experiencing visual or UI problems with software. How Vision Helps:- Customer screenshots the problem
- AI identifies UI elements, states, and potential causes
- AI guides customer through fixes or workarounds
- AI can escalate with full visual context if needed
Serial Number and Model Identification
Scenario: Customer needs support but doesn’t know their product model or version. How Vision Helps:- Customer photographs serial number sticker or product label
- AI reads model number, serial number, and manufacturing information
- AI looks up specific documentation and support for that exact model
- AI provides personalized troubleshooting
Enabling Vision
Vision capabilities are controlled at the frame (widget) level, allowing you to enable or disable image upload for different deployments.Enable Image Upload in Frames
- Navigate to Frames in your botBrains project
- Select the frame you want to configure
- Go to the General Settings tab
- Toggle Allow Image Upload to enabled
- Save your frame configuration
Changes to frame settings take effect immediately. Customers will see the image upload button in the chat widget after you save.
Configure in Guidance
While image upload is a frame-level setting, you can add guidance instructions to help your AI understand and respond to specific types of images:Model Compatibility
Vision capabilities are available with multimodal AI models:- Claude 3.5 Sonnet (recommended for best vision performance)
- Claude 3 Opus
- Claude 3 Haiku (faster, good for simple image tasks)
- GPT-4o and GPT-4 Turbo with Vision
Best Practices
When to Use Vision
- Excellent Use Cases
- Poor Use Cases
- Error messages and screenshots
- Product defects or damage
- Installation verification
- Document and receipt processing
- Serial number identification
- UI/UX troubleshooting
- Visual configuration issues
- Physical setup verification
Guidance for Handling Images
Set Clear Expectations Tell customers what types of images are helpful:Privacy and Sensitivity
Don’t Request Sensitive Visual Information Avoid asking for images of:- Credit cards or payment information
- Personal identification documents (unless absolutely necessary and encrypted)
- Medical conditions or private health information
- Passwords or security codes visible on screen
- Issue is resolved
- A defined retention period (30-90 days)
- Customer requests deletion
Performance Optimization
Encourage Appropriate File Sizes- Screenshots: PNG at screen resolution (typically 1-3 MB)
- Photos: JPEG compressed to 2-5 MB for sufficient quality
- Avoid unnecessarily large files that slow processing
- Maximum 5-10 images per conversation
- Maximum file size of 10-25 MB per image
- Automatic compression for oversized files
- Analyze images only when relevant to the conversation
- Don’t ask for images if text would suffice
- Cache common visual patterns (error screenshots, product photos)
Security and Privacy
Image Security
All uploaded images are:- Scanned on upload for malware, viruses, and malicious content
- Validated for format to prevent exploit attempts
- Encrypted at rest using AES-256 encryption
- Transmitted securely over HTTPS/TLS
- Access-controlled with role-based permissions
Data Privacy
Storage and Retention- Images are stored in secure, encrypted cloud storage
- Access is logged and auditable
- Retention policies configurable per project
- Automatic deletion after retention period expires
- Images are processed by AI models without human review (unless escalated)
- No image data is used to train AI models
- Processing is ephemeral - images are analyzed in-memory and not cached
- Compliance with GDPR, CCPA, and other privacy regulations
- Request deletion of their uploaded images
- Access copies of images they’ve uploaded
- Opt out of image upload feature
Compliance Considerations
GDPR Compliance- Images are personal data and subject to GDPR
- Provide clear privacy notices about image processing
- Honor data subject rights (access, deletion, portability)
- Implement data retention limits
- Do not enable image upload for healthcare applications unless:
- You have a BAA (Business Associate Agreement) with botBrains
- Images are encrypted end-to-end
- Access is strictly controlled and audited
- Never request or accept images containing:
- Credit card numbers visible in screenshots
- CVV codes or security codes
- Full payment card images (front or back)
Troubleshooting Vision
Image Upload Not Available
Check frame configuration:- Navigate to Frames > General Settings
- Verify “Allow Image Upload” is enabled
- Save and refresh the chat widget
- Ensure images are under maximum file size
- Try compressing large images before upload
- Image upload requires modern browsers (Chrome 90+, Safari 14+, Firefox 88+)
- Mobile browsers may have different file size limits
AI Not Understanding Images
Improve image quality:- Use higher resolution for detailed analysis
- Ensure good lighting for photographs
- Avoid blurry or low-quality images
- Screenshots for digital errors (PNG format)
- Photographs for physical products (JPEG format)
- Avoid highly compressed or low-resolution images
Vision Analysis Errors
Model limitations:- Some images may be too complex or ambiguous
- Very small text may be difficult to read
- Extreme lighting conditions can affect recognition
Performance Issues
Slow processing:- Large images (>10 MB) take longer to process
- Complex images with many elements require more analysis time
- Multiple concurrent image uploads may be queued
- Encourage customers to compress screenshots
- Set reasonable file size limits (5-10 MB)
- Use faster models (Claude 3 Haiku) for simple image tasks
Examples of Vision in Action
Example 1: E-commerce Returns
Customer uploads photo of damaged boxExample 2: Technical Support
Customer uploads screenshot of software errorExample 3: Assembly Verification
Customer uploads photo of partially assembled furnitureRelated Documentation
Now that you understand Vision, explore related features:- Guidance - Configure how your AI responds to visual queries
- Actions - Enable tools for processing visual information
- Frames - Configure image upload in your chat widget
- Knowledge - Combine visual analysis with documentation
- Conversations - Review how Vision is used in customer interactions
Enable Image Upload
Configure Vision capabilities in your chat widget settings