Overview
The web browser tool lets your Utari workers actively browse and interact with websites. Unlike web search, which finds information, the browser tool lets workers visit specific URLs, navigate through pages, capture screenshots, and extract content, essentially giving your worker a functional web browser.
Web Browser Capabilities
When you enable the web browser tool, your worker gains four powerful capabilities:
Browser Navigate To
Visit specific URLs and navigate through websites
Browser Screenshot
Capture visual screenshots of web pages
Browser Extract Content
Extract text, data, and content from web pages
Browser App
Launch and interact with web applications
The web browser tool is ideal for tasks that require visual verification, page interaction, or when you need to see exactly how a website appears.
Enabling the Web Browser Tool
1. Navigate to Worker Tools
Select your worker and click on the Tools tab.
2. Search for Web Browser
Scroll through available tools or use the search function to find Web Browser.
3. Enable the Tool
Check the box next to Web Browser to activate all four capabilities:
- Browser app
- Browser extract content
- Browser navigate to
- Browser screenshot
4. Customize Capabilities (Optional)
Click on the tool settings to enable or disable specific capabilities based on your needs.
5. Save Configuration
Your changes are automatically saved. The worker can now browse websites.
Using the Web Browser Tool
Capturing Screenshots
1. Start a Chat
Open a conversation with a worker that has the web browser tool enabled.
2. Request Screenshot
Ask your worker to visit a website and capture a screenshot.
3. Handle Preview Warning
You may see a preview warning when the browser activates. Click “I understand” to dismiss the warning and proceed.
4. Review Screenshot
The worker will:
- Navigate to the specified URL
- Wait for the page to load
- Capture a screenshot
- Display the screenshot in the chat
Example Screenshot Request Patterns
Browser Capabilities in Detail
Navigate To
Direct navigation to specific URLs and pages.
Extract Content
Pull specific information from web pages.
Browser App
Interact with web applications.
Common Use Cases
Competitive Analysis
Monitor Competitor Websites
Take regular screenshots of competitor homepages, pricing pages, or product launches to track changes over time.
Website Monitoring
Track Website Changes
Capture periodic screenshots to monitor your own website or detect issues.
Design Review
Visual Design Verification
Review how designs appear in actual browsers across different pages.
Content Verification
Check Published Content
Verify that content appears correctly after publishing.
Documentation
Create Visual Documentation
Capture screenshots for tutorials, help docs, or training materials.
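For the monitoring use cases above, one simple way to tell whether a page changed between two captures is to compare the saved screenshot files by hash. The sketch below is not part of Utari: it assumes you have saved two screenshots as local files, and the function names are hypothetical. It only detects byte-level differences, so any rendering change (a rotating banner, a timestamp) counts as a change.

```python
import hashlib
from pathlib import Path


def file_digest(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def screenshots_differ(previous: Path, current: Path) -> bool:
    """True when the two files are not byte-for-byte identical."""
    return file_digest(previous) != file_digest(current)
```

A byte-level check like this is a cheap first filter; deciding what actually changed still needs a visual review.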
Advanced Browser Commands
Multi-Step Navigation
Navigate through complex user flows.
Content Extraction with Screenshots
Combine visual and textual information.
Comparative Analysis
Compare multiple pages side by side.
Scheduled Monitoring
Use with triggers for automated monitoring.
Combining with Other Tools
The web browser tool becomes more powerful when combined with other Utari capabilities:
+ Image Vision
Capture screenshots, then use image vision to analyze design elements, colors, and layout
+ Files and Folder
Save screenshots in organized folders for archiving and comparison
+ Document Creator
Create reports with embedded screenshots from multiple websites
+ Web Search
Search for websites, then use browser to visit and capture them
+ Task Management
Create multi-step website audits with screenshots at each checkpoint
+ Knowledge Base
Compare screenshots against brand guidelines or design standards
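When screenshots are archived for later comparison, a predictable naming scheme helps. The following is a minimal sketch of one possible layout, a folder per capture date with a file name derived from the URL. The layout, the function name `screenshot_path`, and the `.png` extension are all assumptions for illustration, not something Utari prescribes.

```python
from datetime import date
from pathlib import Path
from urllib.parse import urlparse


def screenshot_path(root: Path, url: str, captured_on: date) -> Path:
    """Build root/YYYY-MM-DD/<host>_<page>.png for a captured URL."""
    parsed = urlparse(url)
    host = parsed.netloc.replace(":", "_")
    # Use "home" when the URL has no path beyond the domain.
    page = parsed.path.strip("/").replace("/", "_") or "home"
    return root / captured_on.isoformat() / f"{host}_{page}.png"
```

For example, `screenshot_path(Path("screenshots"), "https://example.com/pricing", date(2024, 1, 15))` yields `screenshots/2024-01-15/example.com_pricing.png`, so captures of the same page on different dates sit in sibling folders.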
Example Combined Workflows
Best Practices
Use Full URLs
Always provide complete URLs including https:// for reliable navigation
Be Patient
Allow time for pages to fully load before requesting screenshots
Specify Sections
For large pages, specify which section you want captured or extracted
Organize Screenshots
Save screenshots systematically in dated folders for easy tracking
Document Context
When saving screenshots, include dates and context for future reference
Clear Instructions
Provide step-by-step navigation instructions for complex flows
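If you build a list of URLs to hand to a worker, it can help to check each one first so that every entry is complete. The sketch below uses Python's standard `urllib.parse`. The helper name `ensure_full_url` is hypothetical, and defaulting to `https://` when no scheme is given is an assumption rather than documented Utari behavior.

```python
from urllib.parse import urlparse


def ensure_full_url(raw: str) -> str:
    """Return raw as a complete http(s) URL, adding https:// if no scheme is given."""
    candidate = raw.strip()
    if "://" not in candidate:
        candidate = "https://" + candidate
    parsed = urlparse(candidate)
    # Reject non-web schemes and inputs that have no host part.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"not a complete web URL: {raw!r}")
    return candidate
```

The function only checks the URL's shape. It does not confirm that the page exists or is publicly accessible.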
Workflow Examples
Weekly Competitor Monitoring
1. Create Monitoring List
2. Capture Screenshots
3. Organize Results
4. Analysis Report
Website Audit Process
1. Homepage Review
2. Key Pages
3. Extract Content
4. Analysis
5. Report Generation
Product Launch Documentation
1. Pre-Launch
2. Launch Day
3. Extract Details
4. Archive
Troubleshooting
Browser tool not activating
Verify that:
- Web browser tool is enabled in worker configuration
- All necessary capabilities are checked
- You’re providing valid, complete URLs
- You’ve dismissed any preview warnings
Screenshots are blank or incomplete
Try:
- Waiting a few seconds for the page to fully load
- Asking the worker to “wait for the page to load, then take a screenshot”
- Checking if the website blocks automated access
- Verifying the URL is correct and publicly accessible
Can't access certain websites
Check if:
- The website requires login credentials
- The site blocks automated browser access
- The URL is behind a firewall or VPN
- The website has geo-restrictions
If none of these apply, try a different page on the same domain.
Preview warning keeps appearing
Click “I understand” each time the warning appears. It is a security feature that confirms you want the worker to access external websites and protects you from unintended navigation.
Extracted content is incomplete
Improve extraction by:
- Being more specific about what content to extract
- Specifying the section or element you need
- Asking for specific data types (headings, prices, lists, etc.)
- Trying multiple extraction commands for complex pages
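The tips above amount to naming the page, the section, and the data types in a single request. As a rough illustration, the sketch below assembles such a request as a chat message. The wording of the message and the function name `extraction_request` are assumptions; workers take plain-language instructions, so no exact phrasing is required.

```python
def extraction_request(url: str, section: str, data_types: list[str]) -> str:
    """Compose a specific extraction instruction to send as a chat message."""
    if not data_types:
        raise ValueError("name at least one data type to extract")
    # Join the data types as "a", "a and b", or "a, b and c".
    if len(data_types) == 1:
        wanted = data_types[0]
    else:
        wanted = ", ".join(data_types[:-1]) + " and " + data_types[-1]
    return (
        f"Go to {url}, wait for the page to load, "
        f"then extract only the {wanted} from the {section}."
    )
```

Templating requests this way keeps them consistent across pages, which makes the extracted results easier to compare.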
Navigation fails on multi-step processes
Privacy and Security
Limitations
Current Limitations:
- Cannot interact with pages requiring login
- Limited JavaScript interaction capabilities
- May not work with all single-page applications (SPAs)
- Cannot fill out forms or click buttons (yet)
- Works best with publicly accessible, static content
Summary
You’ve successfully learned how to:
- Enable and configure the web browser tool with all four capabilities
- Navigate to specific websites and capture screenshots
- Extract content from web pages for analysis
- Handle browser preview warnings
- Combine browser capabilities with other Utari tools for powerful workflows
- Apply best practices for effective web browsing and monitoring