Supapup 🐾

MCP-aware Puppeteer wrapper for intelligent web automation

Features • Installation • Quick Start • MCP Tools • Examples • Contributing

🎬 Watch Demo Video

See Supapup in action - automated web testing with AI agents

🚀 See the Difference

❌ With Regular Puppeteer MCP

$ puppeteer navigate https://example.com
✓ Navigated to https://example.com

$ puppeteer screenshot
✓ Screenshot taken (800x600)
[📸 High token cost image data...]

$ # Now what? Need to inspect elements, write selectors, execute JS...

✅ With Supapup

$ supapup navigate https://example.com
✅ Navigation successful
📍 URL: https://example.com

AGENT PAGE VIEW
==============================
Found 42 interactive elements

📝 FORMS:
  • form: login-form
    - email: form-login-email
    - password: form-login-password  
    - submit: form-login-submit

🎛️ CONTROLS:
  • link: sign-up → signup-link
  • link: forgot-password → forgot-password-link
  • button: get-started → get-started-button
  
✨ Ready to interact! No screenshots needed.
Example: execute_action({actionId: "form-login-email", params: {value: "[email protected]"}})

The Result?

⚡ 10x faster - Instant structured data vs screenshots + manual inspection
💰 90% fewer tokens - Text-based output instead of images
🎯 Zero complexity - Semantic IDs ready to use, no CSS selectors needed
🔄 Dynamic handling - Automatically remaps after page changes

🎥 See it in Action - Watch Demo

Supapup is an intelligent web automation tool that bridges the gap between AI agents and web browsers. It wraps Puppeteer with Model Context Protocol (MCP) support, providing a structured, predictable interface for programmatic web interaction.

✨ Features

🤖 Agent-Aware Web Pages

Semantic Element IDs: Instead of brittle CSS selectors, use meaningful IDs like form-login-email
Structured Representation: Web pages are presented as organized, actionable interfaces
Automatic Element Detection: Intelligently identifies interactive elements on any page
Visual Element Mapping: Number-based element identification for easy interaction
Unique ID Generation: Handles duplicate elements intelligently - multiple "Add to Cart" buttons get unique IDs with product context

🔍 Advanced Debugging

JavaScript Breakpoints: Set conditional breakpoints and step through code
Variable Inspection: Examine local variables and expressions during debugging
Automated Function Debugging: Trigger and debug specific functions automatically

📊 Network Monitoring

Request/Response Logging: Capture all network traffic with headers and payloads
API Request Replay: Replay requests with modified parameters
Request Interception: Modify requests on-the-fly with custom rules
Network Throttling: Simulate slow connections (slow-3g, fast-3g, offline) for testing
Performance Metrics: Track page load times and resource usage

🎯 Smart Navigation

CAPTCHA Detection: Automatically identifies CAPTCHA pages
Dynamic Content Handling: Waits for AJAX/DOM changes after actions
Chunked Screenshots: Handles large pages by splitting screenshots automatically
Browser Tab Management: Open content directly in new tabs

📦 Installation

Install with npm

npm install -g supapup

Using with Claude CLI

First, ensure you have Claude CLI installed:

npm install -g @anthropic/claude-cli

Configure Supapup as an MCP server:

claude mcp add supapup "supapup"

Start using Supapup in Claude:

claude "Navigate to example.com and find all buttons"

For more Claude CLI MCP configuration:

claude mcp help

Using with Gemini CLI

Install Gemini CLI:

npm install -g @google/gemini-cli

Configure Supapup in your Gemini config:

{
  "mcpServers": {
    "supapup": {
      "command": "npx",
      "args": ["supapup"]
    }
  }
}

As a Library

npm install supapup

From Source

git clone https://github.com/jaspergreen/supapup.git
cd supapup
npm install
npm run build

🚀 Quick Start

With Claude CLI

# Add Supapup to Claude CLI
claude mcp add supapup "supapup"

# Use it in a conversation
claude "Navigate to https://example.com and click the login button"

# Or start an interactive session
claude --interactive

With Gemini CLI

# Configure in gemini-config.json then:
gemini "Use Supapup to navigate to example.com and find all forms"

🛠️ MCP Tools

Browser Management

navigate - Navigate to a URL and get agent-friendly page representation
screenshot - Capture screenshots with automatic chunking for large pages
close_browser - Close the browser instance

Element Interaction

execute_action - Execute actions on elements using semantic IDs
discover_actions - Find all available actions on the current page
get_page_state - Get current page state and element count

Visual Element Mapping

devtools_visual_element_map - Create numbered visual map of page elements
JavaScript helpers for numbered elements:
- window.__AGENT_PAGE__.clickElement(1) - Click element 1
- window.__AGENT_PAGE__.fillElement(25, "text") - Fill element 25
- window.__AGENT_PAGE__.highlightElement(100) - Highlight element 100
- window.__AGENT_PAGE__.getElementByNumber(1) - Get element DOM reference

Debugging

set_breakpoint - Set JavaScript breakpoints with optional conditions
debug_continue - Resume execution after breakpoint
debug_step_over - Step over current line
debug_evaluate - Evaluate expressions in debug context
debug_get_variables - Get local variables at breakpoint

Network Analysis

get_network_logs - Get all network requests
get_api_logs - Get detailed API request/response logs
replay_api_request - Replay requests with modifications
intercept_requests - Set up request interception rules
network_throttle - Control network speed to simulate slow connections (slow-3g, fast-3g, offline, no-throttling)

Page Analysis

get_performance_metrics - Get page load and runtime metrics
get_accessibility_tree - Get page accessibility structure
inspect_element - Get detailed element properties
evaluate_script - Execute JavaScript in page context

Content Reading

agent_read_content - Extract readable page content in markdown format for articles, search results, or any page text
- Supports pagination for large content (Wikipedia, long articles)
- Professional HTML-to-markdown conversion
- Handles pages with thousands of elements efficiently

📚 Examples

Testing a Website

claude "Test the login flow on https://myapp.com - try logging in with test credentials and tell me if it works"

claude "Navigate to https://shop.example.com and add 3 different products to cart, then check if the cart total is calculated correctly"

Debugging API Errors

claude "Go to https://myapp.com/dashboard and check the network logs for any failed API calls. Tell me what's causing the 404 errors"

claude "Monitor the API requests on https://example.com/form when I submit the form. Check if the auth token is being sent correctly"

Visual Debugging

claude "Why does the submit button on https://mysite.com/contact look broken? Take a screenshot and inspect its CSS"

claude "The layout on https://myapp.com is messed up on mobile. Can you check how it looks at 375px width and tell me what's wrong?"

Finding Elements

claude "I can't find the logout button on https://app.example.com. Can you help me locate it?"

claude "Show me all the forms on https://example.com and tell me which fields are required"

Performance Testing

claude "Check the performance metrics for https://mysite.com and tell me what's making it load slowly"

claude "Test https://heavysite.com and identify which resources are taking the longest to load"

claude "Set network throttling to slow-3g and test how https://myapp.com performs on a slow connection"

Content Reading

claude "Read the Wikipedia article at https://en.wikipedia.org/wiki/Python_(programming_language) and summarize the key points"

claude "Extract the main content from https://news.ycombinator.com and tell me what the top stories are about"

claude "Go to https://docs.python.org/3/tutorial/ and read through the first few sections to help me understand Python basics"

Automated Testing

claude "Run through the checkout process on https://shop.com: add item to cart, go to checkout, fill in the form with test data, but stop before placing the order"

claude "Test all the navigation links on https://mysite.com and tell me if any are broken"

🏗️ Architecture

Supapup consists of several specialized modules:

Agent Page Generator: Creates structured representations of web pages
Element Detector: Automatically identifies interactive elements
ID Generator: Creates unique, semantic IDs incorporating context from parent forms, headings, and element properties
Debugging Tools: Full Chrome DevTools Protocol integration
Network Tools: Comprehensive request/response monitoring
Page Analysis: Performance metrics and accessibility analysis
DevTools Elements: Visual element mapping and manipulation

ID Generation Strategy

Supapup ensures every element gets a unique, meaningful ID by:

Extracting context from parent containers (forms, sections, headings)
Including semantic meaning (label text, placeholder, button text)
Adding element type suffixes (button, link, input, checkbox)
Appending unique indices to guarantee no duplicates
Handling dynamic content with stable ID regeneration

🤝 Contributing

We welcome contributions! Please see our Contributing Guidelines for details.

Development Setup

# Clone the repository
git clone https://github.com/jaspergreen/supapup.git
cd supapup

# Install dependencies
npm install

# Run in development mode
npm run dev

# Build for production
npm run build

Running Tests

# Run test scripts
node test-agent-generator.js
node test-complex-page.cjs
node test-supapup-flow.cjs

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Built on top of Puppeteer
Implements Model Context Protocol (MCP)
Inspired by the need for better AI-browser interaction

🔗 Links

Made with ❤️ for AI agents and developers

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
agentScripts		agentScripts
bin		bin
examples/ecommerce		examples/ecommerce
scripts		scripts
src		src
test-install		test-install
.gitignore		.gitignore
.npmignore		.npmignore
CLAUDE.md		CLAUDE.md
GEMINI-QUICK-REFERENCE.md		GEMINI-QUICK-REFERENCE.md
LICENSE		LICENSE
README.md		README.md
debug-html-structure.js		debug-html-structure.js
debug-supapup.mjs		debug-supapup.mjs
debug-wikipedia-bug.js		debug-wikipedia-bug.js
gemini-setup.md		gemini-setup.md
package-lock.json		package-lock.json
package.json		package.json
remove_console_logs.py		remove_console_logs.py
remove_console_logs_v2.py		remove_console_logs_v2.py
supapup-0.1.10.tgz		supapup-0.1.10.tgz
supapup-0.1.11.tgz		supapup-0.1.11.tgz
supapup-0.1.12.tgz		supapup-0.1.12.tgz
supapup-0.1.13.tgz		supapup-0.1.13.tgz
supapup-0.1.6.tgz		supapup-0.1.6.tgz
supapup-0.1.7.tgz		supapup-0.1.7.tgz
supapup-0.1.8.tgz		supapup-0.1.8.tgz
supapup-0.1.9.tgz		supapup-0.1.9.tgz
supapup-mcp		supapup-mcp
supapup-mcp.sh		supapup-mcp.sh
test-claude-mcp.sh		test-claude-mcp.sh
test-debug-function.cjs		test-debug-function.cjs
test-import-meta.mjs		test-import-meta.mjs
tsconfig.json		tsconfig.json
yarn.lock		yarn.lock

License

ahumandev/supapup

Folders and files

Latest commit

History

Repository files navigation

Supapup 🐾

🎬 Watch Demo Video

🚀 See the Difference

❌ With Regular Puppeteer MCP

✅ With Supapup

🎥 See it in Action - Watch Demo

✨ Features

🤖 Agent-Aware Web Pages

🔍 Advanced Debugging

📊 Network Monitoring

🎯 Smart Navigation

📦 Installation

Install with npm

Using with Claude CLI

Using with Gemini CLI

As a Library

From Source

🚀 Quick Start

With Claude CLI

With Gemini CLI

🛠️ MCP Tools

Browser Management

Element Interaction

Visual Element Mapping

Debugging

Network Analysis

Page Analysis

Content Reading

📚 Examples

Testing a Website

Debugging API Errors

Visual Debugging

Finding Elements

Performance Testing

Content Reading

Automated Testing

🏗️ Architecture

ID Generation Strategy

🤝 Contributing

Development Setup

Running Tests

📄 License

🙏 Acknowledgments

🔗 Links

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages