文章目录

As AI coding agents become increasingly capable, their inability to reliably browse the web has become a significant bottleneck. While agents can reason about code and execute shell commands, interacting with complex, JavaScript-heavy websites — from filling out forms to handling CAPTCHAs — remains notoriously difficult. browserbase/skills is an open-source project that bridges this gap, bringing production-grade browser automation to Claude Code and other AI coding agents through a carefully designed skill system.

The project is the official open-source skills library from Browserbase, a company specializing in remote browser infrastructure. It bundles multiple specialized skills that let AI agents control headless browsers with stealth, solve CAPTCHAs, manage sessions, capture network traffic, and even perform adversarial UI testing — all through natural language CLI commands.

🔑 Key Features

  • browser skill — Full CLI-based browser automation with anti-bot stealth, CAPTCHA solving, and residential proxies. Agents can issue natural language commands like "click the login button" or "scroll down" and the browser executes them.
  • browser-trace skill — Captures a complete Chrome DevTools Protocol (CDP) firehose: screenshots, DOM dumps, network requests, and console logs. Then bisects the trace into per-page searchable buckets for easy analysis and replay.
  • site-debugger skill — An AI-powered diagnostic tool that analyzes failing automations, identifies bot detection patterns, broken selectors, timing issues, and auth problems, then generates a tested site playbook to fix them.
  • bb-usage skill — Terminal dashboard showing Browserbase usage stats, session analytics, and cost forecasts — essential for teams managing browser automation at scale.
  • Easy plugin system — One command to install for Claude Code: /plugin marketplace add browserbase/skills then /plugin install browse@browserbase. Works with Claude Code's native plugin system.

💬 English Community Discussions

The project's GitHub Issues are a genuine source of real engineering discussions. Here are some highlights:

Issue #14 — "Will this work with an Anthropic Max Subscription?" (5 comments · closed)

A user asked whether the skills work with an Anthropic Max subscription instead of requiring a separate API key. The discussion revealed the current architecture: the browser-automation tool uses @anthropic-ai/claude-agent-sdk which makes direct API calls, requiring an explicit API key. Contributors discussed the technical feasibility of wrapping Stagehand to use a Claude Code subscription instead, noting that Stagehand internally needs API access — so custom development would be needed.

Issue #63 — "Add domain-firewall skill — CDP navigation security for browser agents" (2 comments · open)

One of the most interesting recent discussions centers on security. A contributor proposed adding a domain-firewall skill that intercepts every navigation at the Chrome DevTools Protocol level, protecting browser agents from prompt injection attacks. The core concern: a hidden link on a page could trick an AI agent into navigating to an attacker-controlled URL, potentially exfiltrating session tokens. The proposed domain-firewall skill would enforce allowlists via CDP, blocking unauthorized navigations before requests ever leave the browser. This is a genuinely novel approach to an emerging attack surface for AI agents.

Issue #18 — "feat: add DevTools commands (network, console, eval)" (2 comments · closed)

A contributor proposed adding Chrome DevTools-like commands to the browser CLI — browser network to list captured network requests, browser console to read console logs, and browser eval to run JavaScript. The Browserbase team responded enthusiastically: "we're opinionated against blanket javascript evaluate tools, but love the idea of checking network requests and console logs" — a principled stance that balances power with safety.

📝 Summary

browserbase/skills represents a mature, production-focused approach to bringing browser automation to AI coding agents. With 10+ specialized skills covering everything from basic browsing to complex UI testing, it's the most comprehensive open-source skills library for Claude Code browser automation. The community is actively discussing security improvements (domain-firewall), feature expansions (DevTools commands, OpenAPI reverse-engineering), and integration with other tools. If you're building AI agents that need to interact with the web, this is a must-watch project — and it's only getting started.

View on GitHub →