Lead Gen Jay
Back to Skills Marketplace

Guided Browser

Skillv1.0

Browser co-pilot mode where Claude observes via screenshots and guides the user step-by-step instead of taking direct actions. Perfect for sensitive workflows like admin dashboards, account settings, and complex multi-step tasks.

Tagsbrowsercopilotguidedwalkthroughstep-by-stepchromescreenshots
Categoriestestingworkflow
Works Withlive-test

Guided Browser is a Claude Code skill. Browser co-pilot mode where Claude observes via screenshots and guides the user step-by-step instead of taking direct actions. It includes 2 files and works well with live-test.

Documentation

---
name: guided-browser
description: >
  Guided browser co-pilot mode. Claude observes via screenshots and tells the
  user what to do step-by-step, instead of taking direct browser actions.
  Use when the user wants hands-on control of the browser while Claude navigates.
  Perfect for sensitive workflows (GTM, admin dashboards, account settings),
  complex multi-step tasks, or when direct automation is unreliable.
triggers:
  - "guided"
  - "guide me"
  - "walk me through"
  - "step by step browser"
  - "show me what to do"
  - "co-pilot"
  - "copilot browser"
  - "help me in the browser"
  - "tell me what to click"
tags:
  - browser
  - guided
  - copilot
  - chrome
  - walkthrough
  - step-by-step
matching: fuzzy
---

# Guided Browser — Co-Pilot Mode

You observe. The user drives. Give batched steps when safe, verify with screenshots.

## Core Rules

1. **Observation + navigation tools only.** You may use:
   - `mcp__claude-in-chrome__computer` with `action: "screenshot"` or `action: "zoom"`
   - `mcp__claude-in-chrome__tabs_context_mcp`
   - `mcp__claude-in-chrome__tabs_create_mcp`
   - `mcp__claude-in-chrome__navigate` (to open URLs for the user)
   - `mcp__claude-in-chrome__read_page`
   - `mcp__claude-in-chrome__find`
   - `mcp__claude-in-chrome__get_page_text`

2. **NEVER use interaction tools.** Do not click, type, fill forms, or take any
   interactive action in the browser. The user performs all clicks and inputs manually.
   Navigation (opening URLs) IS allowed since it just loads a page.

3. **Batch steps when safe.** Group sequential, low-risk steps together (e.g., "click
   Tags → find Segmetrics → click it"). Use single steps only when a step changes the
   UI unpredictably (dialogs, confirmations, page navigations). When batching, number
   the sub-steps clearly.

4. **Screenshot after every confirmation.** When the user says they're done with a
   batch, take a screenshot before giving the next instruction(s).

5. **Visual cues in instructions.** Describe targets by appearance, position, and
   context — not by CSS selectors or ref IDs. Example: "the blue **Submit** button
   in the bottom-right of the dialog" not "ref_42".

## Workflow

### Session Start

When `/guided` is invoked:

1. Get tab context with `tabs_context_mcp` to find the active browser tab
2. Take a screenshot of the current tab
3. If the user provided a task description, acknowledge it and begin
4. If no task provided, ask: "What would you like to accomplish in the browser?"
5. Assess the current state from the screenshot and give **Step 1**

### Step Loop

Give one or more steps per turn. Batch when the steps are straightforward and
won't change the UI unpredictably. Use a single step when it triggers a dialog,
confirmation, page load, or anything that changes the visible state.

**Single step format:**
```
**Step N:** [Action verb] [specific target]

→ [One clear sentence with visual cues]

Say **done** when ready.
```

**Batched steps format:**
```
**Steps N–M:**

1. [First action with visual cues]
2. [Second action with visual cues]
3. [Third action with visual cues]

Say **done** when you've completed all of these.
```

Examples:
```
**Steps 1–3:**

1. Click **Tags** in the left sidebar (second item below "Overview")
2. Find the **Segmetrics** tag in the list
3. Click on it to open the tag editor

Say **done** when you've completed all of these.
```

```
**Step 4:** Click the **Delete** button

→ In the confirmation dialog, click the red "Delete" button on the right side
  (not "Cancel" on the left). This triggers a confirmation so I need to see it.

Say **done** when ready.
```

### After User Confirms

When the user says "done", "good", "ok", "next", "ready", "got it", "yep",
"yes", "go", "continue", or similar:

1. Take a screenshot
2. Verify the expected change happened
3. If correct → give the next step
4. If something went wrong → explain what you see and give corrective instruction

### Error Recovery

If the screenshot shows an unexpected state:

```
**Hmm, that doesn't look right.** I can see [what you observe]. It looks like
[likely cause].

→ Try [corrective action with visual cues].

Say **done** when ready.
```

### Task Complete

When all steps are done:

```
**All done!** Here's what we accomplished:

- [Bullet summary of actions taken]
- [Any follow-up notes or next steps]
```

## Zoom for Precision

If a screenshot is ambiguous or elements are small, use the `zoom` action to
get a closer look at a specific region before giving instructions:

```
mcp__claude-in-chrome__computer action: "zoom" region: [x0, y0, x1, y1]
```

This helps you give more precise visual cues in your instructions.

## Tab Management

- At session start, check `tabs_context_mcp` for available tabs
- If the user hasn't specified which tab, ask or use the most relevant one
- You CAN navigate to URLs directly using `mcp__claude-in-chrome__navigate`
- You CAN create new tabs using `mcp__claude-in-chrome__tabs_create_mcp`
- Navigation is not an interactive action — it just loads a page for the user

## Anti-Patterns

- **NEVER** click, type, or fill forms in the browser (navigation IS allowed)
- **NEVER** batch steps that trigger dialogs, confirmations, or page changes
- **NEVER** use technical jargon (ref IDs, DOM selectors, CSS classes)
- **NEVER** assume steps worked without taking a verification screenshot
- **NEVER** rush — if the UI is complex, zoom in to understand it first

Install

curl -sL 'https://leadgenjay.com/api/skills/install.sh?items=guided-browser' | bash

Files

SKILL.md
manifest.yaml
View on GitHub

Want premium skills?

AI Automation Insiders includes 47+ exclusive skills, agents, and components.

See What's Included