← 返回
开发者工具 中文

Actionbook

Activate when the user needs to interact with any website — browser automation, web scraping, screenshots, form filling, UI testing, monitoring, or building AI agents. Provides pre-verified page actions with step-by-step instructions and tested selectors.
当用户需要与任何网站交互时激活——包括浏览器自动化、网页抓取、截图、表单填写、UI测试、监控或构建AI智能体。提供预先验证的页面操作,包含分步说明和经过测试的选择器。
adcentury
开发者工具 clawhub v0.1.1 1 版本 99642.3 Key: 无需
★ 0
Stars
📥 2,786
下载
💾 68
安装
1
版本
#latest

概述

When to Use This Skill

Activate when the user's request involves interacting with a website:

Activate when the user:

  • Needs to do anything on a website ("Send a LinkedIn message", "Book an Airbnb", "Search Google for...")
  • Asks how to interact with a site ("How do I post a tweet?", "How to apply on LinkedIn?")
  • Wants to fill out forms, click buttons, navigate, search, filter, or browse on a specific site
  • Wants to take a screenshot of a web page or monitor changes
  • Builds browser-based AI agents, web scrapers, or E2E tests for external websites
  • Automates repetitive web tasks (data entry, form submission, content posting)
  • Wants to control their existing Chrome browser (Extension mode)

What Actionbook Provides

Actionbook is a library of pre-verified page interaction data. actionbook search finds actions matching a task description; actionbook get "" returns a structured document describing a page's purpose, functional capabilities, and DOM structure with inline CSS selectors — eliminating the need for runtime page structure discovery.

search and get

search — Find actions by task description

actionbook search "<query>"                      # Search by task intent
actionbook search "<query>" --domain site.com    # Filter by domain
actionbook search "<query>" --url <url>          # Filter by URL
actionbook search "<query>" -p 2 -s 20           # Pagination

Returns for each result:

  • ID — use with actionbook get "" to retrieve full details
  • Typepage (full page) or area (page section)
  • Description — page overview and function summary
  • URL — page where this action applies
  • Health Score — selector reliability percentage (0–100%)
  • Updated — last verified date

Constructing an effective search query

The query string is the primary signal for finding the right action. Pack it with the user's full intent — not just a site name or a vague keyword.

Include in the query:

  1. Target site — the website name or domain
  2. Task verb — what the user wants to do (search, book, post, filter, login, compose, etc.)
  3. Object / context — what they're acting on (listings, messages, flights, repositories, etc.)
  4. Specific details — any constraints, filters, or parameters the user mentioned (dates, location, category, language, etc.)

Rule of thumb: Rewrite the user's request as a single descriptive sentence and use that as the query.

User saysBad queryGood query
----------------------------------
"Book an Airbnb in Tokyo for next week""airbnb""airbnb search listings Tokyo dates check-in check-out guests"
"Search arXiv for recent NLP papers""arxiv search""arxiv advanced search papers NLP natural language processing recent"
"Send a LinkedIn connection request""linkedin""linkedin send connection request invite someone"
"Post a tweet with an image""twitter post""twitter compose new tweet post with image media attachment"
"Filter GitHub issues by label""github issues""github repository issues filter by label search issues"

When the user provides extra context (e.g., specific dates, a city name, a topic), fold it into the query even if it won't match a stored action literally — it helps the search engine rank relevant pages higher.

# User: "Help me apply for a software engineer job on LinkedIn"
actionbook search "linkedin job search apply software engineer application form"

# User: "I need to search for machine learning papers on arXiv"
actionbook search "arxiv advanced search papers machine learning subject category"

If --domain or --url is known, always add them — they narrow results and improve precision.

get — Retrieve full action details by ID

# Use the ID from search results directly
actionbook get "arxiv.org:/search/advanced:default"

Returns a structured document with:

  1. Page URL — exact URL and query/path parameters
  2. Page Overview — what the page does
  3. Page Function Summary — interactive capabilities (e.g., "Search Term Input", "Subject Classification Filtering")
  4. Page Structure Summary — DOM hierarchy with CSS selectors inline

Selectors appear embedded in the structure description, e.g.:

Search Term Form Section: Contains search term input field (input[type="text"]),
field selector dropdown (select[name="searchtype"]), and submit button (button.Search)

Extract CSS selectors from the structure summary for use with browser commands.

Browser Commands

Quick reference. Full details with all flags and options: command-reference.md.

Navigation

actionbook browser open <url>           # Open URL in new tab
actionbook browser goto <url>           # Navigate current page
actionbook browser back / forward       # History navigation
actionbook browser reload               # Reload page
actionbook browser pages                # List open tabs
actionbook browser switch <page_id>     # Switch tab
actionbook browser close                # Close browser

Interactions

actionbook browser click "<selector>"          # Click element
actionbook browser fill "<selector>" "text"    # Clear and type
actionbook browser type "<selector>" "text"    # Append text
actionbook browser select "<selector>" "value" # Select dropdown option
actionbook browser hover "<selector>"          # Hover
actionbook browser press Enter                 # Press key

Observation

actionbook browser text                        # Full page text
actionbook browser text "<selector>"           # Element text
actionbook browser snapshot                    # Accessibility tree (live page structure)
actionbook browser screenshot                  # Save screenshot
actionbook browser screenshot --full-page      # Full page screenshot
actionbook browser wait "<selector>"           # Wait for element
actionbook browser wait-nav                    # Wait for navigation

actionbook browser close cleans up the browser session. Skip if the user requests the browser remain open.

Examples

User request: "Search arXiv for papers about Neural Networks, search in titles only"

# 1. Search — include the full intent: site + task + subject + filter preference
actionbook search "arxiv advanced search papers neural network title field" --domain arxiv.org

# 2. Get details — read Page Structure Summary for selectors
actionbook get "arxiv.org:/search/advanced:default"
# Response includes: input[type="text"], select[name="searchtype"], button.Search, etc.

# 3. Automate using selectors from the response
actionbook browser open "https://arxiv.org/search/advanced"
actionbook browser fill "input[type='text']" "Neural Network"
actionbook browser select "select[name='searchtype']" "title"
actionbook browser click "button.Search"
actionbook browser wait-nav
actionbook browser text
actionbook browser close

Fallback

Actionbook stores page data captured at indexing time. Websites evolve, so selectors may become outdated.

When a selector from actionbook get fails at runtime, actionbook browser snapshot provides the live accessibility tree with current selectors. Use selectors from the snapshot output to retry the interaction.

Selectors used in browser commands should come from actionbook get or actionbook browser snapshot output in the current session — not from prior knowledge or memory.

If actionbook search returns no results for a page, use snapshot as the primary source, or fall back to other available tools.

References

ReferenceDescription
------------------------
command-reference.mdComplete command reference with all flags and options
authentication.mdLogin flows, OAuth, 2FA handling, session persistence

版本历史

共 1 个版本

  • v0.1.1 当前
    2026-03-28 19:32 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

developer-tools

Github

steipete
使用 `gh` CLI 与 GitHub 交互,通过 `gh issue`、`gh pr`、`gh run` 和 `gh api` 管理议题、PR、CI 运行及高级查询。
★ 668 📥 323,946
developer-tools

CodeConductor.ai

larsonreever
AI驱动平台,提供快速全栈开发、智能体、工作流自动化及低代码AI集成的可扩展产品创建。
★ 66 📥 179,977
developer-tools

Agent Browser

matrixy
专为AI智能体优化的无头浏览器自动化CLI,支持无障碍树快照和基于引用的元素选择。
★ 426 📥 118,093