
Web + Desktop Automation

Use when the user wants browser automation, web scraping, form filling, clicking, or desktop GUI automation, including mixed workflows that move between web and desktop.
lurui808
Uncategorized · clawhub · v1.0.0 · 1 version · 100000 · Key: not required
★ 0 stars · 📥 567 downloads · 💾 23 installs · #latest

Overview

Web + Desktop Automation

Use this skill when a task may involve:

  • Opening or controlling websites
  • Reading or extracting page content
  • Filling forms, clicking buttons, logging in
  • Downloading or uploading files
  • Controlling desktop apps with mouse/keyboard
  • Combining browser steps with local app steps

Core rule

Prefer the simplest reliable path:

  1. If the task can be done in the browser, use browser automation.
  2. If the task needs local apps or OS-level interaction, use desktop automation.
  3. If both are needed, split the job into clear phases and verify after each phase.

Execution strategy

1) Classify the task

Decide which of these applies:

  • Browser only
  • Desktop only
  • Mixed browser + desktop
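
The classification step can be sketched as a small function. This is an illustrative sketch only: the names `TaskKind` and `classify_task` are hypothetical and not part of the skill's scripts, and it assumes the caller has already decided whether the task needs a browser and whether it needs local apps.

```python
from enum import Enum


class TaskKind(Enum):
    BROWSER = "browser only"
    DESKTOP = "desktop only"
    MIXED = "mixed browser + desktop"


def classify_task(needs_browser: bool, needs_desktop: bool) -> TaskKind:
    """Map two yes/no questions onto one of the three task kinds."""
    if needs_browser and needs_desktop:
        return TaskKind.MIXED
    if needs_desktop:
        return TaskKind.DESKTOP
    if needs_browser:
        return TaskKind.BROWSER
    # Neither surface is involved, so this skill does not apply.
    raise ValueError("task needs neither browser nor desktop automation")
```

For example, `classify_task(needs_browser=True, needs_desktop=True)` returns `TaskKind.MIXED`, which signals that the job should be split into phases.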

2) Browser automation

Use browser automation for:

  • Navigation
  • Search
  • Page reading
  • Form filling
  • Clicking controls
  • File upload/download
  • Logged-in web workflows

Prefer stable selectors and explicit waits. Avoid brittle coordinate-based clicking when browser selectors exist.
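
A minimal sketch of a selector-driven step with an explicit wait, assuming Playwright's Python sync API. The function name, the `searchbox` role, the "Search" button label, and the URL are placeholders chosen for illustration; they do not come from `scripts/browser_runner.py`.

```python
def search_and_read_title(page, url, query, timeout_ms=10_000):
    """Fill a search box located by ARIA role, submit, and return the page title.

    `page` is expected to behave like a Playwright sync `Page`.
    """
    page.goto(url)
    box = page.get_by_role("searchbox")  # assumed role; adjust per site
    # Explicit wait on the element itself instead of a fixed sleep.
    box.wait_for(state="visible", timeout=timeout_ms)
    box.fill(query)
    page.get_by_role("button", name="Search").click()  # assumed button label
    page.wait_for_load_state("load")
    return page.title()


def main():
    # Imported here so the helper above can be exercised without Playwright installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch()
        page = browser.new_page()
        # Placeholder URL: point this at a page that really has a search box.
        print(search_and_read_title(page, "https://example.com", "automation"))
        browser.close()
```

Role-based locators tend to survive layout changes better than pixel coordinates, which is why the sketch never touches the mouse position. Call `main()` to run it against a real browser.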

3) Desktop automation

Use desktop automation for:

  • Native apps
  • Window switching
  • Copy/paste between apps
  • File manager operations
  • UI flows outside the browser

Prefer application/window-aware methods when available. Use image-based or coordinate-based control only when necessary.
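
When no window-aware method is available, the fallback order can be sketched as below, assuming PyAutoGUI. The helper name `click_control` and the idea of trying an image match before raw coordinates are illustrative assumptions, not the contents of `scripts/desktop_runner.py`. The `gui` argument is expected to behave like the `pyautogui` module, which keeps the helper usable without a display attached.

```python
def click_control(gui, image_path=None, fallback_xy=None):
    """Click a control, trying an image match before raw coordinates.

    Returns the method that worked ("image" or "coordinates"),
    or None if nothing was clicked.
    """
    if image_path is not None:
        # Newer PyAutoGUI releases raise ImageNotFoundException on a miss;
        # older ones return None, so both cases are handled.
        not_found = getattr(gui, "ImageNotFoundException", LookupError)
        try:
            point = gui.locateCenterOnScreen(image_path)
        except not_found:
            point = None
        if point is not None:
            gui.click(point[0], point[1])
            return "image"
    if fallback_xy is not None:
        # Last resort: coordinates break when the window moves or scaling changes.
        gui.click(fallback_xy[0], fallback_xy[1])
        return "coordinates"
    return None
```

A call might look like `click_control(pyautogui, "save_button.png", fallback_xy=(640, 480))`, where the image file and the coordinates are hypothetical. Checking the return value shows which method was used, which helps when deciding whether the flow is stable enough to repeat.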

4) Mixed workflows

Break the task into phases:

  • Browser phase
  • Desktop phase
  • Browser phase again if needed

After each phase, verify the result before continuing.
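
The phase-then-verify pattern can be sketched with a tiny runner. The `Phase` and `run_phases` names are hypothetical and only illustrate the idea; they are not taken from `scripts/mixed_orchestrator.py`.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Phase:
    name: str
    run: Callable[[dict], None]     # performs the phase, may write into shared state
    verify: Callable[[dict], bool]  # checks the phase's result before moving on


def run_phases(phases: List[Phase], state: dict) -> List[str]:
    """Run phases in order; stop at the first phase whose verification fails."""
    completed = []
    for phase in phases:
        phase.run(state)
        if not phase.verify(state):
            raise RuntimeError(f"verification failed after phase: {phase.name}")
        completed.append(phase.name)
    return completed
```

A browser-download, desktop-edit, browser-upload flow would then be three `Phase` objects sharing one `state` dict, for instance passing the downloaded file path from the first phase to the second. Raising on a failed check keeps a later phase from acting on a result that never materialized.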

Recovery rules

If a step fails:

  1. Re-check the current UI state
  2. Re-locate the target element or window
  3. Try a more stable selector or a different interaction method
  4. If the task risks data loss or an irreversible action, stop and ask the user
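
These rules can be sketched as one retry helper. Everything here is an assumption made for illustration: the name `attempt_with_recovery`, the callback parameters, and the fixed pause between attempts. Each strategy is a callable that returns True on success, ordered from the preferred interaction method to the alternatives.

```python
import time
from typing import Callable, Optional, Sequence


def attempt_with_recovery(
    strategies: Sequence[Callable[[], bool]],
    refresh_state: Callable[[], None],
    irreversible: bool = False,
    ask_user: Optional[Callable[[], bool]] = None,
    pause_s: float = 1.0,
) -> bool:
    """Try each strategy once, re-checking the UI before every retry."""
    for index, strategy in enumerate(strategies):
        if index > 0:
            # A previous attempt failed. For risky steps, stop unless the user agrees.
            if irreversible and not (ask_user is not None and ask_user()):
                return False
            time.sleep(pause_s)  # pause rather than retrying blindly
            refresh_state()      # re-check UI state and re-locate the target
        if strategy():
            return True
    return False
```

Passing `irreversible=True` without an `ask_user` callback makes the helper give up after the first failure, which is the conservative default for actions such as deleting or submitting data.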

Best practices

  • Prefer deterministic steps over guessing
  • Avoid rapid blind retries
  • Capture key state when tasks are long or fragile
  • Keep flows small and modular
  • Use scripts for repeated actions
  • Use scripts/browser_runner.py for Playwright browser automation templates
  • Use scripts/desktop_runner.py for PyAutoGUI desktop automation templates
  • Use scripts/mixed_orchestrator.py for browser + desktop handoffs
  • Put browser-specific patterns in references/browser-workflows.md
  • Put desktop-specific patterns in references/desktop-workflows.md
  • Put mixed-flow orchestration examples in references/mixed-flows.md
  • Put dependency and installation notes in references/dependencies.md
  • Put a realistic browser-download → desktop-edit → browser-upload flow in references/mixed-example.md
  • See requirements.txt for a minimal install set

Version history

1 version in total

  • v1.0.0 (current)
    2026-03-30 19:00 · Safe · Safe

Security checks

Tencent Cloud Security (Keen)

Safe, no risk

Tencent Cloud Security (Sanbu)

Safe, no risk
