AgentBrowse is the browser layer for agent tasks that happen on a real
website.
Use this skill when the agent needs to:
AgentBrowse works well on its own for browser automation. It can also be
paired with MagicPay later when a broader flow reaches an approved login,
identity, or payment step.
Open source:
https://github.com/MercuryoAI/agentbrowse@mercuryo-ai/agentbrowse-cliagentbrowse must be available on PATH. If it is missing or outdated, run npm i -g @mercuryo-ai/agentbrowse-cli@latest, then verify with
agentbrowse --version.
agentbrowse launch needs an environment that can start a browser. agentbrowse attach needs a reachable CDP endpoint.
launch, attach, navigate, act, browser-status, screenshot, and close do not need any API key.
observe with a natural-language goal and extract — call an LLM through the gateway. Configure API access with
agentbrowse init before using them. Pass a non-default API
URL during init if needed.
agentbrowse doctor inspects the local config. Use it after init when AI-assisted observe or extract still fails.
agentbrowse launch [url] or agentbrowse attach .
agentbrowse observe.agentbrowse act [value] .agentbrowse observe after navigation or meaningful UI changes.agentbrowse navigate when the destination is already known.agentbrowse extract '' [scopeRef] when you needstructured output instead of another page action.
agentbrowse screenshot or agentbrowse browser-status only forevidence and debugging.
agentbrowse close when the browser session is no longerneeded.
Bring in a companion protected-flow tool when the site reaches:
At that point AgentBrowse can stay the browsing layer around the protected
step, but it should not invent its own secret-handling flow.
Ask the user only when:
happened.
close is only teardown or recovery. Never treat close as a success signal — task success comes from the visible page state before close.
Open an extra reference only when it helps:
If a term (session, ref, targetRef, scopeRef, fillRef, pageRef)
is unfamiliar, check the
共 1 个版本