← 返回
未分类 中文

web info skill

Extract and display useful information from web pages including title, meta description, headers, and links.
从网页中提取并显示有用信息,包括标题、元描述、头部和链接。
sangjie123
未分类 clawhub v1.0.0 1 版本 99613.9 Key: 无需
★ 0
Stars
📥 258
下载
💾 0
安装
1
版本
#latest

概述

Web Info Extractor

A lightweight web scraping skill that extracts structured information from any webpage.

Features

  • Extract page title and meta description
  • List all headers (H1-H6)
  • Extract all links with their anchor text
  • Display images and their alt text
  • Show page word count
  • JSON output support for easy parsing

Usage

# Basic usage
web-info https://example.com

# Get JSON output
web-info --json https://example.com

# Extract only links
web-info --links-only https://example.com

# Extract only headers
web-info --headers-only https://example.com

Examples

Extract page info

web-info https://news.ycombinator.com

Get structured JSON data

web-info --json https://github.com > github-info.json

Find all links on a page

web-info --links-only https://example.com

Output Format

The skill provides clean, formatted output:

Title: Example Domain
Description: Example meta description
URL: https://example.com

Headers:
  H1: Example Domain
  H2: More information

Links:
  - Example Link (https://example.org)
  - Another Link (https://example.net)

Images:
  - logo.png (alt: "Company Logo")

Statistics:
  - Word count: 150
  - Links: 5
  - Images: 2

Requirements

  • curl (for fetching web pages)

Privacy & Security

  • Does not store any data
  • Only fetches publicly accessible pages
  • Follows robots.txt directives
  • No cookies or authentication stored

License

MIT-0 - Free to use, modify, and distribute

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-07 22:58 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-intelligence

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,374 📥 320,041
developer-tools

Github

steipete
使用 `gh` CLI 与 GitHub 交互,通过 `gh issue`、`gh pr`、`gh run` 和 `gh api` 管理议题、PR、CI 运行及高级查询。
★ 674 📥 325,151
ai-intelligence

self-improving agent

pskoett
捕获经验教训、错误及修正内容,以实现持续改进。适用于以下场景:(1)命令或操作意外失败;(2)用户纠正Claude(如“不,那不对……”“实际上……”);(3)用户请求的功能不存在;(4)外部API或工具出现故障;(5)Claude发现自身
★ 4,076 📥 807,680