← 返回
开发者工具 中文

.Publish Temp

Install, configure, validate, and run the news-fetcher Python CLI for aggregating RSS/Atom and HTML news sources with deduplication, clustering, ranking, sou...
安装、配置、验证并运行 news‑fetcher Python CLI,用于聚合 RSS/Atom 与 HTML 新闻源,实现去重、聚类、排序等功能。
miniade
开发者工具 clawhub v0.1.8 4 版本 99891.3 Key: 无需
★ 0
Stars
📥 919
下载
💾 5
安装
4
版本
#aggregation#clustering#latest#news#rss#summarization

概述

News Fetcher

Use this skill to get a working news-fetcher installation and run it correctly.

Release marker: news-fetcher-skill-0.1.8-debug-a

Important

  • Installing the ClawHub skill does not install the Python package.
  • Install the Python package separately with pip.
  • Put global options before run.

Correct:

news-fetcher --config config.yaml --limit 10 run

Wrong:

news-fetcher run --config config.yaml --limit 10

Minimal working install

python3 -m venv .venv
. .venv/bin/activate
pip install --upgrade pip
pip install "git+https://github.com/miniade/news-fetcher.git@v0.1.8"
news-fetcher version

Expect news-fetcher version 0.1.8.

Minimal working config

Generate a starter config:

news-fetcher config example > config.yaml

Or create one manually:

sources:
  - name: BBC News
    url: http://feeds.bbci.co.uk/news/rss.xml
    weight: 1.0
    type: rss

  - name: Reuters Tech
    url: https://www.reutersagency.com/feed/?best-topics=tech
    weight: 1.2
    type: rss

  - name: Example HTML Source
    url: https://example.com/news
    weight: 0.9
    type: html
    selector: main article

thresholds:
  similarity: 0.8
  min_score: 0.3
  cluster_size: 2
  max_per_source: 3

weights:
  content: 0.6
  source: 0.2
  publish_time: 0.2

Validate it:

news-fetcher config validate config.yaml

Common commands

Run with a config:

news-fetcher --config config.yaml --limit 20 run

Write Markdown output:

news-fetcher --config config.yaml --format markdown --output news.md run

Filter by time:

news-fetcher --config config.yaml --since 2026-03-01T00:00:00 run

Raise the score threshold:

news-fetcher --config config.yaml --min-score 0.5 run

Override sources directly from the CLI:

news-fetcher --sources "http://feeds.bbci.co.uk/news/rss.xml,https://news.ycombinator.com/rss" --limit 10 run

GitHub project discovery

Minimal config example:

sources:
  - name: GitHub Trending
    url: https://github.com/trending
    type: html
    source_type: github_project_discovery
    candidate_strategy: project_discovery

thresholds:
  similarity: 0.8
  min_score: 0.0
  cluster_size: 2
  max_per_source: 3

This path discovers projects from GitHub Trending, enriches repository metadata, ranks projects with GitHub-specific signals, and emits selected repositories as normal news items.

HTML sources

For type: html sources, set selector when the site does not use clean

blocks.

Example:

sources:
  - name: Example HTML Source
    url: https://example.com/news
    type: html
    selector: .story-card

Troubleshooting

If news-fetcher is missing after installing the skill, install the Python package:

pip install "git+https://github.com/miniade/news-fetcher.git@v0.1.8"

If config validation fails, run:

news-fetcher config validate config.yaml

If results are too sparse, lower thresholds.min_score or raise thresholds.max_per_source.

References

  • Tagged source install: pip install "git+https://github.com/miniade/news-fetcher.git@v0.1.8"
  • PyPI (optional, if published): https://pypi.org/project/news-fetcher/
  • Source: https://github.com/miniade/news-fetcher
  • Version targeted by this skill: 0.1.8

版本历史

共 4 个版本

  • v0.1.8 当前
    2026-05-03 03:25 安全 安全
  • v0.1.6
    2026-03-29 07:30 安全 安全
  • v0.1.3
    2026-03-26 21:41
  • v0.1.4
    2026-03-14 01:22

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

developer-tools

CodeConductor.ai

larsonreever
AI驱动平台,提供快速全栈开发、智能体、工作流自动化及低代码AI集成的可扩展产品创建。
★ 68 📥 180,200
data-analysis

Agent Lifecycle Manager

miniade
在节点上完整管理 OpenClaw 代理生命周期:创建/注册代理、配置通道绑定、可选显式继承凭据。
★ 0 📥 649
developer-tools

Github

steipete
使用 `gh` CLI 与 GitHub 交互,通过 `gh issue`、`gh pr`、`gh run` 和 `gh api` 管理议题、PR、CI 运行及高级查询。
★ 668 📥 324,188