← 返回
未分类 中文

Text Summarizer

Extractive AI text summarizer. Automatically extracts the most important sentences from any text using a hybrid TextRank + TF-IDF algorithm.
抽取式AI文本摘要工具,自动使用TextRank+TF‑IDF混合算法从任意文本中提取最重要的句子。
openlark
未分类 clawhub v1.0.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 675
下载
💾 3
安装
1
版本
#latest

概述

Text Summarizer

Condenses long text into a concise summary by extracting the most important sentences. Works on articles, research papers, reports, meeting notes, emails, and any long document. Zero hallucination risk since it extracts exact sentences from the original.

Use Cases

the user pastes text or a file and asks to summarize, shorten, condense, or extract key points; the user wants to turn a long article/paper/report/notes into a brief overview; the user says 'summarize this', 'TL;DR', 'key points', 'condense', 'extract main ideas'. Supports adjustable length (short 20% / medium 30% / long 50%) and two output formats (bullet points or flowing paragraph).

Quick Start

Default (bullet points, medium length):

summarize.py --text "<paste your text here>"

Paragraph format:

summarize.py --text "<text>" --format paragraph

Short summary (20% of original):

summarize.py --text "<text>" --length short

From a file:

summarize.py input.txt
summarize.py input.txt --length long --format paragraph

Algorithm

See references/algorithms.md for full details on the hybrid TextRank + TF-IDF approach.

TL;DR: The script scores every sentence by (1) TF-IDF term importance and (2) TextRank graph-based importance, then returns the top-ranked sentences. No AI generation — exact sentences from the original are extracted, so there is zero hallucination risk.

Length Presets

FlagRatioBest for
---------
--length short20%Headlines, quick scan
--length medium30%General-purpose (default)
--length long50%Detailed summaries

Output Formats

  • bullet (default): One extracted sentence per line, prefixed with
  • paragraph: A single flowing paragraph of extracted sentences

What to Summarize

  • Articles and blog posts
  • Research papers and academic abstracts
  • Reports and white papers
  • Meeting notes and transcripts
  • Long email threads
  • Any prose document

Limitations

  • Optimized for English prose. Code, tables, and structured data are treated as plain text.
  • Returns original text unchanged if the input has 2 or fewer sentences.
  • Single-document only (one article at a time).

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-03 06:27 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

Toutiao Graphic Publisher

openlark
通过浏览器自动化在头条发布图文内容,支持智能排版、自动生成热门标签等功能。
★ 2 📥 925

Sqlite Client

openlark
SQLite 数据库操作技能。当用户需要创建、读取、查询或修改 SQLite 数据库(.db 文件)时使用。
★ 0 📥 657

Tesseract OCR Image Text Extraction

openlark
使用 Tesseract.js(OCR)从图像中提取文字,支持多语言(包括中文、英文)识别、区域识别、白名单字符等...
★ 0 📥 599