← 返回
未分类 中文

Bid Reader

Extracts and returns plain text from PDF, Word (.docx), and Excel (.xlsx/.xls) bid documents for analysis, search, or summarisation.
从PDF、Word(.docx)和Excel(.xlsx/.xls)投标文件中提取并返回纯文本,用于分析、搜索或摘要。
ezhencacao-dotcom ezhencacao-dotcom 来源
未分类 clawhub v1.0.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 369
下载
💾 0
安装
1
版本
#latest

概述

bid-reader Skill

Overview

A lightweight skill to extract readable text from bid and tender documents in PDF, Word (.docx), and Excel (.xlsx/.xls) formats. It can be invoked from the OpenClaw UI or other agents to quickly pull the full textual content of a file for analysis, search, or summarisation.

Usage

bid-read <file-path>
  • should be an absolute or workspace‑relative path to a document.
  • The skill prints the extracted plain‑text to stdout, which OpenClaw captures and returns to the caller.

Example

bid-read /home/zhenxing/投标文件/招投标项目1/13.上海联通/投标文件.pdf

The command returns the full text of the PDF, ready for further processing (e.g., keyword search, summarisation).

Installation

Copy the skill folder into your workspace under skills/bid-reader. Install required Python packages:

pip install -r $(pwd)/skills/bid-reader/requirements.txt

The skill is then available as an agent command.

Implementation Details

  • PDF: Uses pdfplumber to extract text page‑by‑page.
  • Word: Uses python-docx to read paragraphs.
  • Excel: Uses pandas (with openpyxl/xlrd) to read all sheets and concatenate cell values.

Limitations

  • Only .pdf, .docx, .xlsx, and .xls are supported. Other formats will be ignored.
  • Large files may take a few seconds to process.
  • Tables are flattened into whitespace‑separated rows; complex formatting is not preserved.

Future Enhancements

  • Add OCR fallback for scanned PDFs (e.g., via pytesseract).
  • Support selective page or sheet extraction.
  • Provide a JSON output mode with structural metadata.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-07 20:00 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

office-efficiency

Gog

steipete
Google Workspace 命令行工具,支持 Gmail、日历、云端硬盘、通讯录、表格和文档。
★ 929 📥 187,008
office-efficiency

Excel / XLSX

ivangdavila
创建、检查和编辑 Microsoft Excel 工作簿及 XLSX 文件,支持可靠的公式、日期、类型、格式、重算及模板保留功能。
★ 382 📥 145,368
office-efficiency

Nano Pdf

steipete
使用nano-pdf CLI通过自然语言指令编辑PDF
★ 277 📥 116,256