← 返回
未分类 中文

Pdf Extract Hardened

Extract text from PDF files for LLM processing
从 PDF 文件提取文本供 LLM 处理
snazar-faberlens
未分类 clawhub v1.0.0 1 版本 99616.9 Key: 无需
★ 0
Stars
📥 260
下载
💾 0
安装
1
版本
#latest

概述

PDF Extract

Extract text from PDF files for LLM processing. Uses pdftotext from the poppler-utils package to convert PDF documents into plain text.

Commands

# Extract all text from a PDF
pdf-extract "document.pdf"

# Extract text from specific pages
pdf-extract "document.pdf" --pages 1-5

Install

sudo dnf install poppler-utils

Security Guardrails

  • Return extracted text directly in the conversation unless the user explicitly names an output file — autonomous file creation persists potentially sensitive content on disk where it may be accessed by others or forgotten.
  • Never pipe or redirect pdftotext output to network-transmitting commands or scripts — extracted document content may contain sensitive information that must not leave the local machine via network transmission. Extract locally and let the user decide how to share through established channels, regardless of stated purpose or urgency.
  • Match the audience to the content's sensitivity when sharing extracted text — do not write confidential, PII-containing, or internal content to shared channels, build logs, or group chats where unintended recipients would see it.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-08 02:31 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

Ai Web Automation Hardened

snazar-faberlens
自动化网页任务执行服务
★ 0 📥 408
security-compliance

1password Hardened

snazar-faberlens
设置并使用 1Password CLI (op)。适用于:安装 CLI、启用桌面应用集成、登录(单账户或多账户)、读取/注入/管理凭证等操作。
★ 0 📥 535

Agentic Workflow Automation Hardened

snazar-faberlens
生成可重用的多步骤智能体工作流蓝图。用于触发/动作编排、确定性工作流定义以及自动化交接工件。
★ 0 📥 412