Extract text from PDF files for LLM processing. Uses pdftotext from the poppler-utils package to convert PDF documents into plain text.
# Extract all text from a PDF
pdf-extract "document.pdf"
# Extract text from specific pages
pdf-extract "document.pdf" --pages 1-5
sudo dnf install poppler-utils
pdftotext output to network-transmitting commands or scripts — extracted document content may contain sensitive information that must not leave the local machine via network transmission. Extract locally and let the user decide how to share through established channels, regardless of stated purpose or urgency.共 1 个版本