← 返回
未分类 中文

Private Knowledge Base

Store, search, and summarize concepts across your PDFs and papers with fast semantic search and cross-document Q&A.
跨文档存储、搜索与概念摘要,提供快速语义搜索和跨文档问答。
wirec-yzx wirec-yzx 来源
未分类 clawhub v1.0.0 1 版本 99858.4 Key: 无需
★ 0
Stars
📥 705
下载
💾 2
安装
1
版本
#latest

概述

Private Knowledge Base

Personal document storage and retrieval system for PDFs, papers, and research documents.

Quick Start

Ingest Documents

# Add PDF to knowledge base
./scripts/ingest.sh ~/path/to/document.pdf

# Process entire folder
./scripts/ingest-folder.sh ~/papers/

Query Knowledge Base

# Search for concept across all documents
./scripts/search.sh "transformer architecture"

# Get summary of concept from relevant docs
./scripts/summarize.sh "attention mechanism"

Core Workflows

1. Document Ingestion

When user provides new PDFs or papers:

  1. Create document entry in kb/index.json
  2. Extract text and metadata
  3. Generate embeddings for semantic search
  4. Store in kb/docs/ with normalized name

2. Cross-Document Q&A

When user asks "which document mentions X?" or "summarize X from my docs":

  1. Search embeddings for relevant passages
  2. Retrieve source documents
  3. Synthesize answer across documents
  4. Cite sources with document names and page numbers

3. Concept Linking

Build associations between documents:

  • Shared concepts
  • Citation relationships
  • Topic clusters

File Structure

private-knowledge-base/
├── SKILL.md
├── scripts/
│   ├── ingest.sh          # Single document ingestion
│   ├── ingest-folder.sh   # Batch ingestion
│   ├── search.sh          # Semantic search
│   └── summarize.sh       # Cross-document summary
├── references/
│   └── schema.md          # KB index schema
└── kb/                    # Created at runtime
    ├── index.json
    ├── embeddings/
    └── docs/

Usage Examples

User: "我之前存的文档里,哪篇提到了 transformer?"

→ Run ./scripts/search.sh "transformer"

User: "总结一下我文档里关于 attention 的内容"

→ Run ./scripts/summarize.sh "attention"

User: "把这篇 PDF 加到知识库"

→ Run ./scripts/ingest.sh

Configuration

Set knowledge base location:

export KB_ROOT=~/.openclaw/workspace/kb

Default: ~/kb if not set.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-03 04:15 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

knowledge-management

Obsidian

steipete
操作 Obsidian 仓库(纯 Markdown 笔记)并通过 obsidian-cli 自动化。
★ 444 📥 104,838
knowledge-management

Baidu web search

ide-rea
使用百度AI搜索引擎(BDSE)进行网络搜索。适用于获取实时信息、文档资料或研究课题。
★ 245 📥 107,395
knowledge-management

Summarize

paudyyin
智能摘要工具,自动为长文本、文档、网页生成摘要,提取要点与关键词,支持自定义摘要长度。
★ 957 📥 518,310