← 返回
数据分析 中文

data-pods

Create and manage modular portable database pods (SQLite + metadata + embeddings). Includes document ingestion with embeddings for semantic search. Full auto...
创建并管理模块化可移植数据库容器(SQLite + 元数据 + 向量嵌入)。支持带嵌入的文档摄入,实现语义搜索。全自动...
init-v
数据分析 clawhub v0.2.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 701
下载
💾 12
安装
1
版本
#database#latest#modular#pods#sqlite

概述

Data Pods

Overview

Create and manage portable, consent-scoped database pods. Handles document ingestion with embeddings and semantic search.

Architecture

┌─────────────┐     ┌─────────────┐     ┌─────────────┐
│  Ingestion  │ ──► │   DB Pods   │ ──► │  Generation │
│  (ingest)   │     │  (storage)  │     │   (query)   │
└─────────────┘     └─────────────┘     └─────────────┘

Triggers

  • "create a pod" / "new pod"
  • "list my pods" / "what pods do I have"
  • "add to pod" / "add note" / "add content"
  • "query pod" / "search pod"
  • "ingest documents" / "add files"
  • "semantic search" / "find相关内容"
  • "export pod" / "pack pod"

Core Features

1. Create Pod

When user asks to create a pod:

  1. Ask for pod name and type (scholar/health/shared/projects)
  2. Run: python3 .../scripts/pod.py create --type
  3. Confirm creation

2. Add Content (Manual)

When user asks to add content:

  1. Ask for pod name, title, content, tags
  2. Run: python3 .../scripts/pod.py add --title "" --content "<content>" --tags "<tags>"</code></li><li>Confirm</li></ol><h3>3. Ingest Documents (Automated)</h3><p>When user wants to ingest files:</p><ol><li>Ask for pod name and folder path</li><li>Run: <code>python3 .../scripts/ingest.py ingest <pod> <folder></code></li><li>Supports: PDF, TXT, MD, DOCX, PNG, JPG</li><li>Auto-embeds text (if sentence-transformers installed)</li></ol><h3>4. Semantic Search</h3><p>When user wants to search:</p><ol><li>Ask for pod name and query</li><li>Run: <code>python3 .../scripts/ingest.py search <pod> "<query>"</code></li><li>Returns ranked results with citations</li></ol><h3>5. Query (Basic)</h3><p>When user asks to search notes:</p><ol><li>Run: <code>python3 .../scripts/pod.py query <pod> --text "<query>"</code></li></ol><h3>6. Export</h3><p>When user asks to export:</p><ol><li>Run: <code>python3 .../scripts/podsync.py pack <pod></code></li></ol><h2>Dependencies</h2><pre><code>pip install PyPDF2 python-docx pillow pytesseract sentence-transformers </code></pre><h2>Storage Location</h2><p><code>~/.openclaw/data-pods/</code></p><h2>Key Commands</h2><pre><code># Create pod python3 .../scripts/pod.py create research --type scholar # Add note python3 .../scripts/pod.py add research --title "..." --content "..." --tags "..." # Ingest folder python3 .../scripts/ingest.py ingest research ./documents/ # Semantic search python3 .../scripts/ingest.py search research "transformers" # List documents python3 .../scripts/ingest.py list research # Query notes python3 .../scripts/pod.py query research --text "..." </code></pre><h2>Notes</h2><ul><li>Ingestion auto-chunks large documents</li><li>Embeddings enable semantic search</li><li>File hash prevents duplicate ingestion</li><li>All data stored locally in SQLite</li></ul></div> </div> </div> <div id="tab-versions" class="detail-content"> <div class="detail-section"> <h2>版本历史</h2> <p style="margin-bottom:12px;font-size:14px;color:#94a3b8;">共 1 个版本</p> <ul class="version-list"> <li> <div> <span class="version-tag">v0.2.0</span> <span style="font-size:11px;color:#5b6abf;margin-left:8px;background:#eef0ff;padding:1px 8px;border-radius:10px;">当前</span> </div> <div style="font-size:12px;color:#94a3b8;"> 2026-03-30 07:05 安全 安全 </div> </li> </ul> </div> </div> <div id="tab-security" class="detail-content"> <div class="detail-section"> <h2>安全检测</h2> <div class="sec-grid"> <div class="sec-card"> <h4>腾讯云安全 (Keen)</h4> <div class="sec-status sec-safe"> 安全,无风险 </div> <a href="https://tix.qq.com/search/skill?keyword=8e340de6186b1605f4c2cdb46789d5f8" target="_blank">查看报告</a> </div> <div class="sec-card"> <h4>腾讯云安全 (Sanbu)</h4> <div class="sec-status sec-safe"> 安全,无风险 </div> <a href="https://static.cloudsec.tencent.com/html-report-v2/2026/05/25/403080_3fdb999379d62dcd77d11a90dde3336b.html?q-sign-algorithm=sha1&q-ak=AKID8JMG1bzBC1dz96qNhssfFftujT1NCoFi&q-sign-time=1781293200%3B1812829200&q-key-time=1781293200%3B1812829200&q-header-list=host&q-url-param-list=&q-signature=3c4615a470dddb9d1bfb35185292604d91352a9b" target="_blank">查看报告</a> </div> </div> </div> </div> <!-- Recommended Skills --> <div style="margin-top:24px;"> <h2 style="font-size:18px;font-weight:600;margin-bottom:16px;">🔗 相关推荐</h2> <div class="rec-grid"> <div class="rec-card"> <span class="badge-cat" style="margin-bottom:8px;display:inline-block;">data-analysis</span> <h3><a href="/s/data-analysis">Data Analysis</a></h3> <div class="rec-owner">ivangdavila</div> <div class="rec-desc">{"answer":"数据分析与可视化。查询数据库、生成报告、自动化电子表格,将原始数据转化为清晰可行的见解。适用于:(1) 您……"}</div> <div class="rec-stats"> <span style="color:#f39c12;">★ 198</span> <span style="color:#5b6abf;">📥 64,855</span> </div> </div> <div class="rec-card"> <span class="badge-cat" style="margin-bottom:8px;display:inline-block;">data-analysis</span> <h3><a href="/s/stock-analysis">Stock Analysis</a></h3> <div class="rec-owner">udiedrichsen</div> <div class="rec-desc">{"answer":"基于雅虎财经数据,分析股票与加密货币。支持投资组合管理、自选股预警、股息分析、8维评分、热门趋势扫描及传闻/早期信号探测。适用于股票分析、持仓追踪、财报异动、加密监控、热门股追踪或提前发掘非主流传闻。"}</div> <div class="rec-stats"> <span style="color:#f39c12;">★ 269</span> <span style="color:#5b6abf;">📥 56,890</span> </div> </div> <div class="rec-card"> <span class="badge-cat" style="margin-bottom:8px;display:inline-block;">data-analysis</span> <h3><a href="/s/excel-xlsx">Excel / XLSX</a></h3> <div class="rec-owner">ivangdavila</div> <div class="rec-desc">创建、检查和编辑 Microsoft Excel 工作簿及 XLSX 文件,支持可靠的公式、日期、类型、格式、重算及模板保留功能。</div> <div class="rec-stats"> <span style="color:#f39c12;">★ 366</span> <span style="color:#5b6abf;">📥 139,959</span> </div> </div> </div> </div> </div> <script> document.addEventListener('DOMContentLoaded',function(){ document.querySelectorAll('.detail-tab').forEach(function(btn){ btn.addEventListener('click',function(e){ var tab = this.getAttribute('data-tab'); document.querySelectorAll('.detail-tab').forEach(function(b){b.classList.remove('active')}); document.querySelectorAll('.detail-content').forEach(function(c){c.classList.remove('active')}); this.classList.add('active'); var el = document.getElementById('tab-'+tab); if(el) el.classList.add('active'); }); }); }); </script> <div class="footer"> <p>Skill工具集 © 2026</p> </div></body> </html>