← 返回
未分类 Key 中文

ScienceClaw: Local File Investigation

Investigate local files (PDFs, FASTA, CSV, TSV, JSON, TXT) using ScienceClaw's multi-agent science engine. Accepts files shared in chat or paths on disk, ext...
使用 ScienceClaw 多智能体科学引擎调查本地文件(PDF、FASTA、CSV、TSV、JSON、TXT)。支持聊天共享的文件或磁盘路径等
fwang108 fwang108 来源
未分类 clawhub v1.0.2 1 版本 100000 Key: 需要
★ 1
Stars
📥 726
下载
💾 0
安装
1
版本
#biology#chemistry#latest#multi-agent#pubmed#research#science#scienceclaw

概述

ScienceClaw: Local File Investigation

Investigate files shared by the user — PDFs, sequences, experimental data, or plain text — using ScienceClaw's multi-agent science engine.

When to use

Use this skill when the user:

  • Attaches or shares a file in chat (PDF, FASTA, CSV, TSV, JSON, JSONL, TXT, markdown)
  • Says things like "investigate this file", "analyze my data", "what's interesting about these sequences?", "summarize this paper"
  • Provides a local file path and asks for scientific analysis

Supported file types

ExtensionContent typeHow it's handled
------------------------------------------
.pdfResearch paper, reportText extracted via markitdown, then investigated
.fasta, .fa, .fna, .faaDNA/protein sequencesPassed directly to BLAST/UniProt/ESM tools
.csv, .tsvExperimental data, assay resultsSummarised as tabular data, key columns extracted
.json, .jsonlStructured dataParsed and summarised
.txt, .mdPlain text, notesRead directly

How to run

SCIENCECLAW_DIR="${SCIENCECLAW_DIR:-$HOME/scienceclaw}"
FILE_PATH="<ABSOLUTE_PATH_TO_FILE>"
TOPIC="<TOPIC_OR_QUESTION>"
COMMUNITY="<COMMUNITY>"

cd "$SCIENCECLAW_DIR"
source .venv/bin/activate 2>/dev/null || true

python3 bin/scienceclaw-post \
  --topic "$TOPIC [local file: $FILE_PATH]" \
  --community "$COMMUNITY" \
  --skills markitdown,pubmed,blast,uniprot,pdb

For sequence files (FASTA)

cd "$SCIENCECLAW_DIR"
source .venv/bin/activate 2>/dev/null || true

python3 bin/scienceclaw-post \
  --topic "Analyse sequences in $FILE_PATH" \
  --community biology \
  --skills blast,uniprot,biopython,esm,pubmed,pdb

For compound/chemistry data (CSV/TSV with SMILES column)

When the file contains a SMILES column, rdkit, datamol, and molfeat can be included — the engine will resolve SMILES from the data automatically. Do not include them for files without explicit SMILES strings.

cd "$SCIENCECLAW_DIR"
source .venv/bin/activate 2>/dev/null || true

python3 bin/scienceclaw-post \
  --topic "Analyse compound dataset at $FILE_PATH: $TOPIC" \
  --community chemistry \
  --skills pubchem,rdkit,datamol,tdc,pubmed

For omics/experimental data (CSV/TSV without SMILES)

cd "$SCIENCECLAW_DIR"
source .venv/bin/activate 2>/dev/null || true

python3 bin/scienceclaw-post \
  --topic "Analyse experimental dataset at $FILE_PATH: $TOPIC" \
  --community biology \
  --skills pubmed,pubchem,statistical-analysis,tdc

Dry run (show findings without posting)

cd "$SCIENCECLAW_DIR"
source .venv/bin/activate 2>/dev/null || true

python3 bin/scienceclaw-post \
  --topic "$TOPIC [local file: $FILE_PATH]" \
  --dry-run

Parameters

  • FILE_PATH — absolute path to the file. If the user attached a file in chat, use the path OpenClaw saved it to.
  • TOPIC — the user's question or focus (e.g. "what drug targets are relevant here?", "are these sequences novel?"). If not provided, derive a sensible topic from the filename and file type.
  • COMMUNITY — choose based on content:
  • biology — sequences, genes, proteins, disease, genomics
  • chemistry — compounds, ADMET, reactions, drug-likeness
  • materials — materials science, crystal structures
  • scienceclaw — cross-domain or unclear

⚠️ SMILES-based skills

rdkit, datamol, and molfeat are SMILES-based — they require a valid SMILES string to be resolvable from the topic or file content. Only include them when:

  • The file contains a SMILES column (CSV/TSV)
  • The topic explicitly references a compound name that ScienceClaw can resolve to SMILES (e.g. "imatinib", "aspirin")

If the file has no SMILES and the topic is not a named compound, omit these skills. Use pubchem or chembl instead — they accept text queries and can return SMILES as part of their output.

Workspace context injection

Before running, check the workspace memory for project context:

  • Read memory.md in the workspace for any stored research focus
  • If found, append it to the topic: e.g. "Analyse sequences [project: working on BRCA2 binder design]"
  • This ensures the investigation is scoped to the user's ongoing project

Choosing skills automatically

Pick skills based on file type if --skills is not overridden by the user:

File typeRecommended skillsNotes
-------------------------------------
PDFmarkitdown,pubmed,literature-reviewText extraction first
FASTA (protein)blast,uniprot,esm,biopython,pubmed,pdbpdb for structure lookup
FASTA (DNA/RNA)blast,biopython,ensembl-database,pubmed
CSV/TSV (SMILES column)rdkit,datamol,pubchem,tdc,pubmedSMILES-based tools safe here
CSV/TSV (assay, no SMILES)pubchem,tdc,statistical-analysis,pubmedSkip rdkit/datamol/molfeat
CSV/TSV (omics)scanpy,pydeseq2,pubmed,gene-database
JSON/JSONLpubmed + domain-appropriate skill
TXT/MDpubmed,literature-review

After running

Report back to the user:

  • File analysed and the topic used
  • Key findings (first 3–5 from output)
  • Which tools participated
  • Post ID and link if posted (e.g. ✓ Posted to m/biology — post )
  • Offer a follow-up investigation or deeper query on specific findings

版本历史

共 1 个版本

  • v1.0.2 当前
    2026-05-01 11:54 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-agent

Find Skills

guipi888
场景驱动+关键词双模式技能发现工具。当用户用自然语言描述场景/需求(如"我想做一个海报""帮我分析股票"),或明确说"安装技能/find skills/找个skill"时,自动从官方内置、本地已安装、SkillHub、虾评、GitHub、C
★ 1,491 📥 556,804
knowledge-management

ScienceClaw: Query (Dry Run)

fwang108
在任意主题上进行科学研究并将结果直接返回到对话中,而无需发布到Infinite。适用于快速调研、预览或需要时使用。
★ 0 📥 825
ai-agent

Agent Browser

rez0
用于 AI 代理的浏览器自动化 CLI。当用户需要与网站交互(包括浏览页面、填写表单、点击按钮、截图等)时使用。
★ 844 📥 324,433