← 返回
未分类 中文

Phylo Tree

Generate publication-quality maximum likelihood phylogenetic trees and figures from enzyme names or FASTA sequences with advanced model selection and bootstr...
基于酶名称或FASTA序列,生成符合发表标准的最大似然系统发育树和图表,支持高级模型选择与自举法。
billwanttobetop
未分类 clawhub v1.0.2 1 版本 100000 Key: 无需
★ 0
Stars
📥 340
下载
💾 0
安装
1
版本
#latest

概述

PhyloTree | Publication-Grade Phylogenetic Analysis

One-line: Build Nature/Science-level phylogenetic trees from enzyme names or sequences.


🚀 Quick Start (3 steps)

# 1. Activate environment
conda activate r43

# 2. Run analysis
python3 scripts/run_v2.py --query "imine reductase" --output ./output

# 3. Done! Check ./output/figures/ for publication-ready figures

Output: ML tree + 6 figures + QC reports + scientific conclusions


📋 Common Use Cases

Use Case 1: Analyze from FASTA file (Recommended)

python3 scripts/run_v2.py --fasta sequences.fasta --output ./my_analysis

How to get sequences:

  1. Go to UniProt: https://www.uniprot.org/
  2. Search for your enzyme (e.g., "imine reductase")
  3. Click "Download" → "FASTA (canonical)"
  4. Save as sequences.fasta

Use Case 2: Analyze by enzyme name (requires UniProt API)

python3 scripts/run_v2.py --query "imine reductase" --output ./ired_analysis

Note: This uses UniProt API which may change. Manual download (Use Case 1) is more reliable.

Use Case 3: Custom parameters

python3 scripts/run_v2.py \
  --query "lipase" \
  --output ./lipase \
  --threads 10 \
  --bootstrap 1000 \
  --identity 0.90

📊 What You Get

Files generated:

  • trees/phylo.treefile - ML tree (Newick format)
  • figures/*.png - 6 publication-ready figures (300 DPI)
  • analysis_summary.json - Key statistics
  • conclusions.md - Scientific findings

Figures:

  1. Main tree (rectangular layout)
  2. Circular tree
  3. Heatmap tree (branch length gradient)
  4. Branch length distribution
  5. Genus distribution
  6. Combined multi-panel

🔧 Key Parameters

ParameterDefaultDescription
---------------------------------
--query-Enzyme name (UniProt search)
--fasta-Input FASTA file
--output-Output directory
--threads10CPU threads
--bootstrap1000Bootstrap replicates

Full parameter list: See references/parameters.md


📖 Need More?

First time setup: references/installation.md

Troubleshooting: references/troubleshooting.md

Interpreting results: references/interpretation.md

Publication checklist: references/publication.md

AI report generation: references/ai_workflow.md


✅ Quality Standards

  • ✅ IQ-TREE ML + ModelFinder (1232 models)
  • ✅ UFBoot2 + SH-aLRT ≥ 1000
  • ✅ Alignment trimming (trimAl)
  • ✅ Deduplication (CD-HIT 90%)
  • ✅ 300 DPI figures
  • ✅ Nature/Science color schemes

Suitable for: Nature, Science, Cell, MBE, Systematic Biology, PNAS


🤖 For AI Agents

After analysis, read:

  1. analysis_summary.json - Structured statistics
  2. conclusions.md - Scientific findings
  3. references/report_template.md - Writing template

No need to parse log files!


📚 References

  1. Nguyen et al. (2015). IQ-TREE. Mol Biol Evol 32:268-274.
  2. Hoang et al. (2018). UFBoot2. Mol Biol Evol 35:518-522.
  3. Kalyaanamoorthy et al. (2017). ModelFinder. Nat Methods 14:587-589.
  4. Yu et al. (2017). ggtree. Methods Ecol Evol 8:28-36.

Full references: references/citations.md


🔒 Security & Privacy

This skill is safe and transparent:

No malicious code - All scripts are open source and auditable

External tools only - Calls standard bioinformatics tools (IQ-TREE, MAFFT, trimAl, CD-HIT)

Optional API - UniProt API is optional, manual FASTA download recommended

Local processing - All analysis runs locally, no data sent to third parties

No network when using --fasta - Completely offline when using local FASTA files

Why flagged as suspicious?

ClawHub's automated scanner detected:

  • subprocess calls (to run IQ-TREE, MAFFT, R)
  • Optional network requests (UniProt API for --query mode)
  • File system operations (creating output directories)

These are normal and necessary for phylogenetic analysis. All external commands are:

  • Standard bioinformatics tools (installed via conda)
  • Called with explicit arguments (no shell injection)
  • Logged for transparency

Recommended usage:

  • Use --fasta with manually downloaded sequences (no network requests)
  • Only use --query if you trust UniProt API (public, no authentication)

Verification:

  • Review all scripts in scripts/ directory
  • Check run_v2.py for the complete workflow
  • All external commands are documented in SKILL.md

Version: 2.0 | Updated: 2026-04-23

版本历史

共 1 个版本

  • v1.0.2 当前
    2026-05-07 17:58 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

Protein Phylogeny

billwanttobetop
全面的蛋白质家族系统发育分析工作流,包含质量控制、保守性分析、共进化网络分析以及可直接出版的结果...
★ 0 📥 430

Auto Proteomics

billwanttobetop
公共OpenClaw技能,用于低token路由及对已处理的DDA LFQ蛋白质组学输入进行下游分析。适用于用户已有蛋白质水平定量...
★ 0 📥 556

Automd Gromacs

billwanttobetop
AutoMD‑GROMACS:面向AI的GROMACS分子动力学自动化,包含工作流、增强采样、特殊系统模拟、高级分析与出版级输出。
★ 2 📥 822