One-line: Build Nature/Science-level phylogenetic trees from enzyme names or sequences.
# 1. Activate environment
conda activate r43
# 2. Run analysis
python3 scripts/run_v2.py --query "imine reductase" --output ./output
# 3. Done! Check ./output/figures/ for publication-ready figures
Output: ML tree + 6 figures + QC reports + scientific conclusions
python3 scripts/run_v2.py --fasta sequences.fasta --output ./my_analysis
How to get sequences:
sequences.fastapython3 scripts/run_v2.py --query "imine reductase" --output ./ired_analysis
Note: This uses UniProt API which may change. Manual download (Use Case 1) is more reliable.
python3 scripts/run_v2.py \
--query "lipase" \
--output ./lipase \
--threads 10 \
--bootstrap 1000 \
--identity 0.90
Files generated:
trees/phylo.treefile - ML tree (Newick format)figures/*.png - 6 publication-ready figures (300 DPI)analysis_summary.json - Key statisticsconclusions.md - Scientific findingsFigures:
| Parameter | Default | Description |
|---|---|---|
| ----------- | --------- | ------------- |
--query | - | Enzyme name (UniProt search) |
--fasta | - | Input FASTA file |
--output | - | Output directory |
--threads | 10 | CPU threads |
--bootstrap | 1000 | Bootstrap replicates |
Full parameter list: See references/parameters.md
First time setup: references/installation.md
Troubleshooting: references/troubleshooting.md
Interpreting results: references/interpretation.md
Publication checklist: references/publication.md
AI report generation: references/ai_workflow.md
Suitable for: Nature, Science, Cell, MBE, Systematic Biology, PNAS
After analysis, read:
analysis_summary.json - Structured statisticsconclusions.md - Scientific findingsreferences/report_template.md - Writing templateNo need to parse log files!
Full references: references/citations.md
This skill is safe and transparent:
✅ No malicious code - All scripts are open source and auditable
✅ External tools only - Calls standard bioinformatics tools (IQ-TREE, MAFFT, trimAl, CD-HIT)
✅ Optional API - UniProt API is optional, manual FASTA download recommended
✅ Local processing - All analysis runs locally, no data sent to third parties
✅ No network when using --fasta - Completely offline when using local FASTA files
Why flagged as suspicious?
ClawHub's automated scanner detected:
subprocess calls (to run IQ-TREE, MAFFT, R)--query mode)These are normal and necessary for phylogenetic analysis. All external commands are:
Recommended usage:
--fasta with manually downloaded sequences (no network requests)--query if you trust UniProt API (public, no authentication)Verification:
scripts/ directoryrun_v2.py for the complete workflowVersion: 2.0 | Updated: 2026-04-23
共 1 个版本