← 返回
效率工具 中文

PRISM-GEN-DEMO

English: Retrieve, filter, sort, merge, and visualize multiple CSV result files from PRISM-Gen molecular generation/screening. Provides portable query-based...
检索、筛选、排序、合并和可视化来自PRISM-Gen分子生成/筛选的多个CSV结果文件。提供可移植的基于查询的
senazeng
效率工具 clawhub v2.0.0 3 版本 100000 Key: 无需
★ 0
Stars
📥 680
下载
💾 9
安装
3
版本
#latest

概述

PRISM-Gen Demo

Description

PRISM-Gen Demo is a read-only data analysis skill for exploring pre-calculated molecular screening results from the PRISM-Gen broad-spectrum coronavirus Mpro inhibitor discovery pipeline.

All analysis is performed locally on CSV files bundled with the skill. No network access, no external API calls, no credential requirements.

What This Skill Does

  • Retrieve data from any of 10 pipeline stage CSV files (step3a through step5b)
  • Filter molecules by property thresholds (pIC50, QED, MW, LogP, gap_ev, hERG_Prob, etc.)
  • Sort and rank molecules by any numeric column
  • Merge data across pipeline stages using SMILES as join keys
  • Score molecules with composite scoring, worst-case multi-target analysis, summary statistics, and Pareto front identification
  • Plot property distributions, scatter plots, docking score heatmaps, pipeline attrition funnels, and Pareto fronts
  • Summarize the full pipeline with attrition statistics and key findings

What This Skill Does NOT Do

  • Does NOT run any computational chemistry calculations (no DFT, no docking, no MD)
  • Does NOT access any network or external services
  • Does NOT require or use any credentials, API keys, or tokens
  • Does NOT modify any files outside its own output directory
  • Does NOT contain any shell scripts
  • Does NOT use hardcoded absolute paths

Commands

retrieve

Retrieve and display data from any pipeline stage.

python3 scripts/retrieve.py --stage step5b --columns name,pIC50,Broad_Spectrum_Score --max_rows 10
python3 scripts/retrieve.py --list_stages
python3 scripts/retrieve.py --stage step4a --list_columns
python3 scripts/retrieve.py --stage step5b --name mol_16

filter

Filter molecules by property thresholds. Multiple conditions are combined with AND.

python3 scripts/filter.py --stage step4a --where "pIC50>7.5" "QED>0.7" "hERG_Prob<0.5"
python3 scripts/filter.py --stage step5b --where "Broad_Spectrum_Score<-7.0" "Is_Final_Top==True"

sort

Sort and rank molecules by any column.

python3 scripts/sort.py --stage step5b --by Broad_Spectrum_Score --ascending --top 10
python3 scripts/sort.py --stage step4c --by R_global --top 20

merge

Merge data across pipeline stages on SMILES keys.

python3 scripts/merge.py --stages step3c,step4a --on smiles --columns pIC50,gap_ev,Lipinski_Pass,hERG_Prob

score

Compute composite scores, worst-case analysis, statistics, and Pareto fronts.

python3 scripts/score.py --mode worst_case --stage step5a --top 10
python3 scripts/score.py --mode composite --stage step4c --weights "pIC50:1.0,QED:0.5,R_ADMET:2.0"
python3 scripts/score.py --mode stats --stage step5b --columns pIC50,QED,Broad_Spectrum_Score
python3 scripts/score.py --mode pareto --stage step5b --obj1 pIC50 --obj2 QED

plot

Generate visualizations (requires matplotlib).

python3 scripts/plot.py --mode histogram --stage step4a --column pIC50 --output hist.png
python3 scripts/plot.py --mode scatter --stage step5b --x pIC50 --y Broad_Spectrum_Score --output scatter.png
python3 scripts/plot.py --mode heatmap --stage step5a --output heatmap.png
python3 scripts/plot.py --mode funnel --output funnel.png
python3 scripts/plot.py --mode pareto --stage step5b --x pIC50 --y QED --output pareto.png

summary

Generate a full pipeline summary report.

python3 scripts/summary.py
python3 scripts/summary.py --detailed

Available Pipeline Stages

Stage KeyDescriptionRowsColumns
---------------------------------------
step3aRL-optimized molecules (generation + surrogate scoring)20010
step3a_topTop 200 molecules by Reward20010
step3bGFN2-xTB electronic structure results2006
step3cxTB-refined ranking with GEM scoring20024
step4aADMET filtering (Lipinski, hERG, QED)20038
step4bB3LYP/6-31G* DFT validation (PySCF)4634
step4cMaster summary (all stages merged)20065
step5aBroad-spectrum docking (3 targets)3613
step5bFinal candidates with full annotations3675
step5b_masterMaster summary with docking20074

Dependencies

  • Required: Python 3.7+ (core functionality uses only Python standard library: csv, json, argparse, math, os, sys)
  • Optional: matplotlib (for visualization commands only; all non-plot commands work without it)

Security

  • Pure Python implementation — no shell scripts, no subprocess calls, no os.system calls
  • No hardcoded paths — all file paths are resolved relative to the skill root directory
  • No network access — all data is bundled locally in the data/ directory
  • No credentials — no environment variables, API keys, or tokens required
  • Read-only on input data — CSV files in data/ are never modified
  • Output isolation — generated plots are written only to user-specified paths

License

MIT-0 — Free to use, modify, and redistribute. No attribution required.

Author

@SenaZeng

Related

  • GitHub: https://github.com/SenaZeng/PRISM-Gen
  • Zenodo: https://doi.org/10.5281/zenodo.18764996

版本历史

共 3 个版本

  • v2.0.0 当前
    2026-03-29 13:32 安全 安全
  • v1.0.4
    2026-03-26 21:40
  • v1.0.1
    2026-03-11 11:00

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

productivity

Weather

steipete
获取当前天气和预报(无需API密钥)
★ 445 📥 226,202
productivity

Obsidian

steipete
操作 Obsidian 仓库(纯 Markdown 笔记)并通过 obsidian-cli 自动化。
★ 430 📥 103,735
productivity

Nano Pdf

steipete
使用nano-pdf CLI通过自然语言指令编辑PDF
★ 275 📥 114,789