← 返回
数据分析 中文

Volcano Plot Script

Generate R/Python code for volcano plots from DEG (Differentially Expressed Genes) analysis results. Triggered when user needs visualization of gene expressi...
根据差异表达基因(DEG)分析结果生成R或Python代码以绘制火山图。当用户需要基因表达可视化时触发。
ec-cyber258
数据分析 clawhub v0.1.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 501
下载
💾 20
安装
1
版本
#latest

概述

Volcano Plot Script Generator

A skill for generating publication-ready volcano plots from differential gene expression analysis results.

Overview

Volcano plots visualize the relationship between statistical significance (p-values) and magnitude of change (fold changes) in gene expression data. This skill generates customizable R or Python scripts for creating high-quality figures suitable for publications.

Use Cases

  • Visualize RNA-seq DEG analysis results
  • Identify significantly upregulated and downregulated genes
  • Highlight genes of interest (markers, pathways)
  • Generate publication-quality figures for manuscripts
  • Compare multiple experimental conditions

Input Requirements

Required input data format:

  • Gene identifier (gene symbol or ENSEMBL ID)
  • Log2 fold change values
  • Adjusted or raw p-values
  • Optional: gene annotations, pathways

Output

  • Publication-ready volcano plot (PNG/PDF/SVG)
  • Customizable R or Python script
  • Optional: labeled significant gene lists

Usage

# Example: Run the volcano plot generator
python scripts/main.py --input deg_results.csv --output volcano_plot.png

Parameters

ParameterDescriptionDefault
---------------------------------
--inputPath to DEG results CSV/TSVrequired
--outputOutput plot file pathvolcano_plot.png
--log2fc-colColumn name for log2 fold changelog2FoldChange
--pvalue-colColumn name for p-valuepadj
--gene-colColumn name for gene IDsgene
--log2fc-threshLog2 FC threshold for significance1.0
--pvalue-threshP-value threshold0.05
--label-genesFile with genes to labelNone
--top-nLabel top N significant genes10
--color-upColor for upregulated genes#E74C3C
--color-downColor for downregulated genes#3498DB
--color-nsColor for non-significant genes#95A5A6

Technical Difficulty

Medium - Requires understanding of:

  • DEG analysis concepts (fold change, p-values, FDR)
  • Data visualization principles
  • Matplotlib/ggplot2 plotting libraries

Dependencies

Python

  • pandas
  • matplotlib
  • seaborn
  • numpy

R

  • ggplot2
  • dplyr
  • ggrepel (for label positioning)

References

Author

Auto-generated skill for bioinformatics visualization.

Risk Assessment

Risk IndicatorAssessmentLevel
-----------------------------------
Code ExecutionPython/R scripts executed locallyMedium
Network AccessNo external API callsLow
File System AccessRead input files, write output plotsMedium
Instruction TamperingStandard prompt guidelinesLow
Data ExposureOutput files saved to workspaceLow

Security Checklist

  • [ ] No hardcoded credentials or API keys
  • [ ] Input file paths validated (no ../ traversal)
  • [ ] Output directory restricted to workspace
  • [ ] Script execution in sandboxed environment
  • [ ] Error messages sanitized (no stack traces exposed)
  • [ ] Dependencies audited (pandas, matplotlib, seaborn, numpy)

Prerequisites

# Python dependencies
pip install -r requirements.txt

# R dependencies (if using R)
install.packages(c("ggplot2", "dplyr", "ggrepel"))

Evaluation Criteria

Success Metrics

  • [ ] Successfully generates executable Python/R script
  • [ ] Output plot is publication-ready quality
  • [ ] Correctly identifies significant genes based on thresholds
  • [ ] Handles missing or malformed data gracefully
  • [ ] Color scheme is accessible (colorblind-friendly)

Test Cases

  1. Basic DEG Visualization: Input standard DESeq2 results → Valid volcano plot
  2. Custom Thresholds: Adjust log2FC and p-value thresholds → Correct gene classification
  3. Gene Labeling: Specify genes to label → Labels appear correctly
  4. Large Dataset: Input 20,000+ genes → Performance remains acceptable
  5. Malformed Data: Input with missing values → Graceful error handling

Lifecycle Status

  • Current Stage: Draft
  • Next Review Date: 2026-03-06
  • Known Issues: None
  • Planned Improvements:
  • Add interactive plot option (Plotly)
  • Support for multiple comparison groups
  • Integration with pathway enrichment tools

版本历史

共 1 个版本

  • v0.1.0 当前
    2026-03-30 03:32 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

data-analysis

A股量化 AkShare

mbpz
A股量化数据分析工具,基于AkShare库获取A股行情、财务数据、板块信息等。用于回答关于A股股票查询、行情数据、财务分析、选股等问题。
★ 162 📥 59,672
content-creation

Vector Text Fixer

ec-cyber258
修复PDF/SVG矢量图形乱码以便AI最终编辑。检测、替换并修复矢量文件中的乱码,保持原始格式。
★ 0 📥 493
data-analysis

Data Analysis

ivangdavila
{"answer":"数据分析与可视化。查询数据库、生成报告、自动化电子表格,将原始数据转化为清晰可行的见解。适用于:(1) 您……"}
★ 198 📥 64,855