Skill ID: nutrigenomics
Version: 0.3.2
Status: Beta
Author: David de Lorenzo
Requires: Python 3.11+, pandas, numpy, matplotlib, seaborn, reportlab (optional)
The Nutrigenomics generates a personalised nutrition report from consumer
genetic data (23andMe, AncestryDNA raw files or VCF). It interrogates a curated
set of nutritionally-relevant SNPs drawn from GWAS Catalog, ClinVar, and
peer-reviewed nutrigenomics literature, then translates genotype calls into
actionable dietary and supplementation guidance — all computed locally.
Key outputs
README_reproducibility.txt, environment.yml, checksums.txt, provenance.json)The Bio Orchestrator should route to this skill when the user says anything like:
.txt or .csv (23andMe), .csv (AncestryDNA), .vcf| Gene | SNP | Nutrient Impact | Evidence |
|---|---|---|---|
| --------- | ------------ | ------------------------------------------ | ---------- |
| FTO | rs9939609 | Energy balance, fat mass, carb sensitivity | Strong (GWAS) |
| PPARG | rs1801282 | Fat metabolism, insulin sensitivity | Moderate |
| APOA5 | rs662799 | Triglyceride response to dietary fat | Strong |
| TCF7L2 | rs7903146 | Carbohydrate metabolism, T2D risk | Strong |
| ADRB2 | rs1042713 | Fat oxidation, exercise × diet interaction | Moderate |
| Gene | SNP | Nutrient | Effect of risk allele |
|---|---|---|---|
| --------- | ------------ | ------------------------- | ---------------------------------- |
| MTHFR | rs1801133 | Folate / B12 | ↓ 5-MTHF conversion (~70%) |
| MTHFR | rs1801131 | Folate / B12 | ↓ enzyme activity (~30%) |
| MTR | rs1805087 | B12 / homocysteine | ↑ homocysteine risk |
| BCMO1 | rs7501331 | Beta-carotene → Vitamin A | ↓ conversion (~50%) |
| BCMO1 | rs12934922 | Beta-carotene → Vitamin A | ↓ conversion (compound het) |
| VDR | rs2228570 | Vitamin D absorption | ↓ VDR function |
| VDR | rs731236 | Vitamin D | ↓ bone mineral density response |
| GC | rs4588 | Vitamin D binding | ↑ deficiency risk |
| SLC23A1 | rs33972313 | Vitamin C transport | ↓ renal reabsorption |
| ALPL | rs1256335 | Vitamin B6 | ↓ alkaline phosphatase activity |
| Gene | SNP | Nutrient | Effect |
|---|---|---|---|
| --------- | ------------ | ---------------------- | --------------------------------- |
| FADS1 | rs174546 | LC-PUFA synthesis | ↑/↓ EPA/DHA from ALA |
| FADS2 | rs1535 | LC-PUFA synthesis | Modulates omega-6:omega-3 ratio |
| ELOVL2 | rs953413 | DHA synthesis | ↓ elongation of EPA→DHA |
| APOE | rs429358 | Saturated fat response | ε4 → ↑ LDL-C on high SFA diet |
| APOE | rs7412 | Saturated fat response | Combined with rs429358 for ε typing |
| Gene | SNP | Compound | Effect |
|---|---|---|---|
| --------- | ------------ | ------------- | -------------------------------- |
| CYP1A2 | rs762551 | Caffeine | Slow/Fast metaboliser |
| AHR | rs4410790 | Caffeine | Modulates CYP1A2 induction |
| ADH1B | rs1229984 | Alcohol | Acetaldehyde accumulation risk |
| ALDH2 | rs671 | Alcohol | Asian flush / toxicity risk |
| Gene | SNP | Sensitivity | Effect |
|---|---|---|---|
| --------- | ------------ | ---------------------- | --------------------------------- |
| MCM6 | rs4988235 | Lactose intolerance | Non-persistence of lactase |
| HLA-DQ2 | Proxy SNPs | Coeliac / gluten | HLA-DQA1/DQB1 risk haplotypes |
| Gene | SNP | Pathway | Effect |
|---|---|---|---|
| --------- | ------------ | ---------------------- | --------------------------------- |
| SOD2 | rs4880 | Manganese SOD | ↓ mitochondrial antioxidant |
| GPX1 | rs1050450 | Selenium / GSH-Px | ↓ glutathione peroxidase |
| GSTT1 | Deletion | Glutathione-S-trans | Null genotype → ↑ oxidative risk |
| NQO1 | rs1800566 | Coenzyme Q10 | ↓ CoQ10 regeneration |
| COMT | rs4680 | Catechol / B vitamins | Met/Val → methylation load |
parse_input.py)Accepts:
.txt or .csv (tab-separated: rsid, chromosome, position, genotype).csvAuto-detects format from header lines. Normalises alleles to forward strand using
a hard-coded reference table (avoids requiring external databases).
extract_genotypes.py)For each SNP in the panel:
"AT", "TT", "AA")"NOT_TESTED" if absent (common for chip-to-chip variation)score_variants.py)Each SNP is scored on a 0 / 0.5 / 1.0 scale:
0.0 — homozygous reference (lowest risk)0.5 — heterozygous1.0 — homozygous risk alleleComposite Nutrient Risk Scores (0–10) are computed per nutrient domain by
summing weighted SNP scores. Weights are derived from reported effect sizes
(beta coefficients or OR) in the primary literature.
Risk categories:
> Important caveat: These are polygenic risk indicators based on common
> variants. They are not diagnostic. Rare pathogenic variants (e.g. MTHFR
> compound heterozygosity with high homocysteine) require clinical confirmation.
generate_report.py)Outputs a structured Markdown report with:
repro_bundle.py)Exports to the output directory (not committed to the repo):
README_reproducibility.txt — step-by-step instructions to reproduce the analysis manuallyenvironment.yml — pinned conda environmentchecksums.txt — SHA-256 checksums of the SNP panel and output report (input file intentionally excluded to avoid persisting a fingerprint of genetic data)provenance.json — timestamp, version, and format arguments (input filename intentionally omitted)Note: No executable scripts are generated. The reproducibility bundle contains
only text files for documentation and integrity verification.
To run the analysis on a user-provided genetic file, execute this command directly:
python {baseDir}/openclaw_adapter.py --input <path_to_genetic_file> --format auto
To run a demo without real genetic data (synthetic patient file included with the skill):
python {baseDir}/openclaw_adapter.py --input {baseDir}/tests/synthetic_patient.csv --format 23andme
{baseDir} is replaced by OpenClaw at runtime with the absolute path to this skill's folder. Do not substitute it manually. Output is written to a timestamped directory (nutrigenomics_output_YYYYMMDD_HHMMSS/) in the current working directory and persists until manually deleted.
Supported --format values: auto (default), 23andme, ancestry, vcf.
# From 23andMe raw data
openclaw "Generate my personalised nutrition report from genome.csv"
# From VCF
openclaw "Run Nutrigenomics analysis on variants.vcf and flag any folate pathway risks"
# Targeted query
openclaw "What does my APOE status mean for my saturated fat intake?"
# Run the demo report (no real genetic data needed)
openclaw "Run a demo nutrigenomics report using the synthetic patient file"
skills/nutrigenomics/
├── SKILL.md ← this file (agent instructions)
├── nutrigenomics.py ← main entry point
├── parse_input.py ← multi-format parser
├── extract_genotypes.py ← SNP lookup engine
├── score_variants.py ← risk scoring algorithm
├── generate_report.py ← Markdown + figures
├── repro_bundle.py ← reproducibility export
├── data/
│ └── snp_panel.json ← curated SNP definitions
├── tests/
│ ├── synthetic_patient.csv ← fixed 23andMe-format test data (for pytest)
│ └── test_nutrigenomics.py ← pytest suite
└── examples/
├── generate_patient.py ← random patient generator (demo use)
├── data/ ← generated patient files land here (gitignored)
└── output/
├── nutrigenomics_report.md ← pre-rendered demo report
├── nutrigenomics_radar.png ← demo radar chart (nutrient risk profile)
└── nutrigenomics_heatmap.png ← demo gene × nutrient heatmap
> Note: Runtime output directories and randomly generated patient files are
> excluded from version control. Only the pre-rendered demo
> report in examples/output/ is committed.
All computation runs locally — no genetic data is ever transmitted to external
servers or third-party services.
What the report contains: The Markdown report includes per-SNP genotype calls
(e.g. AT, TT) for each of the 58 panel SNPs analysed. This is intentional:
knowing your specific genotype at each nutrition-related locus is what makes the
report actionable. Full raw genome data from the input file is not reproduced in
the report; only the 58 panel SNPs are included.
File persistence: Output files (report, figures, reproducibility bundle) are
written to a timestamped nutrigenomics_output_YYYYMMDD_HHMMSS/ directory under
the working directory and persist on disk until manually deleted. The input
file is read-only and is never copied into the output directory.
If you are running this skill on behalf of others or in a shared environment,
delete the output directory once the user has downloaded their results.
nutrigenomics analysis. It does not constitute medical advice.
major population. Rare pathogenic variants are out of scope.
GWAS cohorts. Risk estimates may not generalise equally across all ancestries.
diet, lifestyle, microbiome, and epigenetic state. A "high risk" score does not
mean a nutrient deficiency is present — it means the individual may benefit from
monitoring.
may not reflect individual trajectories (see Corpas 2025, *Nutrigenomics and
the Ecological Fallacy*).
This skill's SNP panel and methodology are informed by peer-reviewed nutrigenomics research. For verification and additional details, consult:
Users are encouraged to verify specific claims through these authoritative sources and with qualified healthcare providers.
The SNP panel (data/snp_panel.json) is maintained by the skill author.
To suggest additions or corrections, contact David de Lorenzo directly via
GitHub (@drdaviddelorenzo) or open
an issue on GitHub.
共 3 个版本