Communication DNA

Analyze speech and meeting transcriptions to build communication profiles — vocabulary fingerprints, filler word detection, speech patterns, commitment extra...
artofcoding-by-hanif
Uncategorized · clawhub · v1.0.1 · 1 version · 99601.6 · Key: not required

Communication DNA 🧬

Analyze transcriptions to extract communication intelligence — how people speak, what they commit to, how they compare.

Setup

The project lives at communication-dna/ in the workspace. On first use, initialize the DB:

cd <skill-dir>/scripts
python3 db.py  # Creates communication_dna.db with all tables + FTS5
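Because the database relies on FTS5, it can be worth confirming that the local SQLite build supports it before initializing. The following is a minimal sketch using only the standard library; the table name `segments_fts` and its columns are hypothetical and are not taken from db.py.

```python
import sqlite3


def fts5_available() -> bool:
    """Return True if the SQLite library linked into Python supports FTS5."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE VIRTUAL TABLE probe USING fts5(body)")
        return True
    except sqlite3.OperationalError:
        # Raised with "no such module: fts5" when FTS5 is not compiled in.
        return False
    finally:
        conn.close()


def demo_search(term: str) -> list:
    """Index two made-up segments and return the speakers matching `term`."""
    conn = sqlite3.connect(":memory:")
    try:
        # Hypothetical table layout, for illustration only.
        conn.execute("CREATE VIRTUAL TABLE segments_fts USING fts5(speaker, body)")
        conn.executemany(
            "INSERT INTO segments_fts (speaker, body) VALUES (?, ?)",
            [
                ("Alice", "I will send the budget report by Friday"),
                ("Bob", "Sounds good, let us review the roadmap next"),
            ],
        )
        rows = conn.execute(
            "SELECT speaker FROM segments_fts WHERE segments_fts MATCH ?",
            (term,),
        ).fetchall()
        return [row[0] for row in rows]
    finally:
        conn.close()


if __name__ == "__main__":
    print("FTS5 available:", fts5_available())
```

If `fts5_available()` returns False, the full-text search features are unlikely to work until Python is linked against an SQLite build that includes FTS5.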

Core Workflow

1. Ingest Transcriptions

python3 dna.py ingest <file> --title "Meeting Name" --date 2026-02-23 --context meeting
python3 dna.py ingest-dir <directory>  # Batch ingest

Supported formats:

  • TXT — Auto-detects speaker labels ("John:", "[Alice]", "Speaker 1:")
  • SRT — SubRip subtitles with timestamps
  • VTT — WebVTT with timestamps
  • JSON — Whisper/Otter.ai exports with segments

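Speaker matching is fuzzy: each detected speaker label is compared by name against existing speakers, and a close match reuses the existing speaker instead of creating a new one.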
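To illustrate the idea of label detection and fuzzy name matching, here is a rough sketch built on `re` and `difflib` from the standard library. It is not the logic in ingest.py; the patterns, the function names `split_label` and `match_speaker`, and the 0.8 cutoff are all assumptions chosen for the example.

```python
import re
from difflib import get_close_matches

# Heuristic patterns for the label styles listed above (assumed, not from ingest.py).
BRACKET = re.compile(r"^\[(?P<name>[^\]]+)\]\s*(?P<text>.*)$")               # "[Alice] text"
COLON = re.compile(r"^(?P<name>[A-Za-z][\w .'-]{0,40}?):\s*(?P<text>.*)$")   # "John: text"


def split_label(line):
    """Return (speaker, text); speaker is None when no label is detected.

    This is a heuristic: prose containing an early colon can be misread as a label.
    """
    line = line.strip()
    for pattern in (BRACKET, COLON):
        match = pattern.match(line)
        if match:
            return match.group("name").strip(), match.group("text")
    return None, line


def match_speaker(name, known, cutoff=0.8):
    """Return the closest known speaker name, or None if nothing is similar enough."""
    lookup = {k.lower(): k for k in known}  # compare case-insensitively
    hits = get_close_matches(name.lower(), list(lookup), n=1, cutoff=cutoff)
    return lookup[hits[0]] if hits else None


if __name__ == "__main__":
    known = ["John Smith", "Alice"]
    for raw in ["Jon Smith: I'll send the notes.", "[Alice] Thanks!", "Speaker 1: Hello"]:
        speaker, text = split_label(raw)
        print(speaker, "->", match_speaker(speaker, known), "|", text)
```

A stricter cutoff reduces accidental merges of different people with similar names, at the cost of creating more duplicate speakers for misspelled labels.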

2. Analyze Speakers

python3 dna.py analyze <speaker_id>    # Full report
python3 dna.py analyze-all             # All speakers
python3 dna.py fingerprint <speaker_id> # Vocabulary deep dive
python3 dna.py fillers <speaker_id>     # Filler word report
python3 dna.py patterns <speaker_id>    # Speech patterns
python3 dna.py commitments             # All extracted commitments
python3 dna.py sentiment <trans_id>    # Sentiment arc
python3 dna.py topics                  # Top topics
python3 dna.py compare <id1> <id2>    # Side-by-side comparison

3. Speaker Profiles

Auto-generated style tags based on analysis:

  • Formal/Casual, Assertive/Cautious, Inquisitive, Filler-heavy/Articulate, Optimistic/Critical, Diverse vocabulary/Repetitive

4. Cross-System Integration

python3 dna.py link-crm               # Auto-link speakers → CRM contacts
python3 dna.py link-kb                 # Cross-reference with Knowledge Base
python3 dna.py push-to-kb <trans_id>  # Push transcription to KB
python3 dna.py cross-search "query"   # Search DNA + KB together

CRM path: ../personal-crm/crm.db | KB path: ../knowledge-base/knowledge.db

5. Web UI

python3 app.py  # Starts on port 5053

Pages: Dashboard, Speakers, Speaker Profile, Transcriptions, Transcription Detail, Compare, Search, Ingest (drag & drop), Integrations.

API endpoints: /api/speakers, /api/speaker/, /api/search, /api/ingest, /api/link-crm, /api/link-kb, /api/cross-search, /api/push-to-kb/

Analysis Capabilities

Analysis | What it extracts
--- | ---
Vocabulary Fingerprint | Word frequency, type-token ratio, sentence length, formality score, unique words
Filler Detection | Rate per 100 words, filler distribution, cross-speaker comparison
Speech Patterns | Bigram/trigram phrases, question rate, hedging vs assertiveness scores
Commitments | "I'll do X", decisions, action items (stored in the extractions table)
Sentiment | Per-segment scoring (-1 to +1), arcs over time, speaker averages
Topics | TF-IDF extraction per transcription and per speaker
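Two of the metrics in the table, type-token ratio and filler rate per 100 words, are simple enough to sketch directly. The code below is an illustration under stated assumptions, not the implementation in analyze.py: the tokenizer is a basic regex, and the `FILLERS` tuple is a hypothetical stand-in for the lists in wordlists.py.

```python
import re
from collections import Counter

# Hypothetical filler list for illustration; the real lists live in wordlists.py.
FILLERS = ("um", "uh", "like", "you know", "i mean")


def tokenize(text):
    """Lowercase the text and split it into word tokens (letters and apostrophes)."""
    return re.findall(r"[a-z']+", text.lower())


def type_token_ratio(tokens):
    """Unique words divided by total words; 0.0 for empty input."""
    return len(set(tokens)) / len(tokens) if tokens else 0.0


def filler_counts(tokens, fillers=FILLERS):
    """Count occurrences of each filler, including multi-word fillers.

    Note: matching is purely lexical, so "like" used as a verb is also counted.
    """
    counts = Counter()
    for phrase in fillers:
        parts = phrase.split()
        n = len(parts)
        hits = sum(
            1 for i in range(len(tokens) - n + 1) if tokens[i:i + n] == parts
        )
        if hits:
            counts[phrase] = hits
    return counts


def filler_rate_per_100(tokens, fillers=FILLERS):
    """Filler occurrences per 100 word tokens; 0.0 for empty input."""
    if not tokens:
        return 0.0
    return 100.0 * sum(filler_counts(tokens, fillers).values()) / len(tokens)


if __name__ == "__main__":
    sample = tokenize("Um, I think, you know, we should um ship it")
    print(type_token_ratio(sample), dict(filler_counts(sample)), filler_rate_per_100(sample))
```

Normalizing fillers per 100 words rather than reporting raw counts is what makes the cross-speaker comparison meaningful, since speakers contribute different amounts of text.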

File Reference

All source files are in scripts/:

  • db.py — Schema + DB initialization
  • ingest.py — Format parsers + speaker detection
  • analyze.py — Analysis engine (6 functions + caching)
  • wordlists.py — Stop words, fillers, sentiment words, formal/informal lists
  • profiles.py — Speaker profile generator + comparison engine
  • integrations.py — CRM + KB connectors
  • dna.py — CLI (argparse, 17 subcommands)
  • app.py — Flask web UI
  • templates/ — Jinja2 templates (dark theme, Tailwind CSS)

Dependencies

  • Python 3 stdlib (no pip installs for core)
  • Flask (for web UI only)
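  • SQLite FTS5 (no extra install; available when the SQLite library behind Python's sqlite3 was compiled with FTS5, which is the case for most standard builds)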

Version History

1 version in total

  • v1.0.1 (current)
    2026-05-12 05:35, security scans: safe, safe

Security Scan

Tencent Cloud Security (Keen): safe, no risks found

Tencent Cloud Security (Sanbu): safe, no risks found
