← 返回
数据分析 Key 中文

Pronunciation Coach

Pronunciation coaching with real voice analysis using Azure Speech Services. Analyzes audio files for phoneme-level accuracy, fluency, prosody, and intonatio...
利用Azure语音服务进行发音指导,通过真实语音分析评估音素级准确度、流利度、韵律和语调。
crazybuffon
数据分析 clawhub v1.0.4 1 版本 99885.1 Key: 需要
★ 0
Stars
📥 869
下载
💾 16
安装
1
版本
#latest

概述

Pronunciation Coach

Analyze spoken English pronunciation using Azure Speech Services and provide actionable coaching feedback.

Privacy Note: This skill reads local voice messages from ~/.openclaw/media/inbound/ and transmits them to Microsoft Azure Speech Services for processing.

Prerequisites

  • Azure Speech API Key: Set AZURE_SPEECH_KEY env var
  • Azure Speech Region: Set AZURE_SPEECH_REGION env var (e.g., southeastasia)
  • ffmpeg: Required for audio format conversion (must be on PATH)
  • Node.js: Required for report generation

Workflow

1. Receive Audio

Voice messages from Telegram are stored in ~/.openclaw/media/inbound/. Find the latest .ogg file matching the message timestamp.

ls -lt ~/.openclaw/media/inbound/*.ogg | head -5

2. Run Assessment

scripts/pronunciation-assess.sh <audio_file> "<reference_text>"
  • audio_file: Path to the voice message (ogg/wav/mp3/m4a)
  • reference_text: What the speaker intended to say (from transcript)
  • The script auto-converts any format to WAV 16kHz mono

3. Generate Report

Pipe the JSON output into the report generator:

scripts/pronunciation-assess.sh audio.ogg "reference text" | node scripts/pronunciation-report.js

The report includes:

  • Overall scores (Pronunciation, Accuracy, Fluency, Prosody, Completeness)
  • Word-by-word breakdown with per-phoneme scores
  • Problem sounds highlighted
  • Verdict with actionable next steps

4. Provide Coaching

After generating the report:

  1. Send the text report to the user (scores + word breakdown)
  2. Identify top 3 problem sounds from the phoneme scores
  3. Explain each problem — what the correct sound is and how to produce it
    • See references/phoneme-guide.md for phoneme descriptions and fixes
  4. Send a voice message (via TTS) demonstrating the correct pronunciation of problem words
  5. Assign practice — give the user specific sentences to re-record focusing on weak sounds

Coaching Tips

  • Scores ≥ 90: Excellent, minor polish
  • Scores 70-89: Good, targeted practice needed
  • Scores < 70: Needs focused drill on that specific sound
  • "Omission" errors mean the word wasn't detected — speaker may have been too quiet or mumbled
  • Prosody score < 85 suggests monotone delivery — coach on intonation rises/falls
  • Compare scores across multiple recordings to track improvement

版本历史

共 1 个版本

  • v1.0.4 当前
    2026-03-29 08:36 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

data-analysis

A股量化 AkShare

mbpz
A股量化数据分析工具,基于AkShare库获取A股行情、财务数据、板块信息等。用于回答关于A股股票查询、行情数据、财务分析、选股等问题。
★ 166 📥 60,216
data-analysis

Stock Analysis

udiedrichsen
{"answer":"基于雅虎财经数据,分析股票与加密货币。支持投资组合管理、自选股预警、股息分析、8维评分、热门趋势扫描及传闻/早期信号探测。适用于股票分析、持仓追踪、财报异动、加密监控、热门股追踪或提前发掘非主流传闻。"}
★ 270 📥 57,025
data-analysis

Data Analysis

ivangdavila
{"answer":"数据分析与可视化。查询数据库、生成报告、自动化电子表格,将原始数据转化为清晰可行的见解。适用于:(1) 您……"}
★ 199 📥 65,248