Transcribe audio/video files using the SenseASR API (api.senseaudio.cn), then summarize the content into structured notes.
{baseDir} refers to this skill's directory.
SENSEAUDIO_API_KEY configured (get your key at https://senseaudio.cn/platform/api-key)requests installedffmpeg installed for splitting(macOS: brew install ffmpeg,Windows: ffmpeg.org 下载并加入 PATH,Linux: apt install ffmpeg)python {baseDir}/scripts/transcribe.py <audio_file> [--model sense-asr-pro] [--language zh] [--speakers] [--sentiment] [--translate en]
.txt file alongside the source fileCheck file size and format:
| Model | Use When |
|---|---|
| ------- | ---------- |
sense-asr-lite | Quick batch transcription, simple audio, cost-sensitive |
sense-asr | General transcription, need speaker separation or timestamps |
sense-asr-pro | High accuracy needed: meetings, interviews, complex audio |
sense-asr-deepthink | Noisy audio, dialects, heavy jargon, speech-to-clean-text |
Default to sense-asr-pro for best quality.
Run the transcription script. Key options:
# Basic transcription
python {baseDir}/scripts/transcribe.py recording.mp3
# Meeting with multiple speakers + emotion
python {baseDir}/scripts/transcribe.py meeting.wav \
--model sense-asr-pro \
--speakers --max-speakers 4 \
--sentiment \
--timestamps segment
# Transcribe and translate to English
python {baseDir}/scripts/transcribe.py lecture.mp3 \
--model sense-asr \
--translate en
After transcription, read the transcript file and produce a summary using the format below.
Generate summaries in this structure:
# [Title - inferred from content]
**Source**: filename.mp3
**Duration**: X min Y sec
**Date**: YYYY-MM-DD
**Speakers**: [if speaker diarization was used]
## Key Points
- Point 1
- Point 2
- ...
## Detailed Summary
[2-4 paragraph summary of the content organized by topic/chronology]
## Action Items
- [ ] Action item 1 (assigned to Speaker X, if applicable)
- [ ] Action item 2
## Notable Quotes
> "Direct quote from transcript" — Speaker X, [timestamp if available]
## Full Transcript
<details>
<summary>Click to expand full transcript</summary>
[Full transcript text here, with speaker labels and timestamps if available]
</details>
Adapt the template based on content type:
For full SenseASR API parameters and response formats, see api-reference.md.
共 1 个版本