This skill is for commercial teams who need to test which spoken script performs best, while keeping the same voice across all variants.
That matters because otherwise too many variables change at once:
This skill keeps the voice fixed and lets you vary:
Generate 4 to 8 spoken openers for the same product:
Then synthesize all of them with the same voice for fast creative screening.
Use the same host-like voice to test:
Generate multiple voice-note versions for:
This skill can generate regional phrasing styles for comparison, while keeping the same voice.
Important:
- Label the brief with 产品 / 人群 / 卖点 / 优惠 / 行动 (product / audience / selling point / offer / call to action).
- For a spoken brief, run `scripts/senseaudio_asr.py`, then `scripts/extract_spoken_brief.py`.
- For a typed brief, run `scripts/run_typed_brief_pipeline.py` directly, or call `scripts/build_voice_ab_variants.py` yourself.
- Use `scripts/build_voice_ab_variants.py` to generate variants.
- The voice is selected by `voice_id`.
- A cloned voice ID looks like `vc-...`, and can be passed directly with `--clone-voice-id`.
- Speech recognition runs through `scripts/senseaudio_asr.py` or `scripts/run_spoken_brief_pipeline.py`.
- Use `scripts/batch_tts_variants.py` to synthesize every variant with the same voice. This skill already uses AudioClaw streaming TTS under the hood and now records stream chunk metadata.
- When the voice ID is `vc-...`, the batch TTS step now auto-routes to `SenseAudio-TTS-1.5`.
- To deliver to Feishu, run `scripts/send_ab_variants_to_feishu.py` after synthesis, or use `scripts/run_spoken_brief_pipeline.py --send-feishu-audio` / `scripts/run_typed_brief_pipeline.py --send-feishu-audio`.
- The Feishu step converts `.mp3` variants into `.ogg`/`.opus` and sends them one by one as real audio messages.

Use this skill as an explicit task mode, not as a hidden background guess.
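The `vc-...` routing rule mentioned above can be pictured with a small sketch. This is not the code in `scripts/batch_tts_variants.py`; the function name `choose_tts_model` and the `default_model` parameter are hypothetical, and the only facts taken from this document are the `vc-` prefix and the `SenseAudio-TTS-1.5` model name.

```python
CLONE_PREFIX = "vc-"                     # cloned voice IDs look like vc-... (from this document)
CLONE_TTS_MODEL = "SenseAudio-TTS-1.5"   # model named in this document for cloned voices


def choose_tts_model(voice_id: str, default_model: str) -> str:
    """Return the TTS model to use for a given voice_id.

    Hypothetical helper: cloned voices are routed to SenseAudio-TTS-1.5,
    every other voice keeps whatever default the caller supplies.
    """
    if voice_id.startswith(CLONE_PREFIX):
        return CLONE_TTS_MODEL
    return default_model


if __name__ == "__main__":
    # "some-default-model" is a placeholder, not a real model name.
    print(choose_tts_model("vc-example", "some-default-model"))
    print(choose_tts_model("stock_voice", "some-default-model"))
```

Keeping the decision in one function means every variant in a batch goes through the same check, which matches the goal of holding the voice constant across variants.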
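Recommended user trigger (a Chinese-language prompt; the labels are 产品 product, 人群 audience, 卖点 selling point, 优惠 offer, 行动 call to action):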
用 $senseaudio-voice-ab-lab 处理我刚发的语音。
产品:轻量保温杯
人群:通勤上班族
卖点:轻便保温不漏水
优惠:第二件半价
行动:现在点击下单
clone voice_id:your_clone_voice_id
生成 4 条口播,输出到 /tmp/voice_ab_run
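A labeled prompt like the one above is easy to turn into structured fields. The sketch below is only an illustration and not the logic of `scripts/extract_spoken_brief.py`: the function name `parse_labeled_brief` is hypothetical, and the mapping from the Chinese labels to `product`, `audience`, `key_message`, `offer`, and `cta` is an assumption based on the field names listed elsewhere in this document.

```python
import re

# Assumed mapping from prompt labels to brief field names (not confirmed by the scripts).
LABEL_TO_FIELD = {
    "产品": "product",
    "人群": "audience",
    "卖点": "key_message",
    "优惠": "offer",
    "行动": "cta",
}

# Matches "label: value" with either an ASCII or a full-width colon.
LINE_PATTERN = re.compile(r"\s*([^::]+?)\s*[::]\s*(.+)")


def parse_labeled_brief(text: str) -> dict[str, str]:
    """Collect known labeled lines into a dict; unlabeled lines are ignored."""
    brief: dict[str, str] = {}
    for line in text.splitlines():
        match = LINE_PATTERN.match(line)
        if not match:
            continue
        label, value = match.group(1), match.group(2).strip()
        if label in LABEL_TO_FIELD:
            brief[LABEL_TO_FIELD[label]] = value
        elif label.lower() == "clone voice_id":
            brief["voice_id"] = value
    return brief


if __name__ == "__main__":
    prompt = "产品:轻量保温杯\n人群:通勤上班族\nclone voice_id:your_clone_voice_id"
    print(parse_labeled_brief(prompt))
```

Lines without a recognized label, such as the opening instruction or the output path request, are skipped rather than guessed at.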
If the user already sent a voice memo, the agent should:
- Run `scripts/run_spoken_brief_pipeline.py`.

If the user says "一条一条发语音给我听" ("send me the voice clips one by one") or "直接发到飞书里试听" ("send them straight to Feishu so I can listen"), the agent should:
- Run `scripts/send_ab_variants_to_feishu.py`, or add `--send-feishu-audio` to `scripts/run_spoken_brief_pipeline.py`.
- Send the variants as audio messages instead of replying with local paths.
- Select which variants to send with `--limit` or `--variant-ids`.

If the user gave a typed brief and also says "直接一条一条发语音给我听" ("just send me the voice clips one by one"), the agent should:
- Collect the brief fields `campaign_name`, `product`, `audience`, `key_message`, `cta`, `offer`, and `proof`.
- Run `scripts/run_typed_brief_pipeline.py`.
- Add `--send-feishu-audio`.

If the user does not provide a cloned voice, ask for a `voice_id` to use.

- Treat `regional_style` as a wording choice, not an official dialect model.
- Bring the `voice_id` into this skill for generation.

For the generation side of this skill:
- `SENSEAUDIO_API_KEY`

Practical rule:
- `scripts/run_spoken_brief_pipeline.py`, `scripts/run_typed_brief_pipeline.py`, and `scripts/batch_tts_variants.py` now default to `SENSEAUDIO_API_KEY`.
- When `SENSEAUDIO_API_KEY` arrives as a login token such as `v2.public...`, the shared bootstrap replaces it with the real `sk-...` value from `~/.audioclaw/workspace/state/senseaudio_credentials.json` before the synthesis step starts.

- `scripts/build_voice_ab_variants.py`
- `scripts/senseaudio_asr.py`: `sense-asr-deepthink` model for spoken briefs
- `scripts/extract_spoken_brief.py`
- `scripts/run_spoken_brief_pipeline.py`: `--stream-asr`, `--clone-voice-id`, and `--send-feishu-audio`
- `scripts/run_typed_brief_pipeline.py`: `--clone-voice-id` and `--send-feishu-audio`
- `scripts/batch_tts_variants.py`: one `voice_id` for every variant
- `scripts/send_ab_variants_to_feishu.py`
- `scripts/export_ab_review_csv.py`
- `references/commercial_ab_patterns.md`
- `references/asr_brief_pipeline.md`
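The practical rule about login tokens can be sketched as a small resolver. This is a rough illustration under stated assumptions, not the shared bootstrap itself: the function name `resolve_api_key` is hypothetical, and the JSON field name `api_key` is a guess, since this document does not describe the layout of `senseaudio_credentials.json`.

```python
import json
import os
from pathlib import Path

# Path taken from this document; the file layout is not documented here.
CREDENTIALS_PATH = Path("~/.audioclaw/workspace/state/senseaudio_credentials.json").expanduser()


def resolve_api_key(env_value: str,
                    credentials_path: Path = CREDENTIALS_PATH,
                    json_field: str = "api_key") -> str:
    """Return a usable sk-... key.

    If the environment value already looks like a real key, keep it.
    Otherwise (for example a v2.public... login token) read the stored
    key from the credentials file. `json_field` is an assumed field name.
    """
    if env_value.startswith("sk-"):
        return env_value
    data = json.loads(credentials_path.read_text(encoding="utf-8"))
    stored = data.get(json_field, "")
    if not stored.startswith("sk-"):
        raise ValueError(f"no sk-... key under '{json_field}' in {credentials_path}")
    return stored


if __name__ == "__main__":
    raw = os.environ.get("SENSEAUDIO_API_KEY", "")
    try:
        key = resolve_api_key(raw)
        print("resolved a key starting with", key[:3])
    except (OSError, ValueError) as exc:
        print("could not resolve key:", exc)
```

Resolving the key once, before synthesis starts, keeps every variant in a batch on the same credentials.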