← 返回
开发者工具 Key 中文

Step Asr

Transcribe audio files to text via Step ASR streaming API (HTTP SSE). Supports Chinese and English, multiple audio formats (PCM, WAV, MP3, OGG/OPUS), real-ti...
通过 Step ASR 流式 API (HTTP SSE) 将音频文件转录为文本。支持中英文、多种音频格式 (PCM、WAV、MP3、OGG/OPUS) 及实时...
randzero
开发者工具 clawhub v1.0.0 1 版本 100000 Key: 需要
★ 1
Stars
📥 560
下载
💾 12
安装
1
版本
#latest

概述

Step ASR - Streaming Speech-to-Text

Transcribe audio files using the Step (StepFun) ASR API with HTTP SSE streaming.

Quick start

python3 {baseDir}/scripts/transcribe.py /path/to/audio.wav

Usage examples

Basic transcription (Chinese, streaming output):

python3 {baseDir}/scripts/transcribe.py /path/to/audio.wav

Specify language and save to file:

python3 {baseDir}/scripts/transcribe.py /path/to/audio.mp3 --language en --out /tmp/transcript.txt

Use a prompt for terminology correction:

python3 {baseDir}/scripts/transcribe.py /path/to/audio.pcm --prompt "Related terms: OpenClaw, StepFun, ASR"

Output as JSON (includes usage stats):

python3 {baseDir}/scripts/transcribe.py /path/to/audio.ogg --json

Non-streaming mode (only print final result):

python3 {baseDir}/scripts/transcribe.py /path/to/audio.wav --no-stream

Specify audio format explicitly (for raw PCM files without extension):

python3 {baseDir}/scripts/transcribe.py /path/to/raw_audio --format-type pcm --sample-rate 16000

Supported audio formats

FormatExtensionsNotes
--------------------------
PCM.pcm, .rawRaw PCM, default codec pcm_s16le
WAV.wavWAV container format
MP3.mp3
OGG/OPUS.ogg, .opus

All options

FlagDefaultDescription
----------------------------
--languagezhLanguage code (zh or en)
--modelstep-asrASR model name
--out(stdout)Save transcription to file
--prompt(none)Hint text to improve accuracy for domain-specific terms
--format-type(auto)Audio format: pcm, mp3, ogg (auto-detected from extension)
--sample-rate16000Audio sample rate in Hz
--no-streamfalseOnly print the final complete result
--jsonfalseOutput as JSON with usage statistics
--no-itnfalseDisable inverse text normalization
--no-rerunfalseDisable second-pass error correction

API key

Set STEPFUN_API_KEY environment variable, or configure it in ~/.openclaw/openclaw.json:

{
  skills: {
    "step-asr": {
      apiKey: "YOUR_STEPFUN_API_KEY"
    }
  }
}

Get your API key from Step Platform.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-30 04:51 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

developer-tools

Agent Browser

matrixy
专为AI智能体优化的无头浏览器自动化CLI,支持无障碍树快照和基于引用的元素选择。
★ 427 📥 118,474
developer-tools

CodeConductor.ai

larsonreever
AI驱动平台,提供快速全栈开发、智能体、工作流自动化及低代码AI集成的可扩展产品创建。
★ 68 📥 180,601
developer-tools

Github

steipete
使用 `gh` CLI 与 GitHub 交互,通过 `gh issue`、`gh pr`、`gh run` 和 `gh api` 管理议题、PR、CI 运行及高级查询。
★ 672 📥 324,682