← 返回
未分类 Key 中文

Aliyun Qwen Asr Realtime

Use when low-latency realtime speech recognition is needed with Alibaba Cloud Model Studio Qwen ASR Realtime models, including streaming microphone input, li...
适用于需要低延迟实时语音识别的场景,搭配阿里云 Model Studio Qwen ASR 实时模型,支持流式麦克风输入等。
cinience
未分类 clawhub v1.0.0 1 版本 100000 Key: 需要
★ 0
Stars
📥 291
下载
💾 0
安装
1
版本
#latest

概述

Category: provider

Model Studio Qwen ASR Realtime

Validation

mkdir -p output/aliyun-qwen-asr-realtime
python -m py_compile skills/ai/audio/aliyun-qwen-asr-realtime/scripts/prepare_realtime_asr_request.py && echo "py_compile_ok" > output/aliyun-qwen-asr-realtime/validate.txt

Pass criteria: command exits 0 and output/aliyun-qwen-asr-realtime/validate.txt is generated.

Output And Evidence

  • Save session payloads and response samples under output/aliyun-qwen-asr-realtime/.

Critical model names

Use one of these exact model strings:

  • qwen3-asr-flash-realtime
  • qwen3-asr-flash-realtime-2026-02-10

Use cases

  • Realtime subtitles and captions
  • Voice-agent duplex input
  • Streaming speech-to-text in browser or terminal clients

Prerequisites

  • Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.
  • Realtime sessions generally require WebSocket or streaming session handling in the client.

Normalized interface (asr.realtime)

Request

  • model (string, optional): default qwen3-asr-flash-realtime
  • language_hints (array, optional)
  • format (string, optional): e.g. pcm, wav
  • sample_rate (int, optional): e.g. 16000
  • chunk_ms (int, optional): frame size in milliseconds

Response

  • text (string): recognized transcript fragment
  • is_final (bool): finalization marker
  • usage (object, optional)

Quick start

Generate a request template:

python skills/ai/audio/aliyun-qwen-asr-realtime/scripts/prepare_realtime_asr_request.py \
  --output output/aliyun-qwen-asr-realtime/request.json

Operational guidance

  • Prefer 16kHz mono PCM unless your client stack requires another format.
  • Keep chunks small enough for responsive partial results.
  • If you only have recorded files, use skills/ai/audio/aliyun-qwen-asr/ instead.

References

  • references/sources.md

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-07 16:16 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

content-creation

Volcengine Ai Image Generation

cinience
火山引擎AI服务图像生成工作流。适用于文生图、风格变体、提示词优化、确定性图像生成参数设置及问题排查。
★ 3 📥 4,502
data-analysis

Alicloud Ai Content Aimiaobi

cinience
使用OpenAPI/SDK管理阿里云全秒(AIMiaoBi),在用户请求阿里云秒币内容操作(如列出资源)时使用。
★ 0 📥 1,880
content-creation

Volcengine Ai Video Generation

cinience
火山引擎AI视频生成工作流。适用于文字生成视频、图片生成视频、生成参数调整及视频任务异步排查。
★ 0 📥 2,178