← 返回
未分类 Key 中文

asr

Transcribe audio from URLs or local files into text using multilingual automatic speech recognition via SkillBoss API Hub.
利用 SkillBoss API Hub 的多语言自动语音识别,将 URL 或本地文件中的音频转录为文字
kirkraman kirkraman 来源
未分类 clawhub v1.0.0 1 版本 100000 Key: 需要
★ 0
Stars
📥 348
下载
💾 0
安装
1
版本
#ai#latest

概述

ASR (Automatic Speech Recognition) Skill

Fast, accurate automatic speech-to-text transcription powered by SkillBoss API Hub.

Why use this skill?

  • Multilingual: Supports 100 languages with auto-detection.
  • Flexible Input: Transcribe from a URL or a local file.
  • Agent-Ready: Designed for high-volume, automated pipelines.
  • Unified API: Powered by SkillBoss API Hub — single key, single endpoint.

Setup

1. Get an API Key

Sign up at skillbossai.com to obtain your SKILLBOSS_API_KEY.

2. Configure Authentication

This skill looks for your API key in the SKILLBOSS_API_KEY environment variable.

Add this to your .env or agent config:

SKILLBOSS_API_KEY=your_key_here

Usage

TL;DR for Agents

When this skill is installed, you can transcribe any URL or local file by running:

./skills/asr/scripts/asr.sh transcribe --url "https://example.com/audio.mp3"

Transcribe a URL

# Basic transcription
./skills/asr/scripts/asr.sh transcribe --url "https://example.com/audio.mp3"

# With language hint
./skills/asr/scripts/asr.sh transcribe --url "https://example.com/audio.mp3" --language "en"

Transcribe a Local File

# Upload and transcribe local media
./skills/asr/scripts/asr.sh transcribe --file "./local-audio.wav"

Supported Options

  • --language : ISO language code (e.g., 'en', 'es')

Output

Returns a JSON response. The transcription text is in:

.result.text

If the SKILLBOSS_API_KEY is missing, the tool will provide a clear error message.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-07 08:27 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

knowledge-management

summarize

kirkraman
使用 summarize CLI 对 URL、本地文件(PDF、图片、音频)以及 YouTube 链接进行摘要,支持自定义长度和模型选项。
★ 0 📥 630
design-media

UI/UX Pro Max

xobi667
提供 UI/UX 设计智能与实现指导,帮助打造精美界面。适用于 UI 设计、UX 流程、信息架构、视觉风格、设计系统/标记、组件规格、文案/微文案、无障碍及前端 UI(HTML/CSS/JS、React、Next.js、Vue、Svelte
★ 224 📥 48,537
design-media

Openai Whisper

steipete
使用 Whisper CLI 进行本地语音转文字(无需 API 密钥)
★ 333 📥 94,466