← 返回
未分类 Key 中文

Aliyun Modelstudio Entry

Use when routing Alibaba Cloud Model Studio requests to the right local skill (Qwen text, coder, deep research, image, video, audio, search and multimodal sk...
用于将阿里云 Model Studio 请求路由至正确的本地技能(Qwen 文本、代码、深度研究、图像、视频、音频、搜索和多模态等)
cinience
未分类 clawhub v1.0.0 1 版本 100000 Key: 需要
★ 0
Stars
📥 283
下载
💾 0
安装
1
版本
#latest

概述

Category: task

Alibaba Cloud Model Studio Entry (Routing)

Route requests to existing local skills to avoid duplicating model/parameter details.

Prerequisites

  • Install SDK (virtual environment recommended to avoid PEP 668 restrictions):
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
  • Configure DASHSCOPE_API_KEY (environment variable preferred; or dashscope_api_key in ~/.alibabacloud/credentials).

Routing Table (currently supported in this repo)

NeedTarget skill
------
Text generation / reasoning / tool-callingskills/ai/text/aliyun-qwen-generation/
Coding / repository reasoningskills/ai/code/aliyun-qwen-coder/
Deep multi-step researchskills/ai/research/aliyun-qwen-deep-research/
Text-to-image / image generationskills/ai/image/aliyun-qwen-image/
Image editingskills/ai/image/aliyun-qwen-image-edit/
Text-to-video / image-to-video (t2v/i2v)skills/ai/video/aliyun-wan-video/
Non-Wan PixVerse video generationskills/ai/video/aliyun-pixverse-generation/
Reference-to-video (r2v)skills/ai/video/aliyun-wan-r2v/
Digital human talking / singing avatarskills/ai/video/aliyun-wan-digital-human/
Expressive portrait video (EMO)skills/ai/video/aliyun-emo/
Lightweight portrait animation (LivePortrait)skills/ai/video/aliyun-liveportrait/
Motion transfer / dancing avatar (AnimateAnyone)skills/ai/video/aliyun-animate-anyone/
Emoji / meme portrait videoskills/ai/video/aliyun-emoji/
Text-to-speech (TTS)skills/ai/audio/aliyun-qwen-tts/
Speech recognition/transcription (ASR)skills/ai/audio/aliyun-qwen-asr/
Realtime speech recognitionskills/ai/audio/aliyun-qwen-asr-realtime/
Realtime TTSskills/ai/audio/aliyun-qwen-tts-realtime/
Live speech translationskills/ai/audio/aliyun-qwen-livetranslate/
CosyVoice voice cloneskills/ai/audio/aliyun-cosyvoice-voice-clone/
CosyVoice voice designskills/ai/audio/aliyun-cosyvoice-voice-design/
Voice cloneskills/ai/audio/aliyun-qwen-tts-voice-clone/
Voice designskills/ai/audio/aliyun-qwen-tts-voice-design/
Omni multimodal interactionskills/ai/multimodal/aliyun-qwen-omni/
Visual reasoningskills/ai/multimodal/aliyun-qvq/
OCR / document parsing / table parsingskills/ai/multimodal/aliyun-qwen-ocr/
Text embeddingsskills/ai/search/aliyun-qwen-text-embedding/
Multimodal embeddingsskills/ai/search/aliyun-qwen-multimodal-embedding/
Rerankskills/ai/search/aliyun-qwen-rerank/
Vector retrievalskills/ai/search/aliyun-dashvector-search/ or skills/ai/search/aliyun-opensearch-search/ or skills/ai/search/aliyun-milvus-search/
Document understandingskills/ai/text/aliyun-docmind-extract/
Video editingskills/ai/video/aliyun-wan-edit/
Video lip-sync replacement / retalkskills/ai/video/aliyun-videoretalk/
Model list crawl/updateskills/ai/misc/aliyun-modelstudio-crawl-and-skill/

When Not Matched

  • Clarify model capability and input/output type first.
  • If capability is missing in repo, add a new skill first.

Common Missing Capabilities In This Repo (remaining gaps)

  • image translation
  • virtual try-on / digital human / advanced video personas
  • For multimodal/ASR download failures, prefer public URLs listed above.
  • For ASR parameter errors, use data URI in input_audio.data.
  • For multimodal embedding 400, ensure input.contents is an array.

Async Task Polling Template (video/long-running tasks)

When X-DashScope-Async: enable returns task_id, poll as follows:

GET https://dashscope.aliyuncs.com/api/v1/tasks/<task_id>
Authorization: Bearer $DASHSCOPE_API_KEY

Example result fields (success):

{
  "output": {
    "task_status": "SUCCEEDED",
    "video_url": "https://..."
  }
}

Notes:

  • Recommended polling interval: 15-20 seconds, max 10 attempts.
  • After success, download output.video_url.

Clarifying questions (ask when uncertain)

  1. Are you working with text, image, audio, or video?
  2. Is this generation, editing/understanding, or retrieval?
  3. Do you need speech (TTS/ASR/live translate) or retrieval (embedding/rerank/vector DB)?
  4. Do you want runnable SDK scripts or just API/parameter guidance?

References

  • Model list and links:output/alicloud-model-studio-models-summary.md
  • API/parameters/examples: see target sub-skill SKILL.md and references/*.md
  • Official source list:references/sources.md

Validation

mkdir -p output/aliyun-modelstudio-entry
echo "validation_placeholder" > output/aliyun-modelstudio-entry/validate.txt

Pass criteria: command exits 0 and output/aliyun-modelstudio-entry/validate.txt is generated.

Output And Evidence

  • Save artifacts, command outputs, and API response summaries under output/aliyun-modelstudio-entry/.
  • Include key parameters (region/resource id/time range) in evidence files for reproducibility.

Workflow

1) Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.

2) Run one minimal read-only query first to verify connectivity and permissions.

3) Execute the target operation with explicit parameters and bounded scope.

4) Verify results and save output/evidence files.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-07 15:34 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-intelligence

Volcengine Ai Audio Tts

cinience
在火山引擎音频服务上进行文本转语音生成。适用于需要配音、多语言语音输出、声音选择或TTS故障排除的场景。
★ 1 📥 2,196
data-analysis

Alicloud Ai Content Aimiaobi

cinience
使用OpenAPI/SDK管理阿里云全秒(AIMiaoBi),在用户请求阿里云秒币内容操作(如列出资源)时使用。
★ 0 📥 1,895
content-creation

Volcengine Ai Image Generation

cinience
火山引擎AI服务图像生成工作流。适用于文生图、风格变体、提示词优化、确定性图像生成参数设置及问题排查。
★ 3 📥 4,513