← 返回
未分类 Key 中文

Mmx

Multimodal content generation and analysis via MiniMax CLI, including text chat, image/video creation, speech synthesis, music, vision, and web search with A...
通过 MiniMax CLI 进行多模态内容生成与分析,支持文字聊天、图像/视频创作、语音合成、音乐、视觉与网页搜索等功能
ariffazil ariffazil 来源
未分类 clawhub v1.0.0 1 版本 100000 Key: 需要
★ 0
Stars
📥 357
下载
💾 1
安装
1
版本
#latest

概述

MMX — MiniMax Multimodal CLI

Full reference for mmx CLI. For deep research workflows, use mmx-text-researcher skill instead.

Auth Check (First)

mmx auth status
# If not authenticated:
mmx auth login --api-key <your-api-key>
# Check region:
mmx config show

Text Chat

# Basic
mmx text chat --message "What is MiniMax?"

# Streaming
mmx text chat --model MiniMax-M2.7-highspeed --message "Hello" --stream

# With system prompt
mmx text chat --system "You are a coding assistant" --message "Fizzbuzz in Go"

# Multi-turn (conversation history)
mmx text chat --message "user:Hi" --message "assistant:Hey!" --message "How are you?"

# JSON output
mmx text chat --message "Extract key facts as JSON" --output json

# From file
cat messages.json | mmx text chat --messages-file - --output json

Models

  • MiniMax-M2.7 — standard
  • MiniMax-M2.7-highspeed — faster response
  • MiniMax-Text-01 — best for research/synthesis

Image Generation

# Simple
mmx image "A cat in a spacesuit"

# With options
mmx image generate --prompt "A cat" --n 3 --aspect-ratio 16:9

# Output to directory
mmx image generate --prompt "Logo" --out-dir ./out/

# Available aspect ratios: 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9

Video Generation

# Async (start and track)
mmx video generate --prompt "Ocean waves at sunset"

# Async with progress tracking
mmx video generate --prompt "A robot painting" --async

# Get task status
mmx video task get --task-id <task-id>

# Download completed video
mmx video download --file-id <file-id> --out video.mp4
mmx video generate --prompt "Ocean waves at sunset" --download sunset.mp4

Speech / TTS

# Basic synthesis
mmx speech synthesize --text "Hello!" --out hello.mp3

# Streaming playback (pipe to mpv)
mmx speech synthesize --text "Stream me" --stream | mpv -

# Voice selection + speed
mmx speech synthesize --text "Hi" --voice English_magnetic_voiced_man --speed 1.2

# List available voices
mmx voices

# From stdin
echo "Breaking news" | mmx speech synthesize --text-file - --out news.mp3

Music Generation

# With lyrics
mmx music generate --prompt "Upbeat pop" --lyrics "[verse] La da dee, sunny day" --out song.mp3

# Auto-generate lyrics from prompt
mmx music generate --prompt "Indie folk, melancholic, rainy night" --lyrics-optimizer --out song.mp3

# Instrumental (no vocals)
mmx music generate --prompt "Cinematic orchestral" --instrumental --out bgm.mp3

# Cover (generate cover from reference audio)
mmx music cover --prompt "Jazz, piano, warm female vocal" --audio-file original.mp3 --out cover.mp3

# Cover from URL
mmx music cover --prompt "Indie folk" --audio https://example.com/song.mp3 --out cover.mp3

Vision / Image Understanding

# Local file
mmx vision photo.jpg

# Describe with custom prompt
mmx vision describe --image https://example.com/img.jpg --prompt "What breed?"

# From file-id
mmx vision describe --file-id file-123

Web Search

# Basic
mmx search "MiniMax AI"

# Structured JSON output
mmx search query --q "latest news" --output json

Utility

# Check quota
mmx quota

# Show config
mmx config show

# Set region (global or cn)
mmx config set --key region --value cn

# Set default model
mmx config set --key default-text-model --value MiniMax-M2.7-highspeed

# Export schema
mmx config export-schema | jq .

# Update CLI
mmx update
mmx update latest

Common Use Cases

TaskCommand
------
Generate image for Arif's geology vizmmx image generate --prompt "" --aspect-ratio 16:9 --out-dir ./output/
Create video clipmmx video generate --prompt "" --async
TTS voice messagemmx speech synthesize --text "" --voice --out voice.mp3
Compose background musicmmx music generate --prompt "" --instrumental --out bgm.mp3
Analyze geology photommx vision
Quick fact checkmmx search ""
Research synthesisUse mmx-text-researcher skill instead

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-07 16:24 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

design-media

Openai Whisper

steipete
使用 Whisper CLI 进行本地语音转文字(无需 API 密钥)
★ 333 📥 94,259
design-media

UI/UX Pro Max

xobi667
提供 UI/UX 设计智能与实现指导,帮助打造精美界面。适用于 UI 设计、UX 流程、信息架构、视觉风格、设计系统/标记、组件规格、文案/微文案、无障碍及前端 UI(HTML/CSS/JS、React、Next.js、Vue、Svelte
★ 222 📥 48,203
ai-agent

Chain Reason

ariffazil
针对需要3步以上逻辑推理、权衡分析或明确解释的复杂问题,提供详细、可审计的多步骤推理过程。
★ 0 📥 578