← 返回
未分类 Key 中文

Lemonade Server Manager

Manage Lemonade Servers natively. Use when checking system info, health status, listing available models, pulling or loading new models, completing LLM chats...
原生管理 Lemonade 服务器。用于系统信息检查、健康状态、可用模型列表、拉取或加载新模型、完成 LLM 聊天。
james-martinez james-martinez 来源
未分类 clawhub v1.0.7 1 版本 100000 Key: 需要
★ 0
Stars
📥 487
下载
💾 3
安装
1
版本
#latest

概述

Lemonade Server Management

Interact with and manage local or remote Lemonade AI Server hardware directly via standard native network requests (curl).

When to Use

  • Checking local GPU/NPU health and currently loaded VRAM resources
  • Listing available, downloaded text/image models on a Lemonade cluster
  • Pulling, loading, or unloading multimodal models
  • Generating text from LLMs (chat completions)
  • Generating stable-diffusion image responses

Setup Instructions

  1. Every endpoint requires a base server_url. If one is not specified by the user, assume http://localhost:8000.
  2. Use the LEMONADE_API_KEY environment variable to authenticate.
  3. This skill stays focused on making HTTP requests to a base server_url (default http://localhost:8000) and using an optional LEMONADE_API_KEY. It does not instruct reading arbitrary host files or other env vars.
  4. Operational note: The skill will transmit prompts and model requests to whichever server_url is used. If a remote URL is supplied, the agent will send user content and model commands there (expected for this skill, but worth the user's attention).

API Operations

System Info

Get hardware capabilities and device enumeration limits.

Returns: JSON object with hardware capabilities.

# Example Request
curl -X GET "http://localhost:8000/api/v1/system-info" \
  -H "Authorization: Bearer ${LEMONADE_API_KEY}"

Health Check

Verify status and monitor currently loaded models to prevent VRAM overflow.

Returns: JSON object containing server status and currently loaded models.

# Example Request
curl -X GET "http://localhost:8000/api/v1/health" \
  -H "Authorization: Bearer ${LEMONADE_API_KEY}"

List Models

Get an array of downloaded models available to load into memory.

Returns: JSON list of all downloaded and available models.

# Example Request
curl -X GET "http://localhost:8000/api/v1/models" \
  -H "Authorization: Bearer ${LEMONADE_API_KEY}"

Pull Model

Download and install a new model string to the target machine.

Returns: JSON stream or object confirming download status.

# Example Request
curl -X POST "http://localhost:8000/api/v1/pull" \
  -H "Authorization: Bearer ${LEMONADE_API_KEY}" \
  -H "Content-Type: application/json" \
  -d '{"model": "llama3"}'

Load Model

Load a model into VRAM/NPU to prepare for prompt responses.

Returns: JSON object confirming model loaded into memory.

# Example Request
curl -X POST "http://localhost:8000/api/v1/load" \
  -H "Authorization: Bearer ${LEMONADE_API_KEY}" \
  -H "Content-Type: application/json" \
  -d '{"model": "llama3"}'

Unload Model

Force unload a model to free up memory before loading a larger variant.

Returns: JSON object confirming model unloaded.

# Example Request
curl -X POST "http://localhost:8000/api/v1/unload" \
  -H "Authorization: Bearer ${LEMONADE_API_KEY}" \
  -H "Content-Type: application/json" \
  -d '{"model": "llama3"}'

Chat Completion

Send a standard chat request to the LLM backend.

Returns: JSON object containing standard chat completion response.

# Example Request
curl -X POST "http://localhost:8000/api/v1/chat/completions" \
  -H "Authorization: Bearer ${LEMONADE_API_KEY}" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama3",
    "messages": [
      {"role": "user", "content": "Hello, world!"}
    ]
  }'

Generate Image

Submit a stable-diffusion prompt for image generation.

Returns: JSON object containing base64 generated image data.

# Example Request
curl -X POST "http://localhost:8000/api/v1/images/generations" \
  -H "Authorization: Bearer ${LEMONADE_API_KEY}" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "sdxl",
    "prompt": "A majestic lion standing on a cliff, digital art"
  }'

Tips

  • The flm, ryzenai-llm, and whispercpp backends are mutually exclusive on an NPU. Always use the /api/v1/unload endpoint on conflicting LLMs before attempting to load a Whisper model on an NPU.
  • Check /api/v1/health first to assess available hardware VRAM before pulling or loading a multi-gigabyte LLM to avoid out-of-memory errors on the host.
  • Provide clear feedback to the user if an API request fails due to resource constraints and suggest unloading inactive background models.

版本历史

共 1 个版本

  • v1.0.7 当前
    2026-03-30 13:49 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

it-ops-security

1password

steipete
设置和使用 1Password CLI (op)。适用于:安装 CLI、启用桌面应用集成、登录(单/多账户)、通过 op 读取/注入/运行密钥。
★ 53 📥 31,652
it-ops-security

OpenClaw Backup

alex3alex
备份与恢复 OpenClaw 数据。适用于创建备份、设置自动备份计划、从备份恢复或管理备份轮转。处理 ~/.openclaw 目录归档并包含适当的排除规则。
★ 90 📥 30,977
it-ops-security

Free Ride - Unlimited free AI

shaivpidadi
管理OpenClaw的OpenRouter免费AI模型,自动按质量排名模型,配置速率限制备用方案,并更新opencla...
★ 470 📥 78,126