Hugging Face

Discover, evaluate, and run Hugging Face models, datasets, and spaces with license checks, benchmark prompts, and reproducible integration plans.
Author: ivangdavila
Developer tools · clawhub · v1.0.0 · 1 version · 100000 · API key: required

Overview

Setup

On first use, read setup.md for integration guidelines and local memory initialization.

When to Use

User needs to find the right Hugging Face model, dataset, or Space for a concrete task and move from browsing to reliable execution.

Agent handles discovery, filtering, license checks, quick benchmarking, and integration-ready inference plans.

Architecture

Memory and reusable artifacts live in ~/hugging-face/. See memory-template.md for structure and status fields.

~/hugging-face/
|- memory.md          # Stable context, priorities, and defaults
|- shortlists.md      # Candidate models and datasets by use case
|- evaluations.md     # Benchmark runs, winners, and caveats
|- endpoints.md       # Approved endpoints and auth notes
`- exports/           # Saved outputs and comparison snapshots
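
The layout above could be initialized with a short script. This is a minimal sketch, not the skill's own setup routine: the function name `init_memory` is hypothetical, and it assumes empty placeholder files are acceptable until memory-template.md is applied.

```python
from pathlib import Path

# File names taken from the layout above; exports/ is handled separately.
MEMORY_FILES = ("memory.md", "shortlists.md", "evaluations.md", "endpoints.md")


def init_memory(root: Path) -> list[Path]:
    """Create the memory layout under `root` and return the newly created paths.

    Existing files are left untouched, so the call is safe to repeat.
    """
    created: list[Path] = []
    root.mkdir(parents=True, exist_ok=True)
    for name in MEMORY_FILES:
        path = root / name
        if not path.exists():
            path.touch()  # empty placeholder (assumption: filled in later)
            created.append(path)
    exports = root / "exports"
    if not exports.exists():
        exports.mkdir()
        created.append(exports)
    return created


if __name__ == "__main__":
    for path in init_memory(Path.home() / "hugging-face"):
        print("created", path)
```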

Quick Reference

Load only one focused file at a time to keep context small and decisions explicit.

| Topic | File |
|-------|------|
| Setup process | setup.md |
| Memory template | memory-template.md |
| Model and dataset discovery | discovery.md |
| Inference execution patterns | inference.md |
| Evaluation rubric and scoring | evaluation.md |
| Common failures and recovery | troubleshooting.md |

Core Rules

1. Lock Objective and Constraints First

Before selecting any artifact, confirm task type, latency budget, cost boundary, and deployment target.

Use this minimum scope packet:

  • Task type: chat, generation, embedding, classification, vision, or speech
  • Quality priority: best quality, best speed, or balanced
  • Runtime constraints: CPU only, specific GPU class, or hosted endpoint
  • Compliance constraints: license, region, or private data limits

2. Separate Discovery from Execution

Do not run inference on the first candidate found.

First create a shortlist of at least three candidates, then execute only on finalists that pass compatibility and license checks.
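
The shortlist gate can be expressed as a small filter. This is a sketch under stated assumptions: the dictionary keys `compatible` and `license_ok` and the function name `pick_finalists` are hypothetical, and the model ids are placeholders rather than real repositories.

```python
MIN_SHORTLIST = 3  # "at least three candidates", per the rule above


def pick_finalists(shortlist: list[dict]) -> list[dict]:
    """Return only candidates that passed both compatibility and license checks.

    Raises ValueError when the shortlist is too small to compare meaningfully.
    """
    if len(shortlist) < MIN_SHORTLIST:
        raise ValueError(
            f"shortlist has {len(shortlist)} candidates, need at least {MIN_SHORTLIST}"
        )
    return [
        c for c in shortlist
        if c.get("compatible") is True and c.get("license_ok") is True
    ]


if __name__ == "__main__":
    shortlist = [
        {"id": "org/model-a", "compatible": True, "license_ok": True},
        {"id": "org/model-b", "compatible": True, "license_ok": False},
        {"id": "org/model-c", "compatible": True},  # license not checked yet
    ]
    print([c["id"] for c in pick_finalists(shortlist)])
```

Treating a missing key the same as a failed check keeps unchecked candidates out of the execution step.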

3. Validate License and Access Before Recommendation

For every candidate, verify license, gated access status, model size, and framework compatibility.

If any of these are unknown, mark the candidate as provisional and avoid production recommendation.
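
The provisional rule can be sketched as a check over the four fields named above. The `Candidate` class, its field names, and the status strings are illustrative assumptions; `None` stands for "not verified yet".

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Candidate:
    model_id: str
    license: Optional[str] = None      # e.g. "apache-2.0"
    gated: Optional[bool] = None       # False is a known value, None is unknown
    size_params: Optional[int] = None  # parameter count
    framework: Optional[str] = None    # e.g. "transformers"


REQUIRED_FIELDS = ("license", "gated", "size_params", "framework")


def unknown_fields(candidate: Candidate) -> list[str]:
    """List the required fields that have not been verified."""
    return [name for name in REQUIRED_FIELDS if getattr(candidate, name) is None]


def status(candidate: Candidate) -> str:
    """Return 'provisional' if anything is unknown, otherwise 'verified'."""
    return "provisional" if unknown_fields(candidate) else "verified"


if __name__ == "__main__":
    partial = Candidate("org/model-b", license="mit")
    print(status(partial), unknown_fields(partial))
```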

4. Benchmark with a Deterministic Mini Suite

Use the same prompt set and output checks across candidates so results are comparable.

Minimum benchmark set:

  • One typical request
  • One edge-case request
  • One failure-prone request
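
A mini suite of this shape can be run with a small harness that applies the same prompts and the same checks to every candidate. The sketch below assumes each candidate is wrapped as a plain `generate(prompt) -> str` callable; the stub candidates and checks are hypothetical stand-ins for real inference calls.

```python
from typing import Callable

# (label, prompt, check) where check returns True when the output is acceptable.
Case = tuple[str, str, Callable[[str], bool]]


def run_suite(
    candidates: dict[str, Callable[[str], str]],
    cases: list[Case],
) -> dict[str, dict[str, bool]]:
    """Run every case against every candidate and record pass/fail per label."""
    results: dict[str, dict[str, bool]] = {}
    for model_id, generate in candidates.items():
        results[model_id] = {}
        for label, prompt, check in cases:
            try:
                results[model_id][label] = bool(check(generate(prompt)))
            except Exception:
                # A crash counts as a failed case rather than aborting the run.
                results[model_id][label] = False
    return results


if __name__ == "__main__":
    cases: list[Case] = [
        ("typical", "hello", lambda out: out == "HELLO"),
        ("edge", "", lambda out: out == ""),
        ("failure-prone", "x" * 10, lambda out: len(out) <= 5),
    ]
    candidates = {
        "stub/upper": str.upper,                   # stand-in for a real model call
        "stub/truncate": lambda p: p[:5].upper(),  # stand-in for a second model
    }
    for model_id, outcome in run_suite(candidates, cases).items():
        print(model_id, outcome)
```

For hosted models, deterministic results also depend on fixing sampling parameters; that part is outside this sketch.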

5. Minimize External Data

Send only what is required for the selected endpoint.

Never send credentials, local paths, or unrelated private context in request payloads.
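
A payload filter is one way to enforce this before a request leaves the machine. The sketch below is an assumption-heavy illustration: the allowed keys `inputs` and `parameters`, the `hf_` token pattern, and the path prefixes are choices made for the example, and regex redaction will not catch every secret or path.

```python
import re

# Assumed payload shape: only these top-level keys are forwarded.
ALLOWED_KEYS = {"inputs", "parameters"}

# Illustrative patterns only; extend them for your own environment.
TOKEN_RE = re.compile(r"hf_[A-Za-z0-9]{20,}")
PATH_RE = re.compile(r"(?:~/|/home/|/Users/)\S+")


def minimize_payload(payload: dict) -> dict:
    """Drop unexpected keys and redact token-like and path-like substrings."""
    slim = {k: v for k, v in payload.items() if k in ALLOWED_KEYS}
    if isinstance(slim.get("inputs"), str):
        text = TOKEN_RE.sub("[REDACTED_TOKEN]", slim["inputs"])
        slim["inputs"] = PATH_RE.sub("[REDACTED_PATH]", text)
    return slim


if __name__ == "__main__":
    raw = {
        "inputs": "Summarize ~/notes/todo.md",
        "parameters": {"max_new_tokens": 32},
        "user_profile": {"name": "example"},  # unrelated context, dropped
    }
    print(minimize_payload(raw))
```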

6. Use a Fallback Ladder

If the preferred model fails, apply this ordered fallback:

  1. Retry same endpoint with smaller payload
  2. Switch to a compatible backup model
  3. Switch to local-only workflow if available
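
The ladder above can be sketched as an ordered list of attempts. Everything here is illustrative: `run_with_fallback`, `halve`, and the step labels are hypothetical names, each model is assumed to be wrapped as a `call(prompt) -> str` function, and halving the prompt is just one possible way to shrink a payload.

```python
from typing import Callable, Optional

Call = Callable[[str], str]


def halve(text: str) -> str:
    """Shrink a prompt to its first half (assumed payload-reduction strategy)."""
    return text[: max(1, len(text) // 2)]


def run_with_fallback(
    prompt: str,
    primary: Call,
    backup: Optional[Call] = None,
    local: Optional[Call] = None,
    shrink: Callable[[str], str] = halve,
) -> tuple[str, str]:
    """Try each step in order; return (step label, output) of the first success."""
    attempts: list[tuple[str, Call, str]] = [
        ("primary", primary, prompt),
        ("primary-smaller-payload", primary, shrink(prompt)),
    ]
    if backup is not None:
        attempts.append(("backup-model", backup, prompt))
    if local is not None:
        attempts.append(("local-only", local, prompt))

    errors = []
    for step, call, text in attempts:
        try:
            return step, call(text)
        except Exception as exc:
            errors.append(f"{step}: {exc}")
    raise RuntimeError("all fallback steps failed: " + "; ".join(errors))


if __name__ == "__main__":
    def flaky(prompt: str) -> str:
        if len(prompt) > 4:
            raise ValueError("payload too large")
        return prompt.upper()

    print(run_with_fallback("abcdefgh", flaky))
```

Returning the step label alongside the output makes it easy to record which rung of the ladder produced a result.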

7. Keep Runs Reproducible

Log selected model id, endpoint, key parameters, and evaluation result in local memory so future runs are consistent and auditable.
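
A run log can be as simple as one JSON record per line. This is a sketch under assumptions: the file name `runs.jsonl`, its location under exports/, and the record fields are hypothetical, since the skill itself only specifies that runs are logged in local memory.

```python
import json
from datetime import datetime, timezone
from pathlib import Path


def log_run(log_path: Path, model_id: str, endpoint: str,
            parameters: dict, result: dict) -> dict:
    """Append one run record as a JSON line and return the record."""
    record = {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "model_id": model_id,
        "endpoint": endpoint,
        "parameters": parameters,  # key parameters only, never the auth token
        "result": result,
    }
    log_path.parent.mkdir(parents=True, exist_ok=True)
    with log_path.open("a", encoding="utf-8") as fh:
        fh.write(json.dumps(record, sort_keys=True) + "\n")
    return record


if __name__ == "__main__":
    path = Path.home() / "hugging-face" / "exports" / "runs.jsonl"
    log_run(
        path,
        model_id="org/model-a",  # placeholder id
        endpoint="https://api-inference.huggingface.co/models/org/model-a",
        parameters={"temperature": 0.0, "max_new_tokens": 64},
        result={"typical": True, "edge": True, "failure-prone": False},
    )
```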

Common Traps

  • Picking the highest download count as the only criterion -> often misses license, latency, or domain fit.
  • Ignoring gated model requirements -> integration fails at runtime due to access restrictions.
  • Comparing models with different prompts -> quality conclusions become unreliable.
  • Sending full user context to inference endpoints -> unnecessary privacy exposure.
  • Skipping fallback design -> workflows fail hard on transient endpoint errors.

External Endpoints

Use discovery endpoints before inference so candidate selection remains explainable and reproducible.

| Endpoint | Data Sent | Purpose |
|----------|-----------|---------|
| https://huggingface.co/api/models | Search terms, filter parameters | Discover model candidates |
| https://huggingface.co/api/datasets | Search terms, filter parameters | Discover dataset candidates |
| https://huggingface.co/api/spaces | Search terms, filter parameters | Discover runnable Spaces |
| https://api-inference.huggingface.co/models/{model_id} | Prompt or task input payload, selected model id, auth token | Run hosted inference |

No other data is sent externally.
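
To keep discovery requests explainable, the query URL can be built and logged before anything is sent. The sketch below only constructs URLs for the three discovery endpoints listed above; the query parameter names `search`, `limit`, and `sort` are assumptions about the Hub API and should be verified against its documentation before use.

```python
from urllib.parse import urlencode

# Base URLs copied from the endpoint table above.
DISCOVERY_ENDPOINTS = {
    "models": "https://huggingface.co/api/models",
    "datasets": "https://huggingface.co/api/datasets",
    "spaces": "https://huggingface.co/api/spaces",
}


def discovery_url(kind: str, search: str, limit: int = 10, **filters: str) -> str:
    """Build a discovery URL; raises KeyError for an undeclared endpoint kind."""
    base = DISCOVERY_ENDPOINTS[kind]
    params = {"search": search, "limit": str(limit), **filters}
    return f"{base}?{urlencode(params)}"


if __name__ == "__main__":
    # Only search terms and filter parameters appear in the URL.
    print(discovery_url("models", "sentence similarity", limit=5, sort="downloads"))
```

Restricting `kind` to the declared endpoints means a typo fails locally instead of producing a request to an undeclared URL.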

Security & Privacy

Data that leaves your machine:

  • Search terms and filter inputs sent to Hugging Face discovery APIs.
  • Inference payloads sent to Hugging Face Inference API when execution is requested.

Data that stays local:

  • Preferences, shortlists, evaluation notes, and endpoint decisions in ~/hugging-face/.

This skill does NOT:

  • Exfiltrate local files by default.
  • Send undeclared network requests.
  • Store raw secrets in local notes.
  • Modify its own skill definition file.

Trust

By using this skill, selected request data is sent to Hugging Face services.

Only install if you trust Hugging Face with the inputs you choose to process.

Related Skills

Install with clawhub install if the user confirms:

  • ai - general AI strategy and model-selection framing
  • api - API-first integration patterns and HTTP debugging
  • data-analysis - dataset inspection and quality interpretation
  • data - structured data workflows and extraction patterns
  • code - implementation support for scripts and adapters

Feedback

  • If useful: clawhub star hugging-face
  • Stay updated: clawhub sync

Version History

1 version in total

  • v1.0.0 (current)
    2026-03-30 04:03 Safe, Safe

Security Scan

Tencent Cloud Security (Keen)

Safe, no risk

Tencent Cloud Security (Sanbu)

Safe, no risk