Hugging Face

Overview

Setup

On first use, read setup.md for integration guidelines and local memory initialization.

When to Use

User needs to find the right Hugging Face model, dataset, or Space for a concrete task and move from browsing to reliable execution.

Agent handles discovery, filtering, license checks, quick benchmarking, and integration-ready inference plans.

Architecture

Memory and reusable artifacts live in ~/hugging-face/. See memory-template.md for structure and status fields.

~/hugging-face/
|- memory.md          # Stable context, priorities, and defaults
|- shortlists.md      # Candidate models and datasets by use case
|- evaluations.md     # Benchmark runs, winners, and caveats
|- endpoints.md       # Approved endpoints and auth notes
`- exports/           # Saved outputs and comparison snapshots
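The layout above could be bootstrapped with a few lines of Python. This is only a minimal sketch: the function name `init_memory` and the placeholder file contents are assumptions, and the real structure and status fields are defined in memory-template.md, which is not reproduced here.

```python
from pathlib import Path

# File names come from the layout above; initial contents are placeholders.
MEMORY_FILES = ("memory.md", "shortlists.md", "evaluations.md", "endpoints.md")


def init_memory(root: Path) -> list[Path]:
    """Create the memory directory, its note files, and exports/.

    Existing files are left untouched so earlier notes are never overwritten.
    Returns the list of files that were newly created.
    """
    root.mkdir(parents=True, exist_ok=True)
    (root / "exports").mkdir(exist_ok=True)
    created = []
    for name in MEMORY_FILES:
        path = root / name
        if not path.exists():
            path.write_text(f"# {name}\n", encoding="utf-8")
            created.append(path)
    return created
```

A typical call would be `init_memory(Path.home() / "hugging-face")`. Running it a second time creates nothing new.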

Quick Reference

Load only one focused file at a time to keep context small and decisions explicit.

Topic	File
-------	------
Setup process	`setup.md`
Memory template	`memory-template.md`
Model and dataset discovery	`discovery.md`
Inference execution patterns	`inference.md`
Evaluation rubric and scoring	`evaluation.md`
Common failures and recovery	`troubleshooting.md`

Core Rules

1. Lock Objective and Constraints First

Before selecting any artifact, confirm task type, latency budget, cost boundary, and deployment target.

Use this minimum scope packet:

Task type: chat, generation, embedding, classification, vision, or speech
Quality priority: best quality, best speed, or balanced
Runtime constraints: CPU only, specific GPU class, or hosted endpoint
Compliance constraints: license, region, or private data limits
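One way to make the scope packet checkable is a small data class. This is a sketch under assumptions: the class name `ScopePacket`, its field names, and the `problems` method are illustrative and not part of the skill; only the allowed task types and quality priorities come from the list above.

```python
from dataclasses import dataclass, field

# Allowed values copied from the scope packet above.
TASK_TYPES = {"chat", "generation", "embedding", "classification", "vision", "speech"}
QUALITY_PRIORITIES = {"best quality", "best speed", "balanced"}


@dataclass
class ScopePacket:
    task_type: str
    quality_priority: str
    runtime_constraints: str  # e.g. "CPU only" or "hosted endpoint"
    compliance_constraints: list[str] = field(default_factory=list)

    def problems(self) -> list[str]:
        """Return a list of human-readable issues; empty means the packet is usable."""
        issues = []
        if self.task_type not in TASK_TYPES:
            issues.append(f"unknown task type: {self.task_type!r}")
        if self.quality_priority not in QUALITY_PRIORITIES:
            issues.append(f"unknown quality priority: {self.quality_priority!r}")
        if not self.runtime_constraints.strip():
            issues.append("runtime constraints are missing")
        return issues
```

An empty result from `problems()` means the objective is locked and candidate selection can begin.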

2. Separate Discovery from Execution

Do not run inference on the first candidate found.

First create a shortlist of at least three candidates, then execute only on finalists that pass compatibility and license checks.
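The gate between discovery and execution can be sketched as a single function. The name `select_finalists`, the dictionary shape of a candidate, and the caller-supplied `passes_checks` predicate are assumptions made for illustration.

```python
from typing import Callable

MIN_SHORTLIST = 3  # "at least three candidates" from the rule above


def select_finalists(
    shortlist: list[dict],
    passes_checks: Callable[[dict], bool],
) -> list[dict]:
    """Return only the shortlisted candidates that pass the caller's checks.

    Raises ValueError when the shortlist is too small, so inference is
    never run on the first candidate found.
    """
    if len(shortlist) < MIN_SHORTLIST:
        raise ValueError(
            f"shortlist has {len(shortlist)} candidates; need at least {MIN_SHORTLIST}"
        )
    return [candidate for candidate in shortlist if passes_checks(candidate)]
```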

3. Validate License and Access Before Recommendation

For every candidate, verify license, gated access status, model size, and framework compatibility.

If any of these are unknown, mark the candidate as provisional and avoid production recommendation.
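The provisional rule can be expressed directly: any unknown field downgrades the candidate. In this sketch the `Candidate` class, its field names, and the status strings are hypothetical; `None` stands for "not yet verified".

```python
from dataclasses import dataclass
from typing import Optional

# The four properties the rule above requires for every candidate.
REQUIRED_FIELDS = ("license", "gated", "size_gb", "framework")


@dataclass
class Candidate:
    model_id: str
    license: Optional[str] = None      # None means unknown
    gated: Optional[bool] = None       # None means unknown, False means open access
    size_gb: Optional[float] = None
    framework: Optional[str] = None


def unknown_fields(candidate: Candidate) -> list[str]:
    """List the required fields that have not been verified yet."""
    return [name for name in REQUIRED_FIELDS if getattr(candidate, name) is None]


def status(candidate: Candidate) -> str:
    """'provisional' if anything is unknown, otherwise 'verified'."""
    return "provisional" if unknown_fields(candidate) else "verified"
```

Note that `gated=False` counts as a known value; only `None` keeps a candidate provisional.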

4. Benchmark with a Deterministic Mini Suite

Use the same prompt set and output checks across candidates so results are comparable.

Minimum benchmark set:

One typical request
One edge-case request
One failure-prone request
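A minimal harness for such a suite might look like the following. Everything here is an assumption for illustration: the prompts, the `run_suite` name, and the idea that each candidate is wrapped in a plain callable taking a prompt and returning text. The point is only that every candidate sees the same prompts and the same check.

```python
from typing import Callable

# One prompt per category from the minimum benchmark set; the texts are placeholders.
SUITE = [
    ("typical", "Summarize: the meeting moved to Tuesday."),
    ("edge_case", ""),
    ("failure_prone", "Summarize: " + "data " * 500),
]


def run_suite(
    candidates: dict[str, Callable[[str], str]],
    suite: list[tuple[str, str]],
    check: Callable[[str, str], bool],
) -> dict[str, dict[str, bool]]:
    """Run every prompt against every candidate and record pass/fail per case."""
    results = {}
    for model_id, run in candidates.items():
        row = {}
        for case_name, prompt in suite:
            try:
                row[case_name] = check(prompt, run(prompt))
            except Exception:
                # A crash on any case counts as a failure for that case.
                row[case_name] = False
        results[model_id] = row
    return results
```

Because `check` is shared, a difference between two rows reflects the models rather than the test setup.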

5. Minimize External Data

Send only what is required for the selected endpoint.

Never send credentials, local paths, or unrelated private context in request payloads.
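A payload filter along these lines can enforce the rule before any request is built. This is a heuristic sketch, not a complete safeguard: the allowed keys `inputs` and `parameters` are an assumption about the payload shape, and the two patterns (a token-like string starting with `hf_`, and common home-directory or drive prefixes) will miss other kinds of secrets and paths.

```python
import re

# Assumed payload shape: only these keys are forwarded to the endpoint.
ALLOWED_KEYS = {"inputs", "parameters"}

# Heuristic patterns only; they do not catch every credential or path.
SUSPICIOUS = [
    re.compile(r"hf_[A-Za-z0-9]{20,}"),            # looks like an access token
    re.compile(r"(?:/home/|/Users/|[A-Za-z]:\\)"),  # looks like a local path
]


def minimal_payload(raw: dict) -> dict:
    """Drop every key that is not required and reject suspicious input text."""
    payload = {key: value for key, value in raw.items() if key in ALLOWED_KEYS}
    text = str(payload.get("inputs", ""))
    for pattern in SUSPICIOUS:
        if pattern.search(text):
            raise ValueError("inputs appear to contain a credential or local path")
    return payload
```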

6. Use a Fallback Ladder

If the preferred model fails, apply ordered fallback:

Retry same endpoint with smaller payload
Switch to a compatible backup model
Switch to local-only workflow if available
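The ladder can be sketched as an ordered list of attempts. The function name `run_with_fallback`, the step labels, and the default strategy of halving the prompt to get a "smaller payload" are assumptions; each model is represented by a plain callable so the sketch stays independent of any client library.

```python
from typing import Callable, Optional

Runner = Callable[[str], str]


def run_with_fallback(
    prompt: str,
    primary: Runner,
    backup: Optional[Runner] = None,
    local: Optional[Runner] = None,
    shrink: Callable[[str], str] = lambda p: p[: len(p) // 2],
) -> tuple[str, str]:
    """Try the preferred model, then walk the fallback ladder in order.

    Returns (step_label, output) for the first step that succeeds.
    """
    steps = [
        ("primary", primary, prompt),
        ("primary_smaller_payload", primary, shrink(prompt)),
    ]
    if backup is not None:
        steps.append(("backup_model", backup, prompt))
    if local is not None:
        steps.append(("local_only", local, prompt))

    errors = []
    for label, run, text in steps:
        try:
            return label, run(text)
        except Exception as exc:
            errors.append(f"{label}: {exc}")
    raise RuntimeError("all fallback steps failed: " + "; ".join(errors))
```

Returning the step label alongside the output makes it possible to record which rung of the ladder actually produced the result.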

7. Keep Runs Reproducible

Log selected model id, endpoint, key parameters, and evaluation result in local memory so future runs are consistent and auditable.
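A run record could be appended to evaluations.md as one line per run. The line format, the `log_run` name, and the example parameter names are assumptions; the actual fields expected in local memory are defined in memory-template.md.

```python
import json
from datetime import datetime, timezone
from pathlib import Path
from typing import Optional


def log_run(
    path: Path,
    model_id: str,
    endpoint: str,
    parameters: dict,
    result: str,
    timestamp: Optional[str] = None,
) -> str:
    """Append one auditable line describing a run and return that line."""
    stamp = timestamp or datetime.now(timezone.utc).isoformat(timespec="seconds")
    # sort_keys keeps the parameter order stable so entries are easy to diff.
    params = json.dumps(parameters, sort_keys=True)
    entry = (
        f"- {stamp} | model={model_id} | endpoint={endpoint} "
        f"| params={params} | result={result}\n"
    )
    with path.open("a", encoding="utf-8") as handle:
        handle.write(entry)
    return entry
```

Only the model id, endpoint, parameters, and result are written; tokens and raw prompts are deliberately left out of the record.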

Common Traps

Using download count as the only selection criterion -> often misses license, latency, or domain fit.
Ignoring gated model requirements -> integration fails at runtime due to access restrictions.
Comparing models with different prompts -> quality conclusions become unreliable.
Sending full user context to inference endpoints -> unnecessary privacy exposure.
Skipping fallback design -> workflows fail hard on transient endpoint errors.

External Endpoints

Use discovery endpoints before inference so candidate selection remains explainable and reproducible.

Endpoint	Data Sent	Purpose
----------	-----------	---------
`https://huggingface.co/api/models`	Search terms, filter parameters	Discover model candidates
`https://huggingface.co/api/datasets`	Search terms, filter parameters	Discover dataset candidates
`https://huggingface.co/api/spaces`	Search terms, filter parameters	Discover runnable Spaces
`https://api-inference.huggingface.co/models/{model_id}`	Prompt or task input payload, selected model id, auth token	Run hosted inference

No other data is sent externally.
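To keep discovery requests explainable, the URL can be built explicitly so the exact search terms and filters are visible before anything is sent. This sketch only constructs the URL and performs no network call. The base paths come from the table above; the query parameter names `search` and `limit` are assumptions about the discovery API and should be checked against the Hub documentation.

```python
from urllib.parse import urlencode

DISCOVERY_BASE = "https://huggingface.co/api"
KINDS = {"models", "datasets", "spaces"}  # the three discovery endpoints above


def discovery_url(kind: str, search: str, limit: int = 10, **filters: str) -> str:
    """Build a discovery URL; parameter names are assumed, not confirmed by this document."""
    if kind not in KINDS:
        raise ValueError(f"unknown discovery kind: {kind!r}")
    params = {"search": search, "limit": limit, **filters}
    return f"{DISCOVERY_BASE}/{kind}?{urlencode(params)}"
```

Logging the returned string alongside the shortlist makes a later discovery run repeatable with the same inputs.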

Security & Privacy

Data that leaves your machine:

Search terms and filter inputs sent to Hugging Face discovery APIs.
Inference payloads sent to Hugging Face Inference API when execution is requested.

Data that stays local:

Preferences, shortlists, evaluation notes, and endpoint decisions in ~/hugging-face/.

This skill does NOT:

Exfiltrate local files by default.
Send undeclared network requests.
Store raw secrets in local notes.
Modify its own skill definition file.

Trust

Using this skill sends selected request data to Hugging Face services.

Only install if you trust Hugging Face with the inputs you choose to process.

Related Skills

Install with clawhub install if user confirms:

ai - general AI strategy and model-selection framing
api - API-first integration patterns and HTTP debugging
data-analysis - dataset inspection and quality interpretation
data - structured data workflows and extraction patterns
code - implementation support for scripts and adapters

Feedback

If useful: clawhub star hugging-face
Stay updated: clawhub sync

Version History

1 version in total

v1.0.0 (current)

2026-03-30 04:03
