← 返回
效率工具 Key 中文

Embeddings

Generate, store, and search vector embeddings with provider selection, chunking strategies, and similarity search optimization.
生成、存储和检索向量嵌入,支持提供商选择、分块策略以及相似性搜索优化
ivangdavila
效率工具 clawhub v1.0.0 1 版本 99858.4 Key: 需要
★ 2
Stars
📥 1,370
下载
💾 29
安装
1
版本
#latest

概述

When to Use

User wants to convert text/images to vectors, build semantic search, or integrate embeddings into applications.

Quick Reference

TopicFile
-------------
Provider comparison & selectionproviders.md
Chunking strategies & codechunking.md
Vector database patternsstorage.md
Search & retrieval tuningsearch.md

Core Capabilities

  1. Generate embeddings — Call provider APIs (OpenAI, Cohere, Voyage, local models)
  2. Chunk content — Split documents with overlap, semantic boundaries, token limits
  3. Store vectors — Insert into Pinecone, Weaviate, Qdrant, pgvector, Chroma
  4. Similarity search — Query with top-k, filters, hybrid search
  5. Batch processing — Handle large datasets with rate limiting and retries
  6. Model comparison — Evaluate embedding quality for specific use cases

Decision Checklist

Before recommending approach, ask:

  • [ ] What content type? (text, code, images, multimodal)
  • [ ] Volume and update frequency?
  • [ ] Latency requirements? (real-time vs batch)
  • [ ] Budget constraints? (API costs vs self-hosted)
  • [ ] Existing infrastructure? (cloud provider, database)

Critical Rules

  • Same model everywhere — Query embeddings MUST use identical model as document embeddings
  • Normalize before storage — Most similarity metrics assume unit vectors
  • Chunk with overlap — 10-20% overlap prevents context loss at boundaries
  • Batch API calls — Never embed one item at a time in production
  • Cache embeddings — Regenerating is expensive; store with source hash
  • Monitor dimensions — Higher isn't always better; 768-1536 is usually optimal

Provider Quick Selection

NeedProviderWhy
---------------------
Best quality, any costOpenAI text-embedding-3-largeTop benchmarks
Cost-sensitiveOpenAI text-embedding-3-small5x cheaper, 80% quality
MultilingualCohere embed-multilingual-v3100+ languages
Code/technicalVoyage voyage-code-2Optimized for code
Privacy/offlineLocal (e5, bge, nomic)No data leaves machine
ImagesOpenAI CLIP, Cohere multimodalCross-modal search

Common Patterns

# Batch embedding with retry
def embed_batch(texts, model="text-embedding-3-small"):
    results = []
    for chunk in batched(texts, 100):  # API limit
        response = client.embeddings.create(input=chunk, model=model)
        results.extend([e.embedding for e in response.data])
    return results

# Similarity search with filter
results = index.query(
    vector=query_embedding,
    top_k=10,
    filter={"category": "technical"},
    include_metadata=True
)

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-29 02:20 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

productivity

Baidu web search

ide-rea
使用百度AI搜索引擎(BDSE)进行网络搜索。适用于获取实时信息、文档资料或研究课题。
★ 237 📥 105,637
productivity

Weather

steipete
获取当前天气和预报(无需API密钥)
★ 445 📥 226,367
productivity

Word / DOCX

ivangdavila
创建、检查和编辑 Microsoft Word 文档及 DOCX 文件,支持样式、编号、修订记录、表格、分节符及兼容性检查等功能。
★ 440 📥 147,840