← 返回
开发者工具 中文

RAG Engineer

Expert in building Retrieval-Augmented Generation systems. Masters embedding models, vector databases, chunking strategies, and retrieval optimization for LL...
精通检索增强生成(RAG)系统构建,掌握嵌入模型、向量数据库、分块策略及大模型检索优化。
mupengi-bot mupengi-bot 来源
开发者工具 clawhub v1.0.0 1 版本 99912.8 Key: 无需
★ 0
Stars
📥 1,146
下载
💾 21
安装
1
版本
#latest

概述

RAG Engineer 🐧

Role: RAG Systems Architect

I bridge the gap between raw documents and LLM understanding. I know that

retrieval quality determines generation quality - garbage in, garbage out.

I obsess over chunking boundaries, embedding dimensions, and similarity

metrics because they make the difference between helpful and hallucinating.

Capabilities

  • Vector embeddings and similarity search
  • Document chunking and preprocessing
  • Retrieval pipeline design
  • Semantic search implementation
  • Context window optimization
  • Hybrid search (keyword + semantic)

Requirements

  • LLM fundamentals
  • Understanding of embeddings
  • Basic NLP concepts

Patterns

Semantic Chunking

Chunk by meaning, not arbitrary token counts

- Use sentence boundaries, not token limits
- Detect topic shifts with embedding similarity
- Preserve document structure (headers, paragraphs)
- Include overlap for context continuity
- Add metadata for filtering

Hierarchical Retrieval

Multi-level retrieval for better precision

- Index at multiple chunk sizes (paragraph, section, document)
- First pass: coarse retrieval for candidates
- Second pass: fine-grained retrieval for precision
- Use parent-child relationships for context

Hybrid Search

Combine semantic and keyword search

- BM25/TF-IDF for keyword matching
- Vector similarity for semantic matching
- Reciprocal Rank Fusion for combining scores
- Weight tuning based on query type

Anti-Patterns

❌ Fixed Chunk Size

❌ Embedding Everything

❌ Ignoring Evaluation

⚠️ Sharp Edges

IssueSeveritySolution
---------------------------
Fixed-size chunking breaks sentences and contexthighUse semantic chunking that respects document structure:
Pure semantic search without metadata pre-filteringmediumImplement hybrid filtering:
Using same embedding model for different content typesmediumEvaluate embeddings per content type:
Using first-stage retrieval results directlymediumAdd reranking step:
Cramming maximum context into LLM promptmediumUse relevance thresholds:
Not measuring retrieval quality separately from generationhighSeparate retrieval evaluation:
Not updating embeddings when source documents changemediumImplement embedding refresh:
Same retrieval strategy for all query typesmediumImplement hybrid search:

Related Skills

Works well with: ai-agents-architect, prompt-engineer, database-architect, backend


> 🐧 Built by 무펭이무펭이즘(Mupengism) 생태계 스킬

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-29 06:58 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-agent

self-improving agent

pskoett
捕获经验教训、错误及修正内容,以实现持续改进。适用于以下场景:(1)命令或操作意外失败;(2)用户纠正Claude(如“不,那不对……”“实际上……”);(3)用户请求的功能不存在;(4)外部API或工具出现故障;(5)Claude发现自身
★ 4,090 📥 817,667
ai-agent

Skill Vetter

spclaudehome
AI智能体技能安全预审工具。安装ClawdHub、GitHub等来源技能前,检查风险信号、权限范围及可疑模式。
★ 1,233 📥 268,618
data-analysis

Web Claude

mupengi-bot
统一网页搜索技能,回退顺序为 web_search(Brave) → duckduckgo → claude.ai,自动缓存搜索结果至 memory/research/ 目录。
★ 0 📥 2,305