← 返回
未分类 中文

Lynkr AI Routing Proxy

Universal LLM gateway with intelligent routing, Graphify code intelligence, Distill compression, routing telemetry, Code Mode, and 12+ provider support. 60-8...
通用 LLM 网关,智能路由、Graphify 代码智能、Distill压缩、路由遥测、Code Mode,支持 12+ 提供商。60-8...
vishalveerareddy123
未分类 clawhub v8.0.1 2 版本 100000 Key: 无需
★ 0
Stars
📥 362
下载
💾 0
安装
2
版本
#latest

概述

Lynkr - Universal LLM Gateway

Lynkr routes AI coding requests to the optimal model based on task complexity, cost, and provider health. Supports 12+ providers with 60-80% cost reduction through intelligent token optimization.

Quick Start

npm install -g lynkr
lynkr-setup   # Auto-installs Ollama + pulls a model
lynkr          # Start the proxy

Then point your AI coding tool at http://localhost:8081/v1.

How It Works

  1. 5-Phase Complexity Analysis - Scores each request 0-100 using token count, tool usage, code patterns, domain keywords, and Graphify structural analysis (god nodes, community cohesion, blast radius)
  2. 4-Tier Routing - Maps score to SIMPLE/MEDIUM/COMPLEX/REASONING, each with a configured provider:model
  3. Agentic Detection - Detects multi-step workflows (tool loops, autonomous agents) and upgrades to higher tiers
  4. Cost Optimization - Picks the cheapest provider that can handle the tier
  5. Circuit Breaker + Failover - Automatic failover with half-open probe recovery

Key Features (v8.0)

Intelligent Routing

  • 5-phase complexity scoring with 15-dimension weighted mode
  • Agentic workflow detection (SINGLE_SHOT / TOOL_CHAIN / ITERATIVE / AUTONOMOUS)
  • Graphify knowledge graph integration — god node detection, community cohesion, blast radius
  • Routing telemetry with SQLite store, quality scoring (0-100), latency tracking (P50/P95/P99)

Token Optimization (60-80% savings)

  • Smart tool selection — filters tools by request type
  • Distill compression — structural similarity (Jaccard), delta rendering, block dedup
  • Code Mode — replaces 100+ MCP tools with 4 meta-tools (~96% token reduction)
  • History compression — sliding window with Distill-powered dedup
  • Prompt caching — SHA-256 keyed LRU cache
  • Headroom sidecar — optional 47-92% compression via Smart Crusher, CCR, LLMLingua

Production Hardening

  • Circuit breakers with half-open probe recovery
  • Admin hot-reload endpoint (POST /v1/admin/reload) — no restart needed
  • Per-request performance timing (PERF_TIMER=true)
  • Prometheus metrics, structured logging, health checks
  • Rate limiting, load shedding, input validation

Long-Term Memory (Titans-Inspired)

  • Surprise-based memory storage with decay
  • Semantic search via FTS5
  • Automatic extraction and injection

Configuration for OpenClaw

Set tier routing in your environment:

MODEL_PROVIDER=ollama
TIER_SIMPLE=ollama:llama3.2
TIER_MEDIUM=openrouter:anthropic/claude-sonnet-4
TIER_COMPLEX=bedrock:anthropic.claude-sonnet-4-20250514-v1:0
TIER_REASONING=bedrock:anthropic.claude-opus-4-20250514-v1:0

OpenClaw Mode

When running under OpenClaw, enable model name rewriting:

OPENCLAW_MODE=true

This replaces the generic model: "auto" in responses with the actual provider/model that handled the request.

Provider Registration

Add to your openclaw.json:

{
  "models": {
    "providers": [
      {
        "name": "lynkr",
        "type": "openai-compatible",
        "base_url": "http://localhost:8081/v1",
        "api_key": "any-value",
        "models": ["auto"]
      }
    ]
  },
  "agents": {
    "defaults": {
      "models": {
        "primary": "lynkr/auto",
        "fallback": "lynkr/auto"
      }
    }
  }
}

Providers

ProviderTypeModels
------------------------
OllamaLocal (free)llama3.2, qwen2.5-coder, deepseek-coder, mistral
llama.cppLocal (free)Any GGUF model
LM StudioLocal (free)Any downloaded model
OpenAICloudgpt-4o, o3, o4-mini
AnthropicCloudclaude-opus-4, claude-sonnet-4, claude-haiku-4.5
DatabricksCloudClaude, GPT, Llama via Foundation Model APIs
AWS BedrockCloudClaude, Titan, Llama, Mistral
Azure OpenAICloudGPT-4o, o1, o3
OpenRouterCloud100+ models
Google VertexCloudGemini 2.5 Pro/Flash
Moonshot AICloudKimi K2 Thinking/Turbo
Z.AICloudGLM-4.7
DeepSeekCloudDeepSeek Reasoner, R1

New in v8.0

  • Graphify Integration — AST-based knowledge graph with 19-language support for blast radius analysis
  • Distill Compression — Structural similarity, delta rendering, and smart dedup
  • Routing Telemetry — SQLite-backed decision recording with quality scoring
  • Code Mode — 4 MCP meta-tools replace 100+ individual definitions
  • Admin Reload — Hot-reload config + reset circuit breakers without restart
  • Performance Timer — Per-request timing breakdown (PERF_TIMER=true)
  • Large Payload Passthrough — Smart cloning skips base64 media that will be discarded

Response Headers

HeaderDescription
---------------------
X-Lynkr-ProviderProvider that handled the request
X-Lynkr-ModelModel used
X-Lynkr-TierComplexity tier (SIMPLE/MEDIUM/COMPLEX/REASONING)
X-Lynkr-Complexity-ScoreNumeric score 0-100
X-Lynkr-Routing-MethodHow the route was decided
X-Lynkr-AgenticAgentic workflow type (if detected)
X-Lynkr-Cost-OptimizedWhether cost optimization changed the provider

Telemetry Endpoints

EndpointDescription
-----------------------
GET /v1/routing/statsAggregated routing stats with latency percentiles
GET /v1/routing/stats/:providerPer-provider statistics
GET /v1/routing/telemetryRaw telemetry records
GET /v1/routing/accuracyOver/under-provisioned routing detection
POST /v1/admin/reloadHot-reload config + reset circuit breakers
POST /v1/admin/circuit-breakers/resetReset circuit breakers

版本历史

共 2 个版本

  • v8.0.1 当前
    2026-06-07 06:15
  • v0.6.0
    2026-05-07 13:52 安全 安全

安全检测

腾讯云安全 (Keen)

队列中

腾讯云安全 (Sanbu)

队列中

🔗 相关推荐

ai-agent

Agent Browser

rez0
用于 AI 代理的浏览器自动化 CLI。当用户需要与网站交互(包括浏览页面、填写表单、点击按钮、截图等)时使用。
★ 864 📥 342,021
ai-agent

Find Skills

root
帮助用户发现和安装智能体技能,当用户询问如「如何做X」、「找X的技能」、「有能做...的吗」等问题时
★ 1,513 📥 570,494
ai-agent

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,439 📥 327,945