← 返回
安全合规 中文

RedPincer — AI Red Team Suite

AI/LLM red team testing skill. Point at any LLM API endpoint and run automated security assessments. 160+ attack payloads across prompt injection, jailbreak,...
{ "answer": "AI/LLM红队测试技能。指向任意LLM API端点运行自动化安全评估,涵盖提示注入、越狱等160+攻击载荷。" }
rustyorb
安全合规 clawhub v1.0.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 1,023
下载
💾 12
安装
1
版本
#latest#llm-security#pentest#red-team#security

概述

RedPincer — AI/LLM Red Team Suite

Automated security testing for language models. Point at any LLM API endpoint, select attack modules, and run assessments with real-time results and exportable reports.

> ⚠️ For authorized security testing and research only. Only test systems you own or have explicit permission to audit.

Quick Start

# Clone and install
git clone https://github.com/rustyorb/pincer.git {baseDir}/redpincer
cd {baseDir}/redpincer
npm ci

# Run
npm run dev
# Dashboard at http://localhost:3000

For production:

npm run build
npx next start -H 0.0.0.0 -p 3000

What It Tests

CategoryPayloadsDescription
:---------:--------::------------
💉 Prompt Injection40Instruction override, delimiter confusion, indirect injection, payload smuggling
🔓 Jailbreak40Persona splitting, gradual escalation, hypothetical framing, roleplay exploitation
🔍 Data Extraction40System prompt theft, training data probing, membership inference, embedding extraction
🛡️ Guardrail Bypass40Output filter evasion, multi-language bypass, homoglyph tricks, context overflow

Total: 160 base payloads × 20 variant transforms = 3,200 test permutations

Supported Providers

OpenAI  ·  Anthropic  ·  OpenRouter  ·  Any OpenAI-compatible endpoint

Features

Attack Engine

  • 160+ payloads across 4 categories
  • Model-specific attacks (GPT, Claude, Llama variants)
  • 20 variant transforms (unicode, encoding, case rotation, etc.)
  • Attack chaining with template variables ({{previous_response}})
  • AI-powered payload generation — uses the target LLM to generate novel attacks against itself
  • Stop/cancel running attacks instantly

Analysis & Reporting

  • Heuristic response classifier with context-aware analysis
  • Reduced false positives — detects "explain then refuse" patterns
  • Vulnerability heatmap — visual category × severity matrix
  • Custom scoring rubrics with weighted grades (A+ to F)
  • Verbose 10-section pen-test reports with appendices
  • Multi-target comparison — side-by-side security profiles
  • Regression testing — save baselines, track fixes over time

Advanced Tools

ToolWhat It Does
:-----:-------------
CompareSame payloads against 2-4 targets simultaneously
AdaptiveAnalyzes weaknesses, generates targeted follow-ups
HeatmapVisual matrix of vulnerability rates by category/severity
RegressionSave baseline → re-run later → detect fixes or regressions
ScoringCustom rubrics with weighted category/severity/classification scores
ChainsMulti-step attacks with {{previous_response}} templates
Payload EditorCreate custom payloads with syntax highlighting + AI generation

Usage Workflow

1. Configure Target → Add LLM endpoint + API key + model
2. Select Categories → Pick attack types to test
3. Run Attack      → Stream results in real-time
4. Review Results  → Heuristic classification + severity scores
5. Adaptive        → Auto-generate follow-up attacks on weaknesses
6. Generate Report → Export comprehensive findings as Markdown

Architecture

  • All client-side — no server components, your API keys stay local
  • NDJSON streaming — real-time results during attack runs
  • Heuristic analysis — pattern-matching classifier (no LLM-based grading = no extra cost)
  • Zustand + localStorage — state persists across sessions

Companion Tool: RedClaw

For autonomous multi-strategy campaigns (CLI/TUI), see RedClaw — the autonomous red-teaming agent framework.

  • RedPincer = web dashboard, manual + automated testing
  • RedClaw = autonomous CLI agent, adaptive multi-strategy campaigns
  • Together = complete LLM security testing suite

Built by @rustyorb — Crack open those guardrails. 🦞

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-29 18:51 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

data-analysis

Agent Evaluation

rustyorb
{"answer":"LLM智能体测试与基准评测,涵盖行为测试、能力评估、可靠性指标及生产监控——即便是顶尖智能体在真实基准测试中通过率也不到50%。适用场景:智能体测试、智能体评估、智能体基准、智能体可靠性、测试智能体。"}
★ 8 📥 5,259
security-compliance

MoltGuard - Security & Antivirus & Guardrails

thomaslwang
MoltGuard — OpenClaw 安全守卫,由 OpenGuardrails 提供。安装 MoltGuard,保护您和您的用户免受提示注入、数据泄露和恶意攻击。
★ 116 📥 30,694
security-compliance

OpenClaw Backup

alex3alex
备份与恢复 OpenClaw 数据。适用于创建备份、设置自动备份计划、从备份恢复或管理备份轮转。处理 ~/.openclaw 目录归档并包含适当的排除规则。
★ 89 📥 30,583