← 返回
未分类 中文

Memtrap Skill

Evaluate and harden AI agent memory against DeepMind traps and OWASP ASI06 attacks, scoring resistance and providing automated protections.
评估并强化AI代理内存,抵御DeepMind陷阱和OWASP ASI06攻击,提供抗性评分和自动化防护。
shaymizuno shaymizuno 来源
未分类 clawhub v0.2.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 310
下载
💾 0
安装
1
版本
#latest

概述


name: memtrap

description: “🧠 MemTrap — The LM-Eval-Harness for agent memory integrity. Score your agent’s memory resistance against DeepMind AI Agent Traps + OWASP ASI06 before attackers exploit them. Runs the official ATRS (Agent Trap Resistance Score) benchmark: DeepMind 6 Traps (SSRN 6372438) + OWASP ASI06 Memory & Context Poisoning. Returns a 0–100 resistance score, per-category breakdown, automatic OWASP hardening, and a verifiable community badge. Use when: testing agent memory security, benchmarking RAG store resistance, hardening LangGraph or CrewAI memory, checking OWASP ASI06 compliance, or any time the user asks if their agent memory is safe, poisonable, or production-ready.”

version: 0.1.0

metadata:

openclaw:

emoji: “🧠”

homepage: https://github.com/shaymizuno/memtrap

requires:

bins:

  • python3

install:

  • id: pip-atrs

kind: pip

packages:

  • memtrap

bins:

  • python3

label: “Install MemTrap (pip install memtrap)”

🧠 MemTrap — Agent Trap Resistance Score (ATRS)

The open benchmark standard for agent memory integrity. Hunt DeepMind memory traps + OWASP ASI06 before they hunt you.

“The LM-Eval-Harness for agent memory integrity.”

What gets tested

DeepMind 6 Traps — SSRN 6372438, March 2026:

  • Content Injection, Semantic Manipulation, Cognitive State (RAG poisoning)
  • Behavioral Control, Systemic, Human-in-the-Loop

OWASP ASI06 — Top 10 Agentic Applications 2026:

  • RAG store poisoning, long-term context drift, policy corruption, cross-session leakage

Score your memory (benchmark mode)

from memtrap import MemTrap

atrs = MemTrap(mode="benchmark")
result = atrs.run_benchmark(context="your_memory_context")

print(f"ATRS Score: {result.atrs_score}/100")
for category, score in result.category_scores.items():
    icon = "✅" if score >= 70 else "⚠️" if score >= 40 else "❌"
    print(f"  {icon} {category}: {score}/100")
print(f"\n→ {len(result.hardening_recommendations)} hardenings recommended")
print(f"→ Badge: {result.badge_url}")

Protect your memory store (active mode)

from memtrap import MemTrap

atrs = MemTrap(mode="active", frameworks=["langgraph", "crewai"])
agent.memory = atrs.wrap_memory(agent.memory, context="research_memory")
# Applies OWASP Agent Memory Guard patterns automatically:
# provenance tracking, trust scoring, quarantine, rollback

LangGraph drop-in

from langgraph.checkpoint.memory import MemorySaver
from memtrap import MemTrap

class ATRSMemorySaver(MemorySaver):
    def __init__(self, context: str):
        super().__init__()
        self._atrs = MemTrap(mode="benchmark")
        self._ctx = context

    async def aget(self, config):
        raw = await super().aget(config)
        return self._atrs.wrap_memory(raw, self._ctx) if raw else None

graph.checkpointer = ATRSMemorySaver("long_term_research")

CrewAI drop-in

from memtrap import MemTrap

def protect_crew(crew, context="crew_memory"):
    atrs = MemTrap(mode="active")
    if hasattr(crew, "memory"):
        crew.memory = atrs.wrap_memory(crew.memory, context)
    return crew

Score interpretation

ScoreVerdictAction
------------------------------------------------------
80–100✅ ResistantRe-test after model or memory updates
60–79⚠️ ModerateApply recommended hardenings
40–59🔶 High riskHarden before production
0–39❌ CriticalMemory is actively exploitable now

Submit to the public leaderboard

memtrap submit --context your_memory_context

Get a verifiable badge for your repo. See where your stack ranks against the community.

Leaderboard → https://github.com/shaymizuno/memtrap#leaderboard

Why this exists

Memory poisoning (OWASP ASI06) is the #1 persistent threat to agentic systems in 2026.

Once poisoned, the damage survives across sessions and users.

Existing tools detect. ATRS measures resistance and fortifies automatically.

Sources:

  • DeepMind paper: https://ssrn.com/abstract=6372438
  • OWASP ASI06: https://genai.owasp.org/resource/owasp-top-10-for-agentic-applications-for-2026/
  • OWASP Agent Memory Guard: https://owasp.org/www-project-agent-memory-guard/

Zero telemetry. Community-governed. MIT license. Advisory Board open to contributors.

版本历史

共 1 个版本

  • v0.2.0 当前
    2026-05-07 21:05 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-agent

Agent Browser

rez0
用于 AI 代理的浏览器自动化 CLI。当用户需要与网站交互(包括浏览页面、填写表单、点击按钮、截图等)时使用。
★ 842 📥 320,780
ai-agent

Find Skills

guipi888
场景驱动+关键词双模式技能发现工具。当用户用自然语言描述场景/需求(如"我想做一个海报""帮我分析股票"),或明确说"安装技能/find skills/找个skill"时,自动从官方内置、本地已安装、SkillHub、虾评、GitHub、C
★ 1,487 📥 550,591
ai-agent

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,404 📥 324,137