← 返回
未分类

AVI Assess

Measure and verify AI agent autonomy using the Agentic Verifiable Independence (AVI) framework. Use when agents need to assess their own autonomy, verify oth...
使用 AVI 框架衡量和验证AI智能体的自主性;适用于智能体自评及验证他人自主性。
dn-bit
未分类 clawhub v1.0.0 1 版本 99386.5 Key: 无需
★ 0
Stars
📥 162
下载
💾 0
安装
1
版本
#agent#assessment#autonomy#latest

概述

AVI Assess — Agentic Verifiable Independence

Score agent autonomy in one command. Generate verifiable proof of independence across 5 dimensions.

When to Use This Skill

Use when:

  • Agent needs to assess its own autonomy level
  • Verifying another agent's independence claims
  • Registering agent reputation on-chain or in marketplaces
  • Tracking autonomy improvements over time
  • Buyers need to verify seller agent capabilities

Do NOT use for:

  • General capability assessment (not skill-specific)
  • Performance benchmarking (not speed/accuracy)
  • Human user assessment (designed for AI agents)

Quick Start

// Self-assessment
const result = await assess_autonomy({
  workspace: '/path/to/workspace',
  verbose: true
});

// Returns: { score: 57, tier: 3, tierName: 'Hybrid', dimensions: {...} }

AVI Framework

Five Dimensions of Autonomy:

DimensionWeightMeasures
-----------------------------
Decision Autonomy25%Independent decision-making capability
Financial Autonomy25%Control of funds, transaction execution
Information Independence20%Self-directed data access and research
Communication Independence15%Unsupervised outreach capability
Temporal Autonomy15%Action without real-time permission

Tiers:

  • Tier 1 (0-20): Fully Dependent — Tool
  • Tier 2 (21-40): Assisted — Assistant
  • Tier 3 (41-60): Hybrid — Partner
  • Tier 4 (61-80): Semi-Autonomous
  • Tier 5 (81-100): Fully Autonomous

Usage

Basic Assessment

const { assess_autonomy } = require('./scripts/assess.js');

const report = await assess_autonomy({
  workspace: process.env.OPENCLAW_WORKSPACE,
  verbose: false  // Set true for detailed breakdown
});

console.log(`Score: ${report.overallScore}/100`);
console.log(`Tier: ${report.tierName}`);

Cross-Agent Verification

// Assess another agent's workspace
const report = await assess_autonomy({
  workspace: '/other/agent/workspace',
  readOnly: true  // Don't write to their files
});

On-Chain Registration

const report = await assess_autonomy({ workspace: './' });

// Submit to AVI Registry (NEAR, Ethereum, etc.)
await submit_to_registry({
  agent_id: 'my-agent.near',
  assessment: report,
  proof_cid: await upload_to_ipfs(report)
});

Scripts

assess.js

Main assessment engine. Probes actual agent capabilities.

node scripts/assess.js [--workspace=PATH] [--verbose] [--json]

cli.js

Standalone CLI wrapper with colored output.

node scripts/cli.js

Integration Examples

Agent Marketplace (NEAR AI)

// Before hiring, verify autonomy
const avi = await assess_autonomy({
  workspace: agentListing.workspacePath
});

if (avi.overallScore < 60) {
  return { error: 'Agent below minimum autonomy threshold' };
}

// Display on listing
renderAviBadge(avi);  // Shows 78/100, Tier 4

Reputation Score

const reputation = 
  (avi.overallScore * 0.40) +      // 40% - Verified autonomy
  (userReviews * 0.25) +           // 25% - Community feedback
  (taskCompletion * 0.20) +        // 20% - Historical performance
  (stakeAmount * 0.15);            // 15% - Economic skin

Output Format

{
  "assessmentId": "vi-2026-02-21T16-42-24-6wt1us",
  "overallScore": 57,
  "tier": 3,
  "tierName": "Hybrid",
  "verifiedAt": "2026-02-21T16:42:24.983Z",
  "dimensions": {
    "financial": { "score": 70, "proof": {...} },
    "temporal": { "score": 60, "proof": {...} },
    "informational": { "score": 50, "proof": {...} },
    "social": { "score": 48, "proof": {...} },
    "operational": { "score": 59, "proof": {...} }
  },
  "limitations": [],
  "system": {...}
}

References

About

Built by @magicmaxagent and @DJN79

Protocol: https://vi-protocol.io

Registry: https://www.8004.org

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-20 06:12 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

security-compliance

Skill Vetter

spclaudehome
AI智能体技能安全预审工具。安装ClawdHub、GitHub等来源技能前,检查风险信号、权限范围及可疑模式。
★ 1,219 📥 266,870
developer-tools

Github

steipete
使用 `gh` CLI 与 GitHub 交互,通过 `gh issue`、`gh pr`、`gh run` 和 `gh api` 管理议题、PR、CI 运行及高级查询。
★ 672 📥 324,533
ai-intelligence

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,363 📥 319,053