Arxiv Agentic Verifier

Actively verifies Python/JS code correctness by generating targeted test cases that expose logic flaws based on problem constraints.


wanng-ide

AI Intelligence · clawhub · v1.0.0 · 1 version · 99917.3 · Key: required

★ 0 stars · 📥 1,208 downloads · 💾 41 installs

Version: #latest

Overview

ArXiv Agentic Verifier

Source Paper: Scaling Agentic Verifier for Competitive Coding (ID: 4a4c4dae6a5145ebc4d62eb2d64b0f0f)

Type: Code Verification / Test Generation

Description

This skill implements an "Agentic Verifier" that actively reasons about code correctness by generating targeted, "discriminative" test cases. Instead of random sampling, it analyzes the problem constraints and code logic to find edge cases or logic flaws.
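To make "discriminative" concrete, here is a minimal sketch of the idea: a test input is useful only if it separates a flawed solution from a correct one. The helper `findDiscriminativeInputs`, the trusted reference solution, and the assumption that the problem allows values above 2^53 are all hypothetical and are not taken from the skill's source.

```javascript
// Hypothetical helper: keep only the inputs on which the candidate
// disagrees with a trusted reference solution.
function findDiscriminativeInputs(reference, candidate, inputs) {
  const failures = [];
  for (const input of inputs) {
    const expected = reference(input);
    let actual;
    try {
      actual = candidate(input);
    } catch (err) {
      actual = `error: ${err.message}`;
    }
    if (actual !== expected) {
      failures.push({ input, expected, actual });
    }
  }
  return failures;
}

// Problem: "Given two integers A and B, output their sum."
// Reference solution (assumed correct): exact arithmetic with BigInt.
const reference = (line) => {
  const [a, b] = line.trim().split(/\s+/).map(BigInt);
  return String(a + b);
};

// Flawed candidate: parses with Number, which loses precision above 2^53.
const candidate = (line) => {
  const [a, b] = line.trim().split(/\s+/).map(Number);
  return String(a + b);
};

const inputs = [
  '1 2',                 // typical case, both solutions agree
  '-5 5',                // sign handling, both solutions agree
  '9007199254740993 0',  // above Number.MAX_SAFE_INTEGER, exposes the flaw
];

const failures = findDiscriminativeInputs(reference, candidate, inputs);
console.log(failures);
```

Only the third input distinguishes the two solutions, which is the kind of case random sampling over small values would be unlikely to hit.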

Features

Analyze Code: Understands Python/JS code logic.
Generate Tests: Creates specific inputs to break the code.
Execute & Verify: Runs the code against generated tests (sandbox recommended for production).
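The execute-and-verify step can be sketched roughly as follows. This is not the skill's actual implementation: `runTest` and the verdict names are hypothetical, and the sketch runs a JavaScript program through the current Node.js binary. A Python target would presumably swap in a Python interpreter instead.

```javascript
const { spawnSync } = require('node:child_process');

// Hypothetical runner: executes a JavaScript program in a child Node.js
// process, feeds it one test input on stdin, and compares trimmed stdout
// with the expected output.
function runTest(code, test, timeoutMs = 2000) {
  const result = spawnSync(process.execPath, ['-e', code], {
    input: test.input,
    encoding: 'utf8',
    timeout: timeoutMs,
  });
  if (result.error) {
    // Spawn failure or timeout.
    return { verdict: 'error', detail: result.error.message };
  }
  if (result.status !== 0) {
    return { verdict: 'runtime_error', detail: result.stderr.trim() };
  }
  const actual = result.stdout.trim();
  return actual === test.expected
    ? { verdict: 'pass' }
    : { verdict: 'fail', expected: test.expected, actual };
}

// Program under test: reads two integers from stdin and prints their sum.
const code = `
const [a, b] = require('fs').readFileSync(0, 'utf8').trim().split(/\\s+/).map(Number);
console.log(a + b);
`;

const tests = [
  { input: '1 2\n', expected: '3' },
  { input: '-5 5\n', expected: '0' },
];

const verdicts = tests.map((t) => runTest(code, t));
console.log(verdicts);
```

Returning a structured verdict rather than a boolean keeps the failing input and the observed output together, which is what a reviewer needs to reproduce the flaw.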

Usage

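// Load the skill and pass the OpenAI API key used for LLM reasoning.
const AgenticVerifier = require('./index');
const verifier = new AgenticVerifier(process.env.OPENAI_API_KEY);

// Problem statement and the Python solution to verify.
// Note: this solution calls input() twice, so it fails when A and B
// are given on a single line.
const problem = "Given two integers A and B, output their sum.";
const code = "print(int(input().split()[0]) + int(input().split()[1]))";

// verify(problem, code, language) returns a promise.
verifier.verify(problem, code, 'python')
  .then(result => console.log(result))
  .catch(err => console.error(err));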

Configuration

OPENAI_API_KEY: Required for LLM reasoning.

Security Warning

This skill executes code provided to it. Use in a restricted environment or sandbox.
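As a rough illustration of what "restricted" could mean at minimum, the sketch below runs untrusted JavaScript in a child process with an empty environment (so `OPENAI_API_KEY` is not inherited), a throwaway working directory, a time limit, and an output cap. `runRestricted` is a hypothetical helper and is not part of the skill. These options reduce exposure but are not a real sandbox: the child can still reach the filesystem and network, so a container or VM is still advisable.

```javascript
const { spawnSync } = require('node:child_process');
const fs = require('node:fs');
const os = require('node:os');
const path = require('node:path');

// Hypothetical helper: run untrusted code with a few basic restrictions.
function runRestricted(code, input = '', timeoutMs = 1000) {
  // Throwaway working directory, removed after the run.
  const workDir = fs.mkdtempSync(path.join(os.tmpdir(), 'verifier-'));
  try {
    const result = spawnSync(process.execPath, ['-e', code], {
      input,
      encoding: 'utf8',
      cwd: workDir,
      env: {},                 // do not pass parent secrets to the child
      timeout: timeoutMs,      // kill runaway programs
      maxBuffer: 1024 * 1024,  // cap captured output at 1 MiB
    });
    return {
      timedOut: Boolean(result.error && result.error.code === 'ETIMEDOUT'),
      status: result.status,
      stdout: result.stdout,
    };
  } finally {
    fs.rmSync(workDir, { recursive: true, force: true });
  }
}

// An infinite loop is terminated once the time limit is reached.
const loop = runRestricted('while (true) {}');
console.log(loop.timedOut);
```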

Version History

1 version in total

v1.0.0 (current)

2026-03-29 07:10, status: safe

Security Scan

Tencent Cloud Security (Keen): safe, no risk

Tencent Cloud Security (Sanbu): safe, no risk
