← 返回
未分类 中文

Genai Toolkit

Bridge AI models to databases through MCP with config and evaluation tools. Use when setting up DB tools, comparing engines, or evaluating prompt quality.
通过MCP协议将AI模型与数据库连接,提供配置和评估工具。适用于数据库工具设置、引擎对比或提示词质量评估。
xueyetianya xueyetianya 来源
未分类 clawhub v1.0.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 542
下载
💾 2
安装
1
版本
#latest

概述

Genai Toolkit

Genai Toolkit v2.0.0 — an AI toolkit for managing generative AI workflows from the command line. Log configurations, benchmarks, prompts, evaluations, fine-tuning runs, cost tracking, and optimization notes. Each entry is timestamped and persisted locally. Works entirely offline — your data never leaves your machine.

Why Genai Toolkit?

  • Works entirely offline — your data never leaves your machine
  • Simple command-line interface with no GUI dependency
  • Export to JSON, CSV, or plain text at any time for sharing or archival
  • Automatic activity history logging across all commands
  • Each domain command doubles as both a logger and a viewer

Commands

Domain Commands

Each domain command works in two modes: log mode (with arguments) saves a timestamped entry, view mode (no arguments) shows the 20 most recent entries.

CommandDescription
----------------------
genai-toolkit configure Log a configuration note such as model parameters, API keys, or environment settings. Use this to record setup changes and track which configurations were active during experiments.
genai-toolkit benchmark Log a benchmark result or performance observation. Record latency, throughput, accuracy, or other metrics to compare across runs and model versions.
genai-toolkit compare Log a comparison note between models, configurations, or approaches. Useful for side-by-side evaluations like GPT-4 vs Claude on specific tasks.
genai-toolkit prompt Log a prompt template or prompt engineering note. Track iterations on prompt design, record what worked, and document prompt versioning.
genai-toolkit evaluate Log an evaluation result or quality metric. Record accuracy scores, F1 metrics, human ratings, or any qualitative assessment of model outputs.
genai-toolkit fine-tune Log a fine-tuning run or hyperparameter note. Track epochs, learning rates, dataset sizes, and resulting model performance after fine-tuning.
genai-toolkit analyze Log an analysis observation or insight. Record patterns found in data, failure mode analysis, or trends across experiments.
genai-toolkit cost Log cost tracking data including API costs, compute expenses, and token consumption. Essential for budget monitoring across projects and providers.
genai-toolkit usage Log usage metrics or consumption data. Track request volumes, token counts, rate limit encounters, and daily/monthly consumption patterns.
genai-toolkit optimize Log optimization attempts or performance improvements. Record what was changed, the expected vs actual impact, and next steps.
genai-toolkit test Log test results or test case notes. Record pass/fail outcomes, edge cases discovered, and regression test results.
genai-toolkit report Log a report entry or summary finding. Capture weekly summaries, milestone reports, or executive-level findings from AI workflows.

Utility Commands

CommandDescription
----------------------
genai-toolkit statsShow summary statistics across all log files, including entry counts per category and total data size on disk.
genai-toolkit export Export all data to a file in the specified format. Supported formats: json, csv, txt. Output is saved to the data directory.
genai-toolkit search Search all log entries for a term using case-insensitive matching. Results are grouped by log category for easy scanning.
genai-toolkit recentShow the 20 most recent entries from the unified activity log, giving a quick overview of recent work across all commands.
genai-toolkit statusHealth check showing version, data directory path, total entry count, disk usage, and last activity timestamp.
genai-toolkit helpShow the built-in help message listing all available commands and usage information.
genai-toolkit versionPrint the current version (v2.0.0).

Data Storage

All data is stored locally at ~/.local/share/genai-toolkit/. Each domain command writes to its own log file (e.g., configure.log, benchmark.log). A unified history.log tracks all actions across commands. Use export to back up your data at any time.

Requirements

  • Bash (4.0+)
  • No external dependencies — pure shell script
  • No network access required

When to Use

  • Tracking AI model benchmarks and comparisons across different providers and versions over time
  • Logging prompt engineering iterations to understand what improvements actually moved the needle
  • Monitoring API costs and token usage across multiple projects and billing periods
  • Evaluating fine-tuning experiments with detailed hyperparameter and metric tracking
  • Building a searchable knowledge base of optimization attempts and analysis insights

Examples

# Log a benchmark result
genai-toolkit benchmark "GPT-4o latency: avg 1.2s, p99 3.8s on summarization task, 500 samples"

# Track a cost entry
genai-toolkit cost "March batch processing: $42.50 across 15k requests, avg $0.0028/req"

# Compare two models
genai-toolkit compare "Claude 3.5 vs GPT-4o on code generation — Claude 15% faster, GPT-4o 5% more accurate"

# Log a prompt iteration
genai-toolkit prompt "v3: Added chain-of-thought instruction, reduced hallucination rate from 12% to 3%"

# Record a fine-tuning run
genai-toolkit fine-tune "SQL-gen model epoch 5: accuracy=0.96, loss=0.12, lr=2e-5, dataset=50k rows"

# View all statistics
genai-toolkit stats

# Export everything to JSON
genai-toolkit export json

# Search for entries mentioning latency
genai-toolkit search latency

# Check recent activity
genai-toolkit recent

# Health check
genai-toolkit status

Powered by BytesAgain | bytesagain.com | hello@bytesagain.com

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-31 00:27 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-agent

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,406 📥 324,518
ai-agent

Agent Browser

rez0
用于 AI 代理的浏览器自动化 CLI。当用户需要与网站交互(包括浏览页面、填写表单、点击按钮、截图等)时使用。
★ 843 📥 322,700
ai-agent

Find Skills

guipi888
场景驱动+关键词双模式技能发现工具。当用户用自然语言描述场景/需求(如"我想做一个海报""帮我分析股票"),或明确说"安装技能/find skills/找个skill"时,自动从官方内置、本地已安装、SkillHub、虾评、GitHub、C
★ 1,490 📥 553,763