← 返回
未分类

Aml Data Generator

生成符合AMLSim格式的合成交易数据,将交易日志转换为用于反洗钱检测系统测试的模拟数据集,支持按银行ID分割账户、合并多源输出并生成交易网络图。
生成AMLSim格式的合成交易数据,转换为反洗钱检测模拟数据集,支持按银行ID分割账户、合并多源并生成交易网络图。
tangweigang-jpg tangweigang-jpg 来源
未分类 clawhub v0.3.3 1 版本 100000 Key: 无需
★ 0
Stars
📥 393
下载
💾 0
安装
1
版本
#aml#data#doramagic-crystal#finance#latest#ml

概述

AML 数据生成 (aml-data-generator)

> 生成符合AMLSim格式的合成交易数据,将交易日志转换为用于反洗钱检测系统测试的模拟数据集,支持按银行ID分割账户、合并多源输出并生成交易网络图。

Pipeline

data_collection -> data_storage -> factor_computation -> target_selection -> trading_execution -> visualization

Top Use Cases (13 total)

Convert Logs to AML Simulation Data (UC-101)

Convert transaction log files into synthetic AML simulation data for testing anti-money laundering detection systems

Triggers: convert logs, synthetic data, AML simulation

Split Accounts by Bank ID (UC-102)

Partition account CSV files by bank identifier for bank-specific analysis and processing

Triggers: split accounts, bank ID, partition data

Combine AML Simulation Outputs (UC-103)

Aggregate multiple AMLSim output files into a consolidated dataset for comprehensive analysis

Triggers: combine outputs, merge data, AMLSim aggregation

For all 13 use cases, see references/USE_CASES.md.

Execute trigger: When user intent matches intent_router.uc_entries[].positive_terms AND user uses action verb (run/execute/跑/执行/backtest/fetch/collect)

What I'll Ask You

  • Target market: A-share (default), HK, or crypto? (US stocks in ZVT are half-baked — stockus_nasdaq_AAPL exists but coverage is thin)
  • Data source / provider: eastmoney (free, no account), joinquant (account+paid), baostock (free, good history), akshare, or qmt (broker)?
  • Strategy type: MACD golden-cross, MA crossover, volume breakout, fundamental screen, or custom factor?
  • Time range: start_timestamp and end_timestamp for backtest period
  • Target entity IDs: specific stocks (stock_sh_600000) or index components (SZ1000)?

Semantic Locks (Fatal)

IDRuleOn Violation
---------
SL-01Execute sell orders before buy orders in every trading cyclehalt
SL-02Trading signals MUST use next-bar execution (no look-ahead)halt
SL-03Entity IDs MUST follow format entity_type_exchange_codehalt
SL-04DataFrame index MUST be MultiIndex (entity_id, timestamp)halt
SL-05TradingSignal MUST have EXACTLY ONE of: position_pct, order_money, order_amounthalt
SL-06filter_result column semantics: True=BUY, False=SELL, None/NaN=NO ACTIONhalt
SL-07Transformer MUST run BEFORE Accumulator in factor pipelinehalt
SL-08MACD parameters locked: fast=12, slow=26, signal=9halt

Full lock definitions: references/LOCKS.md

Top Anti-Patterns (15 total)

  • AP-REGTECH-001: Missing attribute initialization on data structures
  • AP-REGTECH-002: Self-loops in transaction graphs violate domain rules
  • AP-REGTECH-003: Unvalidated floating-point inputs cause runtime crashes

All 15 anti-patterns: references/ANTI_PATTERNS.md

Evidence Quality Notice

> [QUALITY NOTICE] This crystal was compiled from blueprint finance-bp-060. Evidence verify ratio = 15.9% and audit fail total = 22. Generated results may have uncaptured requirement gaps. Verify critical decisions against source files (LATEST.yaml / LATEST.jsonl).

Reference Files

FileContentsWhen to Load
---------
references/seed.yamlV6+ 全量权威 (source-of-truth)有行为/决策争议时必读
references/ANTI_PATTERNS.md15 条跨项目反模式开始实现前
references/WISDOM.md跨项目精华借鉴架构决策时
references/CONSTRAINTS.mddomain + fatal 约束规则冲突时
references/USE_CASES.md全量 KUC-* 业务场景需要完整示例时
references/LOCKS.mdSL-* + preconditions + hints生成回测/交易代码前
references/COMPONENTS.mdAST 组件地图(按 module 拆分)查 API 时

Compiled by Doramagic crystal-compilation-v6.1 from finance-bp-060 blueprint at 2026-04-22T13:00:18.242568+00:00.

See human_summary.md for non-technical overview.

版本历史

共 1 个版本

  • v0.3.3 当前
    2026-05-03 11:25 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

professional

Akshare Financial Data

tangweigang-jpg
获取A股实时行情、历史K线、财务报表及基金、期货等金融数据,支持股票、债券、期权等多品种查询。
★ 0 📥 1,021
data-analysis

Data Analysis

ivangdavila
{"answer":"数据分析与可视化。查询数据库、生成报告、自动化电子表格,将原始数据转化为清晰可行的见解。适用于:(1) 您……"}
★ 208 📥 68,338
data-analysis

AdMapix

fly0pants
AdMapix 原始数据层,提供广告创意、应用、排名、下载/收入及市场元数据。返回 AdMapix API 的结构化 JSON;调用方...
★ 296 📥 139,666