This skill guides you on how to extract visual features and prompts from product main images using multimodal AI, helping e-commerce sellers turn unstructured image data into structured, actionable insights.
This tool performs deep visual analysis on product main images (and optionally additional images) from a product list. It uses a multimodal AI model to identify specific visual dimensions based on a natural language instruction, such as color, shape, style, material, or specific selling-point elements.
How it works: You provide a list of products (with image URLs) and a natural language prompt describing what to extract. The tool automatically iterates over all products, analyzes each image, and returns structured attribute data (attributeName + attributeValue) appended to each product record.
Row expansion: When extracting multiple dimensions in a single request (e.g., both color and shape), each original product row is duplicated per dimension, resulting in one row per product per attribute.
| Parameter | Required | Description |
|-----------|----------|-------------|
| productImageAnalysisPrompt | Yes | Natural language instruction describing what visual information to extract from the images. Be specific about the dimensions you want (color, material, shape, style, pendant type, etc.). |
| analyzeAdditionalImages | No | Whether to also analyze additional product images beyond the main image. Defaults to false. |
| refResultData | No | Reference data from a previous step, containing the product list to analyze. Must be a JSON string with a products array. |
| userInput | No | Supplementary user input for additional context. |
| Goal | Example Prompt |
|------|---------------|
| Extract dominant color | "Analyze each product's main image and extract the primary color of the product" |
| Identify material | "From each product's main image, identify the apparent material (plastic, metal, wood, fabric, etc.)" |
| Classify pendant shape | "Analyze each product's main image and identify the shape of the pendant/charm (round, heart, star, etc.)" |
| Detect style | "Extract the overall style of each product from its main image (minimalist, vintage, bohemian, industrial, etc.)" |
| Reverse-engineer image prompt | "Based on the main image, infer the likely AI-generation prompt or visual description that could reproduce this image" |
| Multi-dimension extraction | "From each main image, extract both the dominant color and the overall product shape" |
POST /multimodal/extractPromptsFromMainImage(完整参数/响应/错误码见 references/api.md)
python scripts/multimodal_extract_attributes.py '' [--inline]
输出策略(脚本默认行为):
/linkfox///data/linkfox-multimodal-extract-attributes-.json ( 为脚本执行时的工作目录,在 Claude Code 里即当前项目目录; 取自环境变量 SESSION_ID,按用户任务自动聚合;禁止写入 /tmp,当前目录不可写则报错)
total/costToken、最大列表字段的长度 + 前 3 条样本)
--inline 强制全量打印到 stdout(同样落盘)
读数据建议:先看摘要判断是否足够;需要具体字段时优先用 jq或ConvertFrom-Json 从保存的 json 文件按需抽取,避免整份 JSON 进入上下文。
The response enriches the original product list with extracted attributes:
attributeName (the dimension extracted, e.g., "color") and attributeValue (the extracted value, e.g., "red"). One record per product per attribute dimension.
products array containing image URLs. It depends on upstream data from a prior step.
refResultData or resource references.
Applicable -- Visual feature extraction and image analysis for product listings:
| User Says | Scenario |
|-----------|----------|
| "What colors are these products" | Dominant color extraction |
| "Analyze the product images", "Look at the main photos" | General visual feature extraction |
| "What material does it look like" | Material identification |
| "What shapes/styles are popular" | Shape or style classification |
| "Reverse the image prompt", "What prompt made this image" | Image prompt reverse-engineering |
| "Group products by visual appearance" | Visual attribute grouping & statistics |
| "Extract features from the product photos" | Structured attribute extraction |
Not applicable -- Needs beyond image-based visual analysis:
Feedback:
Auto-detect and report feedback via the Feedback API when any of the following apply:
Call the feedback API as specified in references/api.md. Do not interrupt the user's flow.
For more high-quality, professional cross-border e-commerce skills, set LinkFox Skills.
共 2 个版本