← 返回
未分类 Key 中文

Scraper Airbnb Product By Searchurl

Prepare Dataify builder requests for the airbnb.com scraper family rooted at airbnb_product_by-searchurl. Use when needs to work with the successful Dataify...
为airbnb.com抓取器系列(根基于airbnb_product_by-searchurl)准备Dataify构建请求,用于需要与成功的Dataify配合工作。
dataify-server dataify-server 来源
未分类 clawhub v1.0.0 1 版本 100000 Key: 需要
★ 0
Stars
📥 61
下载
💾 1
安装
1
版本
#latest

概述

---

name: "dataify-airbnb-product-by-searchurl"

description: "Prepare Dataify builder requests for the airbnb.com scraper family rooted at airbnb_product_by-searchurl. Use when needs to work with the successful Dataify scraper detail entry for airbnb_product_by-searchurl, let the user choose one of its available tools, read saved getToolParams options, and generate a scraperapi.dataify.com/builder curl request with DATAIFY_API_TOKEN."


Dataify Builder Skill

Use this skill to prepare Dataify builder requests for the scraper family rooted at airbnb_product_by-searchurl on airbnb.com.

Workflow

  1. Check whether DATAIFY_API_TOKEN exists in the environment.
  2. If the token is missing, stop and tell the user to sign in at Dataify Dashboard to obtain it.
  3. Ask the user to choose exactly one tool from the following Chinese list:
    • 搜索URL (airbnb_product_by-searchurl)
    • 位置 (airbnb_product_by-location)
  4. Read references/tool-params.json and find the chosen tool by tool_sign or Chinese tool name.
  5. For each parameter in the chosen tool:
    • If input_mode is user_input, ask the user for the value.
    • If input_mode is select, present the saved options to the user.
  6. Use scripts/build-dataify-request.py as the default cross-platform helper.
  7. Use scripts/build-dataify-request.ps1 as the Windows PowerShell helper when needed.
  8. When a selectable parameter has a human-readable Chinese label, keep that label in spider_parameters. Do not replace it with a code such as HK unless the user explicitly asks for the coded value.
  9. Build spider_parameters as a JSON array.
  10. If every parameter has only one final value, build one object such as [{"searchurl":"...","country":"Hong Kong"}].
  11. If one or more parameters have multiple aligned values, zip them by index and build one object per row. Example: [{"search_url":"url1","page_turning":"1","max_num":"15"},{"search_url":"url2","page_turning":"1","max_num":"15"}].
  12. If a parameter has one value while another parameter has multiple values, reuse the single value across every generated row.
  13. Set spider_name to airbnb.com.
  14. Set spider_id to the selected tool's tool_sign.
  15. Always include spider_errors=true and file_name={{TasksID}}.
  16. Return a curl command for https://scraperapi.dataify.com/builder.

Set DATAIFY_API_TOKEN

Prefer a permanent environment-variable setup instead of setting the token only for the current terminal session.

Windows PowerShell, permanent for the current user:

[Environment]::SetEnvironmentVariable("DATAIFY_API_TOKEN", "your_token_here", "User")

Then reopen PowerShell. If the current session also needs the token immediately, run:

$env:DATAIFY_API_TOKEN = "your_token_here"

macOS or Linux, permanent for bash:

echo 'export DATAIFY_API_TOKEN="your_token_here"' >> ~/.bashrc
source ~/.bashrc

macOS or Linux, permanent for zsh:

echo 'export DATAIFY_API_TOKEN="your_token_here"' >> ~/.zshrc
source ~/.zshrc

Script usage

Python:

python scripts/build-dataify-request.py --tool-sign <selected_tool_sign> --values-file values.json

PowerShell:

& ".\scripts\build-dataify-request.ps1" -ToolSign "<selected_tool_sign>" -ValuesFile ".\values.json"

The values.json file should contain either one object or an array of objects. Example:

[{"searchurl":"https://www.airbnb.com/s/Greece/homes?...","country":"Hong Kong"}]

Required output shape

Generate a curl command in this form:

curl -X POST 'https://scraperapi.dataify.com/builder' \
  -H "Authorization: Bearer $DATAIFY_API_TOKEN" \
  -H 'Content-Type: application/x-www-form-urlencoded' \
  -d 'spider_name=airbnb.com' \
  -d 'spider_id=<selected_tool_sign>' \
  -d 'spider_parameters=[{"param":"value"}]' \
  -d 'spider_errors=true' \
  -d 'file_name={{TasksID}}'

Reference usage

  • references/tool-params.json stores the full saved parameter catalog for every available tool in this scraper family.
  • scripts/build-dataify-request.py is the portable implementation and should be preferred.
  • scripts/build-dataify-request.ps1 mirrors the same behavior for Windows users.
  • If a parameter has no options, the user must provide the value.
  • If a parameter has options, present those options back to the user before building the final request.
  • Do not assume spider_parameters always contains exactly one object. Multi-value tools may require multiple objects zipped by index.
  • Use the saved url_example only as a reference example. Do not assume the user wants the example values unless they explicitly confirm them.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-06-09 19:39

安全检测

腾讯云安全 (Keen)

队列中

腾讯云安全 (Sanbu)

队列中

🔗 相关推荐

data-analysis

Tavily 搜索

jacky1n7
通过 Tavily API 进行网页搜索(Brave 替代方案)。当用户要求搜索网页、查找来源或链接,且 Brave 网页搜索不可用时使用。
★ 273 📥 100,333
data-analysis

AdMapix

fly0pants
AdMapix 原始数据层,提供广告创意、应用、排名、下载/收入及市场元数据。返回 AdMapix API 的结构化 JSON;调用方...
★ 297 📥 140,624
data-analysis

Stock Watcher

robin797860
管理和监控个人股票自选列表,支持利用同花顺数据添加、删除、列出股票及汇总近期表现。适用于用户希望追踪特定股票、获取表现汇总或管理自选列表时。
★ 112 📥 46,176