Lowyat Forum Research

Research any topic by searching Lowyat forum (forum.lowyat.net), scraping relevant threads into Excel, and analyzing the results. Use this skill whenever the...
Author: superoo7
Uncategorized · clawhub · v1.0.2 · 1 version · 99740.3 · Key: not required

Overview

Lowyat Forum Research Tool

End-to-end research pipeline: Search → Scrape → Analyze

Workflow

Step 1: Understand the user's research topic

  • Ask clarifying questions if needed (e.g. what specifically they want to learn)
  • Break the topic into 3-5 search keyword variations

Step 2: Search for relevant threads

  • Use WebSearch with site:forum.lowyat.net to find threads
  • Use allowed_domains: ["forum.lowyat.net"] to filter results
  • Run multiple searches in parallel with different keyword angles
  • Present the most relevant threads to the user with titles and URLs
  • Let the user pick which threads to scrape, or recommend the best ones
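
The query-building part of this step can be sketched in Python. This is a minimal sketch under assumptions: `build_queries` is a hypothetical helper, not part of the skill, and the search itself is still run through WebSearch.

```python
SITE_FILTER = "site:forum.lowyat.net"


def build_queries(keyword_variations):
    """Prefix each keyword variation with the site filter.

    Blank entries and case-insensitive duplicates are dropped so the
    same search is not issued twice.
    """
    seen = set()
    queries = []
    for keywords in keyword_variations:
        cleaned = " ".join(keywords.split())  # collapse stray whitespace
        key = cleaned.lower()
        if not cleaned or key in seen:
            continue
        seen.add(key)
        queries.append(f"{SITE_FILTER} {cleaned}")
    return queries
```

Each returned string can be passed as one WebSearch query, with the 3-5 variations from Step 1 as input.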

Step 3: Scrape the selected threads

  • The scraper script (datascraping.py) should be in the project root
  • Install Python dependencies:
      pip install requests beautifulsoup4 html5lib openpyxl tqdm
    Or, if you have uv installed:
      uv sync
  • Run the scraper for each thread:
      python datascraping.py <TOPIC_URL>
  • IMPORTANT: Do NOT include /all or /+N suffixes in the URL — just use the base topic URL (e.g. https://forum.lowyat.net/topic/5411252)
  • To scrape multiple threads, run them sequentially (one at a time) to avoid overloading the server. Run up to 3 in parallel only if the user explicitly asks for speed, using shell background jobs (&) followed by wait
  • Output: .xlsx files with columns: Name, Date, Comment
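
The URL rule and the one-at-a-time default above could be wrapped in a small driver. This is a sketch only: `base_topic_url` and `scrape_sequentially` are hypothetical helper names, and the only thing assumed about datascraping.py is the command-line form shown above.

```python
import re
import subprocess
import sys

# Matches a trailing "/all" or "/+N" segment on a topic URL.
_SUFFIX = re.compile(r"/(?:all|\+\d+)$")


def base_topic_url(url):
    """Strip a trailing /all or /+N suffix, leaving the base topic URL."""
    url = url.strip().rstrip("/")
    return _SUFFIX.sub("", url)


def scrape_sequentially(urls, script="datascraping.py"):
    """Run the scraper once per thread, waiting for each run to finish."""
    for url in urls:
        # check=True stops the loop if one scraper run exits with an error.
        subprocess.run([sys.executable, script, base_topic_url(url)], check=True)
```

Because `subprocess.run` blocks until the child process exits, the threads are scraped strictly one after another.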

Step 4: Analyze the scraped data

  • Read the scraped .xlsx files to understand the forum discussions
  • Synthesize findings across all threads into a structured summary
  • Organize insights by the user's research questions
  • Include: consensus opinions, brand recommendations, price ranges, warnings, and specific user experiences
  • Cite which thread/user said what when relevant
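
A first pass over the scraped files might count how often candidate brands or terms appear and pull a few quotable posts. The sketch below assumes each workbook has a header row followed by Name, Date, Comment columns on the active sheet. The function names are hypothetical, and substring matching is a deliberately crude stand-in for real analysis.

```python
from collections import Counter


def load_rows(xlsx_path):
    """Read (Name, Date, Comment) rows, skipping the assumed header row."""
    # Imported lazily so the helpers below work without openpyxl installed.
    from openpyxl import load_workbook

    sheet = load_workbook(xlsx_path, read_only=True).active
    return [row[:3] for row in sheet.iter_rows(min_row=2, values_only=True)]


def count_mentions(rows, terms):
    """Count how many comments mention each term (case-insensitive)."""
    counts = Counter()
    for _name, _date, comment in rows:
        text = str(comment or "").lower()
        for term in terms:
            if term.lower() in text:
                counts[term] += 1
    return counts


def quotes_for(rows, term, limit=3):
    """Return up to `limit` rows mentioning a term, for citing who said what."""
    hits = [r for r in rows if term.lower() in str(r[2] or "").lower()]
    return hits[:limit]
```

The counts show which brands dominate a thread, and `quotes_for` supplies the username and date needed to cite a specific post.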

Scraper Details

  • Forum uses 20 posts per page, paginated via /+N URL suffix
  • Scraper auto-detects total pages and crawls all of them
  • Random 0.5–2s delay between page requests
  • Saves incrementally after each page — safe to interrupt
  • If .xlsx already exists, it resumes by appending
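
The pagination and delay behaviour can be illustrated as follows. This is not the code from datascraping.py. It is a sketch that assumes the N in /+N is a post offset in multiples of 20 (so page 2 is /+20), which the description above does not state explicitly.

```python
import random
import time

POSTS_PER_PAGE = 20  # from the scraper details above


def page_urls(base_url, total_pages):
    """List every page URL: page 1 is the base URL, later pages add /+offset.

    Assumption: the offset is (page index) * POSTS_PER_PAGE.
    """
    base = base_url.rstrip("/")
    urls = [base]
    for page in range(1, total_pages):
        urls.append(f"{base}/+{page * POSTS_PER_PAGE}")
    return urls


def polite_pause(low=0.5, high=2.0):
    """Sleep for a random interval between page requests; return the delay."""
    delay = random.uniform(low, high)
    time.sleep(delay)
    return delay
```

Calling `polite_pause()` between fetches of consecutive `page_urls(...)` entries would reproduce the random 0.5 to 2 second spacing described above.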

Tips for good searches

  • Use brand names: site:forum.lowyat.net Toto toilet
  • Use Malay keywords too: site:forum.lowyat.net kipas exhaust tandas
  • Add "recommendation" or "review": site:forum.lowyat.net water heater recommendation
  • Search by location: site:forum.lowyat.net bathroom shop KL Selangor
  • Try year filters for recency: site:forum.lowyat.net smart toilet 2024 2025

Example usage

User: "I want to research mechanical keyboards on Lowyat"

  1. Search with variations: mechanical keyboard recommendation, cherry mx switch, keychron Malaysia, custom keyboard
  2. Present top threads to user
  3. Scrape selected threads one at a time
  4. Read the xlsx files and provide analysis: popular brands, price ranges, where to buy, common complaints

Disclaimer

Scraped data contains publicly available usernames, dates, and comments from forum.lowyat.net. This tool is intended for personal research purposes only. Users are responsible for how they store, share, and use the scraped data in compliance with applicable privacy laws and Lowyat forum's terms of service.

Version history

1 version in total

  • v1.0.2 (current)
    2026-05-07 06:57 · Safe · Safe

Security check

Tencent Cloud Security (Keen)

Safe, no risk

Tencent Cloud Security (Sanbu)

Safe, no risk
