← 返回
未分类 中文

该 Skill 支持基于关键词的学术论文检索,可以设置发表时间范围与数量 并将论文下载到指定目录,帮助用户快速获取目标领域(如大模型)的最新或代表性研究成果。This skill enables keyword-based academic paper retrieval with optional constraints on time range and result size and download them in the folder.

Search and download related arXiv papers by topic plus date range, or from a seed paper title/id. Use when user asks to crawl related papers, collect arXiv a...
根据主题和日期范围搜索并下载相关的 arXiv 论文,或通过种子论文标题/ID 获取相关论文。当用户请求抓取相关论文、收集 arXiv 文献时使用。
ppingzhang ppingzhang 来源
未分类 clawhub v1.0.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 320
下载
💾 0
安装
1
版本
#latest

概述

arXiv Related Papers Downloader

What this skill does

  • Accepts either:
  • topic + time range, or
  • seed paper (arXiv id or title)
  • Finds related papers from arXiv API.
  • Downloads PDF files into arxiv/.
  • Uses filename format: versionDate-title.pdf, where versionDate is vN_YYYYMMDD.

When to use

Use this skill when the user asks to:

  • crawl/search related papers by topic;
  • find related papers from one article;
  • download arXiv PDFs in batch;
  • save with a deterministic naming rule.

Required user input

The user must provide one of these modes:

  1. Topic mode
    • topic
    • start date (YYYY-MM-DD)
    • end date (YYYY-MM-DD)
  1. Seed paper mode
    • seed arXiv id (preferred) or seed title
    • optional start date / end date

Optional:

  • max results (default: 20)

Execution steps

  1. Confirm missing parameters with the user.
  2. Run the script from workspace root:
# Topic mode
python ./arxiv-related-papers/scripts/download_arxiv.py \
  --topic "graph neural network" \
  --start-date 2024-01-01 \
  --end-date 2024-12-31 \
  --max-results 20

# Seed mode by arXiv id
python ./arxiv-related-papers/scripts/download_arxiv.py \
  --seed-id "2401.12345v1" \
  --max-results 20

# Seed mode by title
python ./arxiv-related-papers/scripts/download_arxiv.py \
  --seed-title "Attention Is All You Need" \
  --start-date 2018-01-01 \
  --end-date 2024-12-31 \
  --max-results 20
  1. Report back:
    • how many papers were found;
    • how many PDFs were downloaded;
    • the output directory path.

Output location and naming

  • Output dir: ./arxiv/ (auto-created if missing)
  • File naming rule:
  • v1_20240213-Your_Paper_Title.pdf
  • v3_20231105-Your_Paper_Title.pdf

Notes

  • The script only uses Python standard library.
  • If a paper has no PDF link or download fails, it is skipped with a warning.
  • Existing files are not downloaded again.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-05-07 23:17 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

professional

论文关键词搜索和自动下载到指定目录 Keyword search for papers and automatic download!

ppingzhang
根据主题和日期范围搜索并下载相关的 arXiv 论文,或通过种子论文标题/ID 获取相关论文。当用户请求抓取相关论文、收集 arXiv 文献时使用。
★ 1 📥 555
knowledge-management

Obsidian

steipete
操作 Obsidian 仓库(纯 Markdown 笔记)并通过 obsidian-cli 自动化。
★ 444 📥 104,827
knowledge-management

web-tools-guide

user_ec205dbb
MANDATORY before calling web_search, web_fetch, browser, or opencli. Contains required error-handling procedures (web_se
★ 66 📥 158,968