← 返回
数据分析 中文

arXiv Source Reader

Read and analyze arXiv papers by fetching LaTeX source, listing sections, or extracting abstracts
通过获取 LaTeX 源码、列出章节或提取摘要来阅读和分析 arXiv 论文
willamhou
数据分析 clawhub v1.0.5 2 版本 99890.6 Key: 无需
★ 0
Stars
📥 913
下载
💾 23
安装
2
版本
#latest

概述

arxiv-reader

Read and analyze arXiv papers by fetching their public LaTeX source. Converts LaTeX into clean text suitable for LLM analysis.

Description

This skill fetches arXiv papers from the public arXiv API (arxiv.org), flattens LaTeX includes, and returns clean text. No local file access is required — all content is fetched over HTTPS from arXiv's public endpoints and cached in memory for the session.

Network access: Only connects to arxiv.org and export.arxiv.org to download publicly available paper source tarballs and metadata. No other network connections are made. No data is sent to external services — this is read-only.

Caching: Results are cached in memory (process-scoped) for fast repeat access within the same session. No files are written to disk.

Usage Examples

  • "Read the paper 2301.00001 from arXiv"
  • "What sections does paper 2405.12345 have?"
  • "Get the abstract of 2312.09876"
  • "Fetch paper 2301.00001 without the appendix"

Process

  1. Quick look — Use arxiv_abstract to get a paper's abstract before committing to a full read
  2. Survey structure — Use arxiv_sections to understand the paper's outline
  3. Deep read — Use arxiv_fetch to get the full flattened LaTeX for analysis

Tools

arxiv_fetch

Fetch the full flattened LaTeX source of an arXiv paper.

Parameters:

  • arxiv_id (string, required): arXiv paper ID (e.g. 2301.00001 or 2301.00001v2)
  • remove_comments (boolean, optional): Strip LaTeX comments (default: true)
  • remove_appendix (boolean, optional): Remove appendix sections (default: false)
  • figure_paths (boolean, optional): Replace figures with file paths only (default: false)

Returns: { content: string, arxiv_id: string, cached: boolean }

Example:

{ "arxiv_id": "2301.00001", "remove_appendix": true }

arxiv_sections

List all sections and subsections of an arXiv paper.

Parameters:

  • arxiv_id (string, required): arXiv paper ID

Returns: { arxiv_id: string, sections: string[] }

Example:

{ "arxiv_id": "2301.00001" }

arxiv_abstract

Extract just the abstract from an arXiv paper.

Parameters:

  • arxiv_id (string, required): arXiv paper ID

Returns: { arxiv_id: string, abstract: string }

Example:

{ "arxiv_id": "2301.00001" }

Notes

  • Results are cached in memory — repeat requests within the same session are instant
  • Paper IDs support version suffixes (e.g. 2301.00001v2)
  • Very large papers may take 10-30 seconds on first fetch
  • arxiv_abstract uses the public arXiv Atom API for fast metadata retrieval
  • No filesystem writes — all caching is in-memory only
  • Only connects to arxiv.org (read-only, public data)

版本历史

共 2 个版本

  • v1.0.5 当前
    2026-03-29 13:44 安全 安全
  • v1.0.0
    2026-03-11 12:14

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

data-analysis

A股量化 AkShare

mbpz
A股量化数据分析工具,基于AkShare库获取A股行情、财务数据、板块信息等。用于回答关于A股股票查询、行情数据、财务分析、选股等问题。
★ 166 📥 60,253
data-analysis

Data Analysis

ivangdavila
{"answer":"数据分析与可视化。查询数据库、生成报告、自动化电子表格,将原始数据转化为清晰可行的见解。适用于:(1) 您……"}
★ 199 📥 65,259
data-analysis

Excel / XLSX

ivangdavila
创建、检查和编辑 Microsoft Excel 工作簿及 XLSX 文件,支持可靠的公式、日期、类型、格式、重算及模板保留功能。
★ 368 📥 140,856