Authenticated Paper Fetcher

Download academic PDFs through authorized institutional access, library proxies, local browser profiles, or cloud browser sessions without storing credentials.

Authenticated Paper Fetcher helps Codex retrieve individual academic articles, papers, book chapters, and PDFs from publisher platforms such as SpringerLink, ScienceDirect, Wiley, Taylor & Francis, and JSTOR, through library and campus authentication systems including EZproxy, OpenAthens, and Shibboleth, or through a user-provided authenticated browser or cloud browser session. Use it when the user already has legitimate access (for example, they can log in with a school account and download the paper themselves) and wants Codex to fetch the PDF, preserve metadata, file it in the project directory, and keep the workflow reproducible. The skill avoids handling passwords and does not bypass access controls.
user_bcff01f1
Uncategorized community v1.0.0 1 version 100000 Key: not required

Overview

Use this skill to fetch individual academic PDFs through access the user already has: open access, institutional library proxy, an authenticated local browser profile, or a user-provided remote browser/CDP session.

Boundaries

  • Proceed only for content the user is authorized to access or that is open access.
  • Never ask for or store passwords, SSO secrets, 2FA codes, session cookies, or publisher API keys in chat.
  • Do not bypass paywalls, CAPTCHAs, rate limits, DRM, robots controls, or account restrictions.
  • Do not do bulk downloading unless the user confirms the library/publisher license permits it. Prefer official TDM APIs for mining-scale requests.
  • Treat browser profiles, CDP endpoints, and downloaded PDFs as sensitive. Do not print tokens or cookie values.
  • Before using a cloud browser for university login, tell the user to confirm their school permits entering SSO credentials into that provider.
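
One way to honor the "do not print tokens" rule when logging is to strip anything that could carry a secret from a URL before it is printed. The sketch below is illustrative only: `redactUrl` is a hypothetical helper, not part of scripts/fetch-paper.mjs, and it assumes secrets live in the userinfo, query string, or fragment. Some cloud browser providers embed tokens in the path instead, in which case the whole endpoint should be withheld.

```javascript
// Hypothetical helper (not part of the skill's script): make a URL safe to log.
// Assumption: secrets may appear as userinfo, query parameters, or the fragment.
function redactUrl(raw) {
  let url;
  try {
    url = new URL(raw);
  } catch {
    return '<unparseable-url>'; // never echo the raw input back
  }
  const hadQuery = url.search !== '';
  url.username = '';
  url.password = '';
  url.search = '';
  url.hash = '';
  // Keep a marker so the reader knows parameters were present but hidden.
  return hadQuery ? `${url.href}?<redacted>` : url.href;
}

console.log(redactUrl('wss://user:secret@browser.example.com/session?token=abc123'));
```

Returning a fixed placeholder for unparseable input is deliberate: echoing the raw string back could leak a malformed but still secret-bearing value.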

Preferred Workflow

  1. Normalize the request to a DOI, publisher URL, or library permalink.
  2. If the user gives an open-access URL or DOI, try normal direct retrieval first.
  3. If institutional access is needed, prefer a local persistent browser profile:

node <skill-dir>/scripts/fetch-paper.mjs --url "<url>" --out papers --pause-for-login

  4. If the user has a school proxy prefix, pass it explicitly:

node <skill-dir>/scripts/fetch-paper.mjs --doi "<doi>" --proxy-prefix "https://ezproxy.example.edu/login?url=" --out papers --pause-for-login

  5. If the user provides a cloud browser or remote Chrome CDP endpoint, set PAPER_FETCH_CDP_ENDPOINT or use --cdp. Read references/cloud-browser-options.md first.
  6. For SpringerLink URLs, the helper will try page PDF links and the usual link.springer.com/content/pdf/<doi>.pdf pattern after the authenticated page loads.
  7. Save the PDF and sidecar metadata JSON. Report the saved path, final article URL, and any entitlement/login problem.
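
The normalization, proxy-prefix, and SpringerLink steps in the workflow above can be sketched as small pure functions. This is a minimal illustration, not the code inside scripts/fetch-paper.mjs: every function name here is hypothetical, the DOI pattern is a simplification, and the proxy helper assumes an EZproxy-style `login?url=` prefix that takes the target URL appended as-is.

```javascript
// Illustrative sketch only; these helpers are hypothetical and are not the
// actual implementation inside scripts/fetch-paper.mjs.

// Reduce user input to either a bare DOI or an absolute URL.
function normalizeRequest(input) {
  const text = input.trim();
  // Accept "doi:10.x/y", "https://doi.org/10.x/y", or a bare "10.x/y".
  const match = text.match(/^(?:doi:\s*|https?:\/\/(?:dx\.)?doi\.org\/)?(10\.\d{4,9}\/\S+)$/i);
  if (match) return { kind: 'doi', value: match[1] };
  const url = new URL(text); // throws on anything that is not an absolute URL
  return { kind: 'url', value: url.href };
}

// Resolve a normalized request to the URL the browser should open first.
function landingUrl(request) {
  return request.kind === 'doi' ? `https://doi.org/${request.value}` : request.value;
}

// Assumption: an EZproxy-style prefix ("...login?url=") with the target appended unencoded.
function withProxyPrefix(targetUrl, proxyPrefix) {
  return proxyPrefix ? proxyPrefix + targetUrl : targetUrl;
}

// Direct-PDF candidate for SpringerLink, to be tried only after the
// authenticated article page has loaded.
function springerPdfCandidate(doi) {
  return `https://link.springer.com/content/pdf/${doi}.pdf`;
}

const request = normalizeRequest('doi:10.1007/s00134-020-06033-2');
console.log(withProxyPrefix(landingUrl(request), 'https://ezproxy.example.edu/login?url='));
console.log(springerPdfCandidate(request.value));
```

Keeping these steps free of browser calls makes them easy to check in isolation; DOIs containing unusual characters may need percent-encoding before being placed in a URL, which this sketch does not attempt.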

Helper Script

Use scripts/fetch-paper.mjs for repeatable retrieval.

Examples:

node <skill-dir>/scripts/fetch-paper.mjs --doi "10.1007/s00134-020-06033-2" --out papers --pause-for-login
node <skill-dir>/scripts/fetch-paper.mjs --url "https://link.springer.com/article/10.1007/s00134-020-06033-2" --out papers --headless
$env:PAPER_FETCH_CDP_ENDPOINT="wss://<redacted-remote-browser-endpoint>"
node <skill-dir>/scripts/fetch-paper.mjs --url "https://link.springer.com/article/<doi>" --out papers

If Node reports that Playwright is missing, ask permission before installing dependencies. Typical local setup:

npm install --save-dev playwright
npx playwright install chromium

For cloud-only CDP usage, playwright-core may be sufficient if the provider supplies the browser:

npm install --save-dev playwright-core

Handling Login

  • If the script says access is unavailable, ask the user to log in through the opened local or cloud browser session, then rerun the same command.
  • Use --pause-for-login only when the user is ready to complete SSO in the browser.
  • Use --login-only to warm the profile/session without downloading.
  • Do not automate 2FA, CAPTCHA solving, hidden proxy rotation, or anti-bot evasion.

When Retrieval Fails

Report the exact non-sensitive cause:

  • no PDF link found on the authenticated page
  • HTTP status such as 403, 401, or 404
  • publisher says the article is not included in the user's entitlement
  • Playwright/browser dependency is unavailable
  • cloud browser endpoint is expired or not connected

Then suggest a lawful next path: user reauthenticates, provides an EZproxy/OpenAthens URL, uses a library permalink, uses an official publisher API/TDM route, or manually supplies the PDF.
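
The reporting rule above can be sketched as a mapping from a failed attempt to a non-sensitive cause plus suggested next steps. This is a hypothetical illustration: `describeFailure`, the field names on `result`, and the pairing of each cause with particular next steps are assumptions, not behavior confirmed for scripts/fetch-paper.mjs.

```javascript
// Hypothetical sketch: field names on `result` are assumptions for illustration.
// Next steps are taken from the lawful options listed in this document.
const NEXT_STEPS = {
  install: 'ask permission before installing Playwright',
  reauthenticate: 'log in again in the opened browser session, then rerun the same command',
  proxyUrl: 'provide an EZproxy/OpenAthens URL or a library permalink',
  officialApi: 'use an official publisher API/TDM route',
  manual: 'supply the PDF manually',
};

function describeFailure(result) {
  const { install, reauthenticate, proxyUrl, officialApi, manual } = NEXT_STEPS;
  if (result.dependencyMissing) {
    return { cause: 'Playwright/browser dependency is unavailable', next: [install] };
  }
  if (result.endpointDisconnected) {
    return { cause: 'cloud browser endpoint is expired or not connected', next: [reauthenticate] };
  }
  if (result.notEntitled) {
    return {
      cause: "publisher says the article is not included in the user's entitlement",
      next: [proxyUrl, officialApi, manual],
    };
  }
  if (typeof result.status === 'number' && result.status >= 400) {
    // Report only the status code; never include cookies, tokens, or full URLs.
    const next = result.status === 401 ? [reauthenticate] : [reauthenticate, proxyUrl, manual];
    return { cause: `HTTP status ${result.status}`, next };
  }
  if (!result.pdfLinkFound) {
    return { cause: 'no PDF link found on the authenticated page', next: [proxyUrl, manual] };
  }
  return { cause: 'unknown failure', next: [manual] };
}

console.log(describeFailure({ status: 403 }).cause);
```

The returned object contains only fixed strings and a status code, so it can be shown to the user without risk of echoing session details.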

Version history

1 version in total

  • v1.0.0 Initial release (current)
    2026-04-22 10:51, security status: safe

Security checks

  • Tencent Cloud Security (Keen): safe, no risk
  • Tencent Cloud Security (Sanbu): safe, no risk
