Orchestrate a title-to-arXiv metadata workflow for: $ARGUMENTS
This skill turns a list of paper titles into a reproducible arXiv lookup result:
paper titles
-> Tavily search only
-> canonical arXiv abs URL
-> local arXiv metadata fetch
-> JSONL process log
-> markdown report
The goal is not just to answer once. The goal is to leave behind a reusable work folder that can be resumed or consumed by another workflow.
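The "canonical arXiv abs URL" step in the flow above can be sketched as a small normalizer. This is a minimal illustration, not the skill's own code: the function name `canonical_abs_url` is hypothetical, and dropping the version suffix (`v1`, `v2`, ...) is an assumption about what "canonical" means here.

```python
import re
from typing import Optional
from urllib.parse import urlparse

# New-style IDs (2307.15818) and old-style IDs (cs/0112017), optional version suffix.
_ARXIV_ID = re.compile(r"^(\d{4}\.\d{4,5}|[a-z\-]+(?:\.[A-Z]{2})?/\d{7})(v\d+)?$")


def canonical_abs_url(url: str) -> Optional[str]:
    """Map an arXiv abs/html/pdf URL to https://arxiv.org/abs/<id>, else None."""
    parsed = urlparse(url.strip())
    if parsed.netloc.lower() not in {"arxiv.org", "www.arxiv.org"}:
        return None
    # Split "/pdf/2307.15818v1" into the page kind and the identifier part.
    kind, _, rest = parsed.path.strip("/").partition("/")
    if kind not in {"abs", "html", "pdf"} or not rest:
        return None
    if rest.endswith(".pdf"):
        rest = rest[: -len(".pdf")]
    match = _ARXIV_ID.match(rest)
    if match is None:
        return None
    # Assumption: the version suffix is discarded so all forms collapse to one URL.
    return f"https://arxiv.org/abs/{match.group(1)}"
```

Returning `None` for anything that is not a recognizable arXiv paper URL lets the caller record no_match rather than guess.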
Extract or ask for:
If WORKDIR is omitted, default to:
0_docs/tavily_arxiv_paper_fetech
Accepted title formats:
Write all outputs under WORKDIR:
| Artifact | Purpose |
|---|---|
| input_titles.md | normalized input title list |
| paper_fetches.jsonl | one JSON line per processed title |
| paper_fetch_report.md | rendered report from the JSONL |
Process titles in order. Do not parallelize Tavily calls.
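A strictly sequential driver that logs each title as soon as it finishes might look like the sketch below. It is an illustration under stated assumptions: `run_in_order` and the `lookup` callable are hypothetical (the callable stands in for the Tavily search plus the metadata fetch), and the 1-based `index` is an assumption taken from the numbering in the report format.

```python
import json
from pathlib import Path
from typing import Callable, Iterable


def run_in_order(workdir: str, titles: Iterable[str],
                 lookup: Callable[[str], dict]) -> int:
    """Process titles one at a time and return how many records were written.

    `lookup` is a hypothetical stand-in that returns the per-title fields
    other than index and input_title.
    """
    log_path = Path(workdir) / "paper_fetches.jsonl"
    log_path.parent.mkdir(parents=True, exist_ok=True)

    # Indices already present in the log are skipped, so a stopped run can resume.
    done = set()
    if log_path.exists():
        with log_path.open(encoding="utf-8") as handle:
            done = {json.loads(line)["index"] for line in handle if line.strip()}

    written = 0
    for index, title in enumerate(titles, start=1):
        if index in done:
            continue
        record = {"index": index, "input_title": title, **lookup(title)}
        # Append right away so progress survives a crash or a rate-limit stop.
        with log_path.open("a", encoding="utf-8") as handle:
            handle.write(json.dumps(record, ensure_ascii=False) + "\n")
        written += 1
    return written
```

Because the loop never issues more than one lookup at a time, it respects the no-parallelism rule by construction.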
If Tavily rate limits:
Create WORKDIR.
Write the normalized title list to WORKDIR/input_titles.md.
For each title:
"<paper title>" arXiv
- https://arxiv.org/abs/...
- https://arxiv.org/html/...
- https://arxiv.org/pdf/...
If no reliable arXiv URL is found, record no_match instead of guessing.
Once a reliable arXiv URL is known, run:
python3 ".cursor/skills/tavily-arxiv-paper-fetech/scripts/fetch_arxiv_abs.py" "<arxiv-url>"
This returns compact JSON with:
For each title, immediately append one JSON line to:
WORKDIR/paper_fetches.jsonl
Each line should include at least:
- index
- input_title
- tavily_status
- tavily_error
- arxiv_url
- fetch
Status values:
- ok
- no_match
- error
After all titles are processed, run:
python3 ".cursor/skills/tavily-arxiv-paper-fetech/scripts/jsonl_to_paper_fetch_md.py" \
"WORKDIR/paper_fetches.jsonl" \
"WORKDIR/paper_fetch_report.md"
The rendered report should look like:
# Tavily arXiv Paper Fetech Report
## Results
### 1. Original Title
- Status: ok
- arXiv URL: https://arxiv.org/abs/xxxx.xxxxx
- arXiv ID: xxxx.xxxxx
- Resolved Title: ...
- Authors: ...
- Abstract: ...
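To make the mapping from JSONL records to this report concrete, here is a minimal rendering sketch. It is not the bundled `jsonl_to_paper_fetch_md.py`: the function name `render_report` is hypothetical, the keys read from `fetch` (`arxiv_id`, `title`, `authors`, `abstract`) are assumed, and treating `tavily_status` as the reported Status is also an assumption.

```python
import json


def render_report(jsonl_text: str) -> str:
    """Render JSONL records into a report shaped like the sample above."""
    out = ["# Tavily arXiv Paper Fetech Report", "## Results"]
    for raw in jsonl_text.splitlines():
        if not raw.strip():
            continue  # tolerate blank lines in the log
        rec = json.loads(raw)
        out.append(f"### {rec['index']}. {rec['input_title']}")
        out.append(f"- Status: {rec['tavily_status']}")
        fetch = rec.get("fetch") or {}
        if rec.get("arxiv_url"):
            out.append(f"- arXiv URL: {rec['arxiv_url']}")
            # The keys below are assumed names, not confirmed by the source.
            out.append(f"- arXiv ID: {fetch.get('arxiv_id', '')}")
            out.append(f"- Resolved Title: {fetch.get('title', '')}")
            out.append(f"- Authors: {', '.join(fetch.get('authors', []))}")
            out.append(f"- Abstract: {fetch.get('abstract', '')}")
    return "\n".join(out) + "\n"
```

Records without an arXiv URL (no_match or error) get only the heading and the status line, which keeps unresolved titles visible in the report.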
Keep all outputs under WORKDIR.
- scripts/fetch_arxiv_abs.py: fetch compact metadata from a known arXiv URL
- scripts/jsonl_to_paper_fetch_md.py: render JSONL to markdown
Example invocation:
/tavily-arxiv-paper-fetech "RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control\nOpenVLA: An Open-Source Vision-Language-Action Model --workdir 0_docs/tavily_arxiv_lookup_run_01"