MINERU_API_KEY is configured in the current environment. Explain the purpose: to run scripts/pdf_to_md.py to convert paper PDFs into Markdown (the terminal must be able to run python or python3). If Python is missing, help the user install it. If MINERU_API_KEY is missing, guide the user to https://mineru.net/apiManage/docs to obtain an API key.introduction: only return Markdown content that strictly matches the first-level # Introduction headingall: return the full Markdown content converted from the entire paperDo not start this skill's core workflow until the above confirmations are complete.
https://www.alphaxiv.org/?sort=Hot&interval=7+Dayshttps://www.alphaxiv.org/?source=GitHub&interval=7+Days&sort=Hothttps://github.com/trending?since=weekly10 entries from each data source by default.When retrieving paper lists, project lists, paper links, and arXiv IDs from web pages, try the following in order:
https://r.jina.ai/example.com to read the page as Markdown.arXiv ID from the page content.arXiv ID is not required.arXiv IDpending entries sequentially. Do not launch subagents.attempt += 1.status = "done" and write completedAtstatus = "failed" and write error and completedAtDirectly complete content retrieval, analysis, classification, and result aggregation for each entry.
If source is arxiv:
arXiv ID.arXiv ID is obtained, construct the paper PDF link:https://arxiv.org/pdf/{arxiv-id}.pdf--range:python scripts/pdf_to_md.py https://arxiv.org/pdf/{arxiv-id}.pdf tmp_llm_research/{arxiv-id}.md --range introductionpython scripts/pdf_to_md.py https://arxiv.org/pdf/{arxiv-id}.pdf tmp_llm_research/{arxiv-id}.md --range all--range parameter description:introduction: only return Markdown content that strictly matches the first-level # Introduction headingall: return the full Markdown content converted from the entire papertmp_llm_research/{arxiv-id}.md.references/categories.md.arXiv ID cannot be obtained reliably, do not fabricate an ID and do not directly replace the Markdown with a web summary. Mark the task as failed and state in the error: "Unable to reliably obtain arXiv ID".If source is github:
r.jina.ai version of the page as a fallback.references/categories.md.Maintain success and failure result collections in memory during the current run for final aggregation. Each result should include at least the following fields:
{
"id": "{sequence number}",
"title": "{title}",
"url": "{URL}",
"source": "{arxiv|github}",
"arxivId": "{arXiv ID, or null for GitHub}",
"category": "{category name}",
"authors": "{authors or organizations; use Unknown if unknown}",
"analysis": "{explain the content in simple, easy-to-understand language; the more detailed, the better}",
"status": "{done or failed; include the reason if failed}",
"attempt": "{current attempt count}",
"completedAt": "{ISO timestamp}"
}
After all tasks are complete, write the final Markdown report to the output folder. The file name format must be YYYYMMDDHHmm.md.
After the final report is successfully written, delete the entire tmp_llm_research folder.
The final report must include:
# Report Summary, including at least: Total, Success, Failed, and Retried Success.# Details, aggregated by category. Entries under each category must include at least: title, url, source, authors, and analysis.# Trending, summarizing the common trends, hot directions, and potential changes reflected by this batch of papers and projects.tmp_llm_research only after the final Markdown report has been successfully written, to avoid affecting the final aggregation.共 4 个版本