This skill extends Manus's capabilities with a custom-built, semantic search engine optimized specifically for AI agents and LLMs. It enables neural search, clean Markdown extraction, extractive query highlights, and conceptual similarity searches.
Use this skill when:
highlights) to reduce token consumption./search)Query Exa's index using semantic embeddings. Unlike keyword matching, this understands the conceptual meaning of your prompt.
from exa_py import Exa
exa = Exa(api_key="YOUR_EXA_API_KEY")
results = exa.search(
query="companies building innovative fusion energy reactors",
type="auto",
num_results=5,
contents={"highlights": True}
)
/contents)Retrieve webpage content stripped of navigation menus, sidebars, advertisements, and other boilerplate, returned as clean Markdown.
contents = exa.get_contents(
urls=["https://example.com/target-article"],
text=True,
max_age_hours=24
)
/findSimilar)Find conceptually similar pages in Exa's index using a starting URL as your query.
similar = exa.find_similar(
url="https://arxiv.org/abs/2307.06435",
num_results=5
)
By default, Exa serves cached pages to optimize speed. To control cache freshness, use max_age_hours instead of deprecated livecrawl parameters:
max_age_hours=0: Forces a live crawl of the URL.max_age_hours=1: Uses cache if it's less than 1 hour old, otherwise performs a live crawl.max_age_hours=-1: Cache-only lookup (never crawl).Automatically discover and extract content from linked subpages on a target site. Highly effective for documentation or news archives:
results = exa.get_contents(
["https://docs.exa.ai"],
subpages=10,
subpage_target=["api", "reference"],
max_age_hours=24
)
Always format extracted contents cleanly into XML blocks for downstream LLM generation:
context = "\n".join([
f"<source><url>{r.url}</url><highlights>{r.highlights}</highlights></source>"
for r in results.results
])
/home/ubuntu/skills/exa-search/scripts/exa_search.py --help共 1 个版本