← 返回
未分类 Key 中文

Instaparser

Use the Instaparser API to parse articles, PDFs, and generate summaries from URLs. Trigger when users want to extract content from web pages, parse PDF docum...
使用 Instaparser API 解析文章、PDF 并从 URL 生成摘要。当用户需要从网页提取内容、解析 PDF 文档或生成内容摘要时触发。
donohue donohue 来源
未分类 clawhub v1.0.2 1 版本 100000 Key: 需要
★ 1
Stars
📥 411
下载
💾 2
安装
1
版本
#article-parser#content-extraction#instaparser#latest#pdf-parser#summarization#web-scraping

概述

Instaparser API Skill

Use this skill when the user wants to interact with the Instaparser API to parse articles, PDFs, or generate summaries.

Requirements

  • Network access: This skill makes HTTPS requests to https://www.instaparser.com/api/. The user must grant network access when prompted.
  • API key: All requests require an Instaparser API key set as the INSTAPARSER_API_KEY environment variable.

Getting an API key

  1. Go to https://www.instaparser.com and create an account.
  2. After signing in, navigate to the API section of your dashboard to generate an API key.
  3. Set the key in your environment:

```bash

export INSTAPARSER_API_KEY="your_api_key_here"

```

  1. The free Trial plan includes a limited number of monthly credits. Paid plans are available for higher usage.

Authentication

All API requests require a Bearer token. The API key should be provided via the INSTAPARSER_API_KEY environment variable, or the user can provide it directly.

Authorization: Bearer $INSTAPARSER_API_KEY

API Endpoints

Article API

POST https://www.instaparser.com/api/1/article

Parse an article from a URL and extract its title, author, body content, images, and more. Uses 1 credit per call.

Request body (JSON):

ParameterTypeRequiredDescription
--------------------------------------------
urlstringYesURL of the article to parse
contentstringNoRaw HTML content to parse instead of fetching from url
outputstringNo"html" (default) or "text"
use_cacheboolNoWhether to use cache. Defaults to true

Example:

curl -X POST https://www.instaparser.com/api/1/article \
  -H "Authorization: Bearer $INSTAPARSER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com/article", "output": "text"}'

Response fields:

FieldDescription
----------------------------
urlCanonical URL
titleArticle title
site_nameWebsite name
authorAuthor name
datePublished date (UNIX timestamp)
descriptionArticle description
thumbnailThumbnail image URL
htmlHTML body (when output is "html")
textPlain text body (when output is "text")
wordsWord count
is_rtltrue if Arabic or Hebrew
imagesArray of image URLs
videosArray of video URLs

PDF API

Parse PDFs from a URL (GET) or by uploading a file (POST). Uses 5 credits per page.

Parse from URL

GET https://www.instaparser.com/api/1/pdf

ParameterTypeRequiredDescription
--------------------------------------------
urlstringYesURL of the PDF to parse
outputstringNo"html" (default) or "text"
use_cacheboolNoWhether to use cache. Defaults to true
curl "https://www.instaparser.com/api/1/pdf?url=https://example.com/report.pdf&output=text" \
  -H "Authorization: Bearer $INSTAPARSER_API_KEY"

Upload a file

POST https://www.instaparser.com/api/1/pdf

Send as multipart form-data with a file field.

curl -X POST https://www.instaparser.com/api/1/pdf \
  -H "Authorization: Bearer $INSTAPARSER_API_KEY" \
  -F "file=@report.pdf" \
  -F "output=text"

Response fields: Same as Article API.


Summary API

POST https://www.instaparser.com/api/1/summary

Generate an AI-powered summary with key sentences. Uses 10 credits per call.

Request body (JSON):

ParameterTypeRequiredDescription
--------------------------------------------
urlstringYesURL of the article to summarize
contentstringNoHTML content to parse instead of fetching from URL
use_cacheboolNoWhether to use cache. Defaults to true
streamboolNoStream the response. Defaults to false
curl -X POST https://www.instaparser.com/api/1/summary \
  -H "Authorization: Bearer $INSTAPARSER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com/article"}'

Response fields:

FieldDescription
-------------------------------
key_sentencesArray of key sentences extracted from the article
summaryConcise summary of the article

Status Codes

CodeReason
--------------
200Success
400Parameter missing or malformed
401API key is invalid
403Account suspended (payment error)
409Exceeded monthly credits (Trial plan only)
412Upstream parsing error
429Rate limit exceeded

SDK Usage

Python:

from instaparser import InstaparserClient

client = InstaparserClient(api_key="YOUR_API_KEY")

# Article
article = client.Article(url="https://example.com/article", output="text")

# PDF
pdf = client.PDF(url="https://example.com/report.pdf")

# Summary
summary = client.Summary(url="https://example.com/article")

JavaScript:

import { InstaparserClient } from 'instaparser-api';

const client = new InstaparserClient({ apiKey: 'YOUR_API_KEY' });

// Article
const article = await client.article({ url: 'https://example.com/article', output: 'text' });

// PDF
const pdf = await client.pdf({ url: 'https://example.com/report.pdf' });

// Summary
const summary = await client.summary({ url: 'https://example.com/article' });

Instructions

When the user asks to parse an article, PDF, or generate a summary:

  1. Check if INSTAPARSER_API_KEY is set in the environment. If not, ask the user for their API key.
  2. Use curl via the Bash tool to make the API request.
  3. For article parsing, default to output: "text" unless the user specifically wants HTML.
  4. For PDF parsing from a local file, use the multipart form-data POST method.
  5. For PDF parsing from a URL, use the GET method with query parameters.
  6. Present the results clearly — show title, author, word count, and the extracted content.
  7. For summaries, display both the overview/summary and the key sentences.

版本历史

共 1 个版本

  • v1.0.2 当前
    2026-03-31 05:47 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

knowledge-management

Obsidian

steipete
操作 Obsidian 仓库(纯 Markdown 笔记)并通过 obsidian-cli 自动化。
★ 446 📥 105,284
knowledge-management

web-tools-guide

user_ec205dbb
MANDATORY before calling web_search, web_fetch, browser, or opencli. Contains required error-handling procedures (web_se
★ 75 📥 163,938
knowledge-management

Summarize

paudyyin
智能摘要工具,自动为长文本、文档、网页生成摘要,提取要点与关键词,支持自定义摘要长度。
★ 963 📥 520,234