← 返回
未分类 Key 中文

Social Media Data Hub

Unified Apify-based retrieval for TikTok, Instagram, X/Twitter, and YouTube profile, post, and comment data with cross-platform normalization for analysis wo...
基于Apify的统一检索,获取TikTok、Instagram、X/Twitter和YouTube的个人资料、帖子和评论数据,跨平台标准化后用于分析。
galaxy-earth galaxy-earth 来源
未分类 clawhub v1.0.0 1 版本 100000 Key: 需要
★ 1
Stars
📥 434
下载
💾 0
安装
1
版本
#latest

概述

Social Media Data Hub

Use Apify to collect social media data from TikTok, Instagram, X/Twitter, and YouTube through one consistent workflow.

Core Capabilities

  1. Profile lookup - Retrieve profile metadata such as follower count, content volume, and bio.
  2. Bulk post retrieval - Pull recent or popular posts for an account together with engagement metrics.
  3. Single-post lookup - Input a post URL and return a normalized record with key metrics.
  4. Comment retrieval - Fetch comment threads for a supported post URL.
  5. Cross-platform normalization - Map all supported platforms into one shared schema for analysis.

Actor Mapping

PlatformActor IDPurpose
----------------------
TikTokclockworks/tiktok-scraperProfiles, posts, comments
Instagramapify/instagram-scraperProfiles, posts, comments
X/Twitter (bulk)apidojo/tweet-scraperBulk tweet retrieval (50+ items)
X/Twitter (precise)apidojo/twitter-scraper-liteSingle tweet, conversation, or small batches
YouTubestreamers/youtube-scraperChannels and videos

Usage

All scripts live under {baseDir}/scripts/, are executed with python3, and return JSON.

Profile Lookup

python3 {baseDir}/scripts/fetch_profile.py --platform tiktok --username "khaby.lame"
python3 {baseDir}/scripts/fetch_profile.py --platform instagram --username "natgeo"
python3 {baseDir}/scripts/fetch_profile.py --platform twitter --username "elonmusk"
python3 {baseDir}/scripts/fetch_profile.py --platform youtube --channel-url "https://www.youtube.com/@MrBeast"

Bulk Post Retrieval

python3 {baseDir}/scripts/fetch_posts.py --platform tiktok --username "khaby.lame" --count 20
python3 {baseDir}/scripts/fetch_posts.py --platform instagram --username "natgeo" --count 30
python3 {baseDir}/scripts/fetch_posts.py --platform twitter --username "elonmusk" --count 100
python3 {baseDir}/scripts/fetch_posts.py --platform youtube --channel-url "https://www.youtube.com/@MrBeast" --count 50

Single-Post Lookup

python3 {baseDir}/scripts/fetch_single_post.py --url "https://www.tiktok.com/@user/video/123456"
python3 {baseDir}/scripts/fetch_single_post.py --url "https://www.instagram.com/p/ABC123/"
python3 {baseDir}/scripts/fetch_single_post.py --url "https://x.com/user/status/123456"
python3 {baseDir}/scripts/fetch_single_post.py --url "https://www.youtube.com/watch?v=ABC123"

Comment Retrieval

python3 {baseDir}/scripts/fetch_comments.py --url "https://www.tiktok.com/@user/video/123456" --count 50
python3 {baseDir}/scripts/fetch_comments.py --url "https://www.instagram.com/p/ABC123/" --count 30
python3 {baseDir}/scripts/fetch_comments.py --url "https://x.com/user/status/123456" --count 100

Cross-Platform Normalization

normalize.py is used internally by the fetch scripts and can also be run directly:

echo '<raw_json>' | python3 {baseDir}/scripts/normalize.py --platform tiktok --type post

Unified Data Model

Normalized Post (NormalizedPost)

FieldTypeDescription
------------------
platformstringtiktok / instagram / twitter / youtube
post_idstringPlatform-native post ID
post_urlstringPost URL
textstringCaption, text body, or title
created_atstringISO 8601 timestamp
author_namestringAuthor username
author_display_namestringDisplay name
likesintLike count
commentsintComment count
sharesintShare or repost count, or null
viewsintView count, play count, or null
savesintSave or bookmark count, or null
hashtagslistHashtag list
media_typestringvideo / image / text / carousel

Normalized Profile (NormalizedProfile)

FieldTypeDescription
------------------
platformstringPlatform identifier
usernamestringUsername or handle
display_namestringDisplay name
biostringBiography or profile summary
followersintFollower count
followingintFollowing count, or null
posts_countintTotal content count, or null
profile_urlstringProfile URL
verifiedboolVerification status

Cost Reference (BRONZE Tier)

OperationTikTokInstagramX/TwitterYouTube
-------------------------------------------
Per post$0.003$0.0023$0.0004$0.003
Bulk 1K items$3.00$2.30$0.40$3.00
Single-item lookup~$0.50 (minimum charge)$0.0023$0.05$0.003

See {baseDir}/references/apify_actors_reference.md for actor inputs, selection guidance, and pricing details.

Notes

  • TikTok has a $0.50 minimum per run, so bulk retrieval is usually more cost-effective than single-item lookups.
  • For X/Twitter, use tweet-scraper when you need 50 or more items and twitter-scraper-lite for smaller batches or single tweets.
  • Instagram comments are limited to 50 comments per post.
  • YouTube comments are not handled by these scripts; use a dedicated YouTube Comments Scraper actor instead.
  • All scripts call the Apify REST API directly and do not rely on the apify-client SDK.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-30 23:59 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

data-analysis

AdMapix

fly0pants
AdMapix 原始数据层,提供广告创意、应用、排名、下载/收入及市场元数据。返回 AdMapix API 的结构化 JSON;调用方...
★ 298 📥 143,072
data-analysis

Data Analysis

ivangdavila
{"answer":"数据分析与可视化。查询数据库、生成报告、自动化电子表格,将原始数据转化为清晰可行的见解。适用于:(1) 您……"}
★ 218 📥 71,643
data-analysis

Tavily 搜索

jacky1n7
通过 Tavily API 进行网页搜索(Brave 替代方案)。当用户要求搜索网页、查找来源或链接,且 Brave 网页搜索不可用时使用。
★ 278 📥 101,620