Manga Scraper

Download manga chapters from MangaBat (mangabats.com) directly via CDN — bypasses Cloudflare. Triggered when user asks to download/scrape manga chapters or m...
jrrqd
Uncategorized clawhub v1.0.2 1 version 100000 Key: not required
#cloudflare-bypass #download #latest #manga #mangabat #scrape #webp

Overview

MangaBat Scraper

Download manga chapters directly from MangaBat's CDN without hitting Cloudflare protection.

Auto-falls back to Playwright (headless browser) when CDN is IP-blocked.

How It Works

CDN Method (default): MangaBat serves images from storage.waitst.com, a CDN that is not behind Cloudflare protection.

Pattern: https://storage.waitst.com/zin/[slug]/[chapter]/[page].webp

Browser Fallback: If the CDN is IP-blocked, the script launches Playwright (headless Chromium) to extract image URLs directly from the chapter page's JavaScript.
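The CDN pattern above can be illustrated with a small sketch. It is not taken from manga_scraper.py. The helper names `parse_chapter_url` and `build_image_url` are hypothetical, and the sketch assumes the `[chapter]` and `[page]` path segments are plain numbers, which the README does not confirm.

```python
import re
from typing import Optional, Tuple
from urllib.parse import urlparse

# Current default CDN base, as stated in this README.
CDN_BASE = "https://storage.waitst.com/zin"


def parse_chapter_url(url: str) -> Tuple[str, Optional[str]]:
    """Split a MangaBat URL into (slug, chapter); chapter is None for a manga URL."""
    parts = [p for p in urlparse(url).path.split("/") if p]
    # Expected shapes: /manga/<slug> or /manga/<slug>/chapter-<n>
    if len(parts) < 2 or parts[0] != "manga":
        raise ValueError(f"not a manga URL: {url}")
    slug = parts[1]
    chapter = None
    if len(parts) >= 3:
        match = re.fullmatch(r"chapter-(\d+)", parts[2])
        if match is None:
            raise ValueError(f"unrecognized chapter segment: {parts[2]}")
        chapter = match.group(1)
    return slug, chapter


def build_image_url(slug: str, chapter: str, page: int) -> str:
    """Fill the [slug]/[chapter]/[page].webp pattern (numbering is an assumption)."""
    return f"{CDN_BASE}/{slug}/{chapter}/{page}.webp"
```

For example, `parse_chapter_url("https://www.mangabats.com/manga/some-title/chapter-5")` returns `("some-title", "5")`, where `some-title` is a made-up slug.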

Script Location

Locate the script in your skills directory:

find ~ -name manga_scraper.py 2>/dev/null

Setup Check

python3 /path/to/manga_scraper.py --help

Setup (One-Time)

Required for CDN mode only (default):

# Nothing! Uses stdlib only — urllib + concurrent.futures

Required for browser fallback (optional):

pip install playwright && playwright install chromium

Usage

Single chapter (CDN, fast):

python3 manga_scraper.py "https://www.mangabats.com/manga/[manga-slug]/chapter-5"

Single chapter + force Playwright fallback (for IP-blocked networks):

python3 manga_scraper.py "https://www.mangabats.com/manga/[manga-slug]/chapter-5" \
  --fallback-browser

Chapter range (1–10):

python3 manga_scraper.py "https://www.mangabats.com/manga/[manga-slug]" \
  --start 1 --end 10 --workers 4

All chapters (auto-detect last by 404 scan):

python3 manga_scraper.py "https://www.mangabats.com/manga/[manga-slug]" \
  --all --workers 3

Skip browser fallback (faster, for CI):

python3 manga_scraper.py "..." --no-browser

Custom output folder:

python3 manga_scraper.py "URL" --output ~/Manga/MyManga
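The "auto-detect last by 404 scan" behaviour of --all might look roughly like the sketch below. This is an assumption about the approach, not the script's actual code. `find_last_chapter` and the `chapter_exists` callback are hypothetical names, and the existence check is injected so that the logic can be read and tested without any network access.

```python
from typing import Callable


def find_last_chapter(
    chapter_exists: Callable[[int], bool],
    start: int = 1,
    limit: int = 5000,  # hypothetical safety cap so the scan always terminates
) -> int:
    """Return the last chapter number before the first missing one.

    Returns start - 1 when even the first chapter is missing.
    """
    last = start - 1
    for number in range(start, limit + 1):
        if not chapter_exists(number):
            break  # first "404" ends the scan
        last = number
    return last
```

A linear scan like this stops at the first gap, so a series with a missing chapter in the middle would be cut short. Whether the real script handles such gaps is not stated here.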

Flags

| Flag | Description |
|------|-------------|
| --all | Download all chapters (manga URL; auto-detects the last chapter by 404 scan) |
| --start N | Start from chapter N |
| --end N | End at chapter N |
| --workers N | Concurrent downloads, default 3 |
| --output, -o | Output directory, default ./downloads/ |
| --fallback-browser | Force Playwright fallback (for IP-blocked networks) |
| --no-browser | Skip Playwright fallback entirely (faster, CI/CD) |
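For readers who want to see how these flags fit together, here is a minimal argparse sketch. It mirrors the table but is not the script's real parser. The positional name `url`, the help strings, and the choice to make --fallback-browser and --no-browser mutually exclusive are assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser that mirrors the flags table (illustrative only)."""
    parser = argparse.ArgumentParser(prog="manga_scraper.py")
    parser.add_argument("url", help="chapter URL or manga URL")
    parser.add_argument("--all", action="store_true",
                        help="download all chapters of a manga URL")
    parser.add_argument("--start", type=int, metavar="N",
                        help="start from chapter N")
    parser.add_argument("--end", type=int, metavar="N",
                        help="end at chapter N")
    parser.add_argument("--workers", type=int, default=3, metavar="N",
                        help="concurrent downloads")
    parser.add_argument("--output", "-o", default="./downloads/",
                        help="output directory")
    # Assumption: forcing the browser and disabling it cannot be combined.
    browser = parser.add_mutually_exclusive_group()
    browser.add_argument("--fallback-browser", action="store_true",
                         help="force the Playwright fallback")
    browser.add_argument("--no-browser", action="store_true",
                         help="never use the Playwright fallback")
    return parser
```

Parsing `["URL", "--start", "1", "--end", "10", "--workers", "4"]` with this parser yields `start=1`, `end=10`, `workers=4`, and leaves `output` at `./downloads/`.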

CDN Fallback Chain

If one CDN fails, script tries the next automatically:

  1. storage.waitst.com: current default (/zin/[slug]/[ch]/[page].webp)
  2. img-r1.2xstorage.com: legacy (/[slug]/[ch]/[page].webp)
  3. img-2xcdn.com: fallback (/[slug]/[ch]/[page].webp)

If all CDNs return 403, the script automatically activates the Playwright fallback (Chromium is installed once).
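The try-in-order behaviour of the chain can be sketched as follows. This is an illustration under stated assumptions, not the script's implementation. `candidate_urls` and `fetch_with_fallback` are hypothetical names, and the `https://` scheme for the two older hosts is assumed. The `fetch` callable is supplied by the caller and should return `None` on failure.

```python
from typing import Callable, List, Optional

# Host order and path shapes follow the list above; the scheme is an assumption.
CDN_TEMPLATES = [
    "https://storage.waitst.com/zin/{slug}/{chapter}/{page}.webp",
    "https://img-r1.2xstorage.com/{slug}/{chapter}/{page}.webp",
    "https://img-2xcdn.com/{slug}/{chapter}/{page}.webp",
]


def candidate_urls(slug: str, chapter: str, page: int) -> List[str]:
    """Return one URL per CDN, in the order they should be tried."""
    return [t.format(slug=slug, chapter=chapter, page=page) for t in CDN_TEMPLATES]


def fetch_with_fallback(
    urls: List[str],
    fetch: Callable[[str], Optional[bytes]],
) -> Optional[bytes]:
    """Try each URL in order and return the first non-None result."""
    for url in urls:
        data = fetch(url)
        if data is not None:
            return data
    # Every CDN failed; a caller could switch to the browser fallback here.
    return None
```

Passing the fetcher in as a parameter keeps the ordering logic separate from the HTTP client, so the chain can be exercised with a stub function.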

Output

  • Saves to chapter_NNN/page_000.webp naming convention
  • Resume support: skips existing files
  • Some pages may be missing: MangaBat sometimes removes individual pages, leaving a placeholder of about 14 bytes that is skipped automatically
  • Image format: .webp
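The naming, resume, and placeholder rules above can be expressed in a few lines. This is a sketch rather than the script's own code. `page_path`, `should_download`, and `is_placeholder` are hypothetical helpers, and the 32-byte cutoff is an assumed margin around the "~14 bytes" figure.

```python
from pathlib import Path

# Assumption: anything this small is a removed-page placeholder, not an image.
PLACEHOLDER_MAX_BYTES = 32


def page_path(output_dir: str, chapter: int, page: int) -> Path:
    """Build the chapter_NNN/page_NNN.webp path with zero-padded numbers."""
    return Path(output_dir) / f"chapter_{chapter:03d}" / f"page_{page:03d}.webp"


def should_download(path: Path) -> bool:
    """Resume support: skip any page that already exists on disk."""
    return not path.exists()


def is_placeholder(data: bytes) -> bool:
    """Treat tiny responses as placeholders for removed pages."""
    return len(data) <= PLACEHOLDER_MAX_BYTES
```

With these helpers, page 0 of chapter 5 under `downloads` maps to `downloads/chapter_005/page_000.webp`.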

Troubleshooting

| Problem | Solution |
|---------|----------|
| 0/0 downloaded, all 403 | IP blocked. Use --fallback-browser or activate a VPN |
| 0/0 downloaded, all 000 | No internet. Check the connection |
| playwright import error | Run: pip install playwright && playwright install chromium |
| Missing pages (14 bytes each) | Normal. MangaBat sometimes removes pages from the CDN |
| Script breaks in future | Run with --fallback-browser, which reads image URLs from the chapter page instead of relying on a CDN URL pattern |

Notes

  • Script is pure Python stdlib (urllib + concurrent.futures) for CDN mode
  • Playwright fallback requires Chromium (~150MB download, one-time)
  • Be respectful: use --workers 3 or lower for batch downloads
  • MangaBat rotates CDNs every few months; the current CDN is storage.waitst.com

Version History

1 version in total

  • v1.0.2 (current)
    2026-06-04 13:52
