End-to-end workflow for cloning viral TikTok content structure and building a monetizable channel.
Step 0: Define Character (once) — create character bible, fill character sheet
Step 1: Analyze reference video — download, extract frame, decode content type
Step 2: Deep Viral Analysis — 7 triggers, frame breakdown, share mechanic
Step 3: Reverse-Engineer Viral Structure — hook/setup/conflict/peak/twist/CTA
Step 4: Generate New Video Concept — same structure, new character/scenario
Step 5: Scene-by-Scene AI Image Prompts — 6 frames, Midjourney/DALL-E
Step 6: Full Script with Text Overlays — per-frame spec with sound cues
Step 7: Algorithm Optimization — completion rate, hook formulas, length strategy
Step 8: CapCut Assembly Instructions — import, timing, effects, export
Step 9: 30-Day Posting Calendar + Monetization Roadmap
Before making any video, create a Character Bible. Read references/character-bible.md.
Key decisions:
Output: A completed character sheet saved to assets/character-sheet.md
See assets/character-sheet-template.md for the fill-in-the-blank template.
When user provides a TikTok URL:
# Download video
yt-dlp -o "/tmp/tiktok-ref.%(ext)s" "<URL>"
# Extract first frame for visual analysis
ffmpeg -i /tmp/tiktok-ref.mp4 -ss 00:00:01 -frames:v 1 /tmp/tiktok-thumb.jpg -update 1
# Try to get subtitles (may not be available)
yt-dlp --write-auto-sub --sub-lang "vi,en,zh" --skip-download -o "/tmp/tiktok-sub" "<URL>"
Then use the image tool to analyze /tmp/tiktok-thumb.jpg and describe:
This is the most important step. Read references/viral-analysis.md for full protocol.
Quick analysis (run for every reference video):
Output a Viral Analysis Report (template in viral-analysis.md) before writing any script.
Decode the video into this framework:
HOOK (0-3s): What stops the scroll?
SETUP (3-10s): What's the premise?
CONFLICT (10-25s): What's the tension/challenge?
PEAK (25-35s): What's the climax?
TWIST (35-45s): Surprise or punchline
CTA (45-55s): What action does it ask for?
Identify:
Create a new video using the SAME structure but DIFFERENT content:
Concept generation prompt:
Based on this viral structure: [DECODED STRUCTURE]
Generate 5 new video concepts using the same emotional arc but different characters/scenarios.
Each concept: Title + Character + Scenario + Twist
Format: AI-generated image series (6 frames)
Pick the best concept and proceed.
For each of the 6 frames, write Midjourney/DALL-E prompts following this formula:
[Subject] [Action] [Setting] [Emotion] [Style: photorealistic/cartoon/cinematic] [Aspect ratio: 9:16 vertical]
Quality modifiers to always include:
photorealistic, ultra detailed, professional photography9:16 vertical format (TikTok native)dramatic lighting or soft natural lightingfunny expression / shocked expression / confident poseConsistency tip: Start each prompt with "Same [character description]" to maintain visual consistency across frames.
> 🎀 Cute Factor: Cute characters — chubby body, round shape, big eyes — create instant emotional investment. The cuter the character, the faster viewers bond and the harder they laugh at failures. Lean into exaggerated cuteness. See references/character-bible.md for the Cute Factor framework.
Video generation options (choose by budget):
references/video-generation-tools.md for full comparison, pricing, and all 15 Kling AI prompts for Hammy video.See references/prompt-templates.md for 10 ready-to-use prompt templates.
See references/script-templates.md for 4 full script templates + 30 content ideas.
See references/character-bible.md for character archetypes, quality checklist, and emotional arc framework.
See references/viral-analysis.md for the 7 viral triggers, deep analysis protocol, and red flags checklist.
For EVERY scene (frame), specify ALL of the following in detail:
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
CẢNH [N] — [TIMESTAMP, e.g. 0:00-0:03]
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
👁️ HÌNH ẢNH (Visual):
- Nhân vật: [Ai? Đang làm gì? Biểu cảm gì? Tư thế?]
- Bối cảnh: [Địa điểm, ánh sáng, góc máy]
- Chuyển động: [Static / Ken Burns zoom in / Ken Burns zoom out / Pan]
- Chi tiết quan trọng: [Vật thể nào nổi bật? Màu sắc chủ đạo?]
🎨 DALL-E / MIDJOURNEY PROMPT:
"[Character description], [action], [setting], [emotion], [lighting], photorealistic, 9:16 vertical"
Negative prompt: "blurry, deformed, text, watermark"
📝 TEXT OVERLAY:
Nội dung: "[Tối đa 6 từ + emoji]"
Vị trí: [Top / Bottom / Center]
Font style: [Bold white + black stroke]
🔊 ÂM THANH:
Nhạc nền: [Tên loại nhạc / mood: sneaky / dramatic / happy / sad...]
SFX: [Tên sound effect cụ thể: "record scratch" / "vine boom" / "sad trombone" / không có]
Volume nhạc: [100% / 70% / 50%]
⏱️ THỜI LƯỢNG: [X giây]
🎭 CẢM XÚC MỤC TIÊU: [Người xem cảm thấy gì ở cảnh này?]
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
CẢNH 1 — 0:00-0:03 (HOOK)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
👁️ HÌNH ẢNH:
- Nhân vật: Hammy (hamster béo màu cam) đứng thẳng, hai tay chống hông
Biểu cảm: Tự tin, mắt ánh lên quyết tâm
Tư thế: Nhìn thẳng vào camera, ngực ưỡn ra
- Bối cảnh: Trước cổng ngân hàng khổng lồ, góc máy thấp nhìn lên
Ngân hàng cao như tòa nhà chọc trời so với Hammy
- Chuyển động: Ken Burns zoom out chậm để lộ kích thước ngân hàng
- Chi tiết: Ánh nắng buổi sáng, bóng Hammy dài trên mặt đường
🎨 DALL-E PROMPT:
"Hammy, a tiny chubby round orange hamster with big black eyes,
wearing yellow cherry-print shorts, standing confidently arms akimbo
in front of a massive marble bank building that towers above him,
dramatic low angle shot, morning golden light, photorealistic, 9:16 vertical"
Negative: "blurry, deformed, text, watermark, multiple characters"
📝 TEXT OVERLAY:
Nội dung: "Hammy có kế hoạch 😤"
Vị trí: Bottom 1/3
Font: Bold white + black stroke, size lớn
🔊 ÂM THANH:
Nhạc nền: Spy/sneaky piano theme (bắt đầu nhẹ)
SFX: Dramatic single piano chord khi text hiện ra
Volume nhạc: 60%
⏱️ THỜI LƯỢNG: 3 giây
🎭 CẢM XÚC: Tò mò + hào hứng — "Cái gì đây?!"
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
| Loại cảnh | Nhạc nền | SFX gợi ý |
|---|---|---|
| --- | --- | --- |
| HOOK | Dramatic sting / Silence → bang | Single piano chord |
| SETUP | Sneaky/spy theme | Footsteps, paper rustle |
| CONFLICT | Tension buildup | Machine hum, creaking |
| PEAK | Music STOPS → chaos | Explosion, scream |
| TWIST | Sad trombone / Magic sparkle | Record scratch, fail horn |
| CTA | Upbeat happy | Victory fanfare (mini) |
Mỗi video = 6-15 frames ảnh AI ghép thành video trong CapCut:
Ken Burns effect trên mỗi frame tạo cảm giác chuyển động dù ảnh tĩnh:
TikTok ranks videos by priority order:
Every creative decision must serve one of these metrics.
"Stop scrolling if you [situation]"
"POV: [character] just discovered [thing]"
"Watch until the end — [teaser of twist]"
"[Bold statement]. Let me explain."
"[Number] things about [topic] nobody talks about:"
1. Import all AI images in order
2. Set each image duration: 3-5 seconds
3. Add Ken Burns effect (slow zoom) on each image
4. Add transition: Fade or Whip Pan between frames
5. Add text overlay per frame (match script)
6. Add background music: search "sneaky/funny/dramatic" in CapCut sounds
7. Add sound effects at TWIST frame (e.g., "record scratch", "dramatic chipmunk")
8. Export: 1080x1920, 30fps, MP4
See references/posting-calendar.md for the full template.
Quick schedule:
Content mix per week:
3x Comedy/Story (core content)
2x Trend-riding (attach to current TikTok sounds)
1x Behind-the-scenes (how you make the AI images)
1x Engagement bait (poll, question, "guess what happens next")
See references/trend-riding.md for the full trend-riding playbook.
See references/channel-setup.md for new channel setup guide.
See references/monetization.md for full details.
Quick milestones:
1K followers → Enable TikTok Creator Fund
10K followers → Brand deals (AI tools, pet products, apps)
50K followers → Affiliate marketing ($500-2000/month)
100K+ → Sponsored content ($1000-5000/video)
Fastest monetization paths for AI comedy content:
共 1 个版本