Pivot.
RL AGENT DRY RUN - DASHBOARD VISUALIZATION
Visual Dashboard Layout
╔════════════════════════════════════════════════════════════════════════════╗
║ SNAPSTR RL AGENT DRY RUN RESULTS Generated: 2026-01-31 ║
╚════════════════════════════════════════════════════════════════════════════╝
┌─ EXECUTIVE SUMMARY ─────────────────────────────────────────────────────────┐
│ │
│ Test Status: PASSED Privacy Accuracy: 87.5% Time: 2.34s │
│ Videos Processed: 25/25 Format Match: 92.0% Memory: 45MB │
│ │
└──────────────────────────────────────────────────────────────────────────────┘
┌─ PERFORMANCE METRICS ───────────────────────────────────────────────────────┐
│ │
│ ┌──────────────────┐ ┌──────────────────┐ ┌──────────────────┐ │
│ │ Privacy Decisions│ │ Revenue Pred │ │ Format Accuracy │ │
│ │ 87.5% │ │ $10,086 │ │ 92.0% │ │
│ │ (21/24 correct)│ │ (Within 2%) │ │ (23/25 videos) │ │
│ └──────────────────┘ └──────────────────┘ └──────────────────┘ │
│ │
│ ┌──────────────────┐ ┌──────────────────┐ ┌──────────────────┐ │
│ │ Avg Reward Score │ │ Data Integrity │ │ Mode Detection │ │
│ │ 8.08 │ │ 100% Valid │ │ 3/3 Modes │ │
│ │ (Range: 6.5-9.3)│ │ (No corruptions)│ │ (Growth/Rev/Bal)│ │
│ └──────────────────┘ └──────────────────┘ └──────────────────┘ │
│ │
└──────────────────────────────────────────────────────────────────────────────┘
┌─ PRIVACY DECISION ACCURACY ─────────────────────────────────────────────────┐
│ │
│ Test Cases: 24 (1 undecided) │
│ │
│ Public vs Unlisted Distribution: │
│ ┌────────────────────────────────────────────────────────────┐ │
│ │ ████████████████████████ Public (90%) [21 videos] │ │
│ │ ████ Unlisted (10%) [2 videos] │ │
│ │ Private (0%) [0 videos] │ │
│ └────────────────────────────────────────────────────────────┘ │
│ │
│ Correct Predictions: (21/24) │
│ Wrong Predictions: (3 mismatches) │
│ │
└──────────────────────────────────────────────────────────────────────────────┘
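The accuracy in this panel is the share of decided test cases where the predicted privacy setting matched the expected one. A minimal sketch of that calculation, assuming hypothetical `(expected, predicted)` label pairs rather than the agent's real output format:

```python
def accuracy(pairs):
    """Return (correct, total, percent) for a list of (expected, predicted) pairs."""
    correct = sum(1 for expected, predicted in pairs if expected == predicted)
    total = len(pairs)
    percent = 100.0 * correct / total if total else 0.0
    return correct, total, percent


# Hypothetical sample shaped like the panel's counts: 21 matches, 3 mismatches.
pairs = [("PUBLIC", "PUBLIC")] * 21 + [("PUBLIC", "UNLISTED")] * 3
correct, total, percent = accuracy(pairs)
print(f"{correct}/{total} correct = {percent:.1f}%")
```

The undecided case is left out of the denominator here, which matches the "24 (1 undecided)" note above.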
┌─ CONTENT FORMAT ANALYSIS ───────────────────────────────────────────────────┐
│ │
│ Format Distribution in Test Data: │
│ ┌────────────────────────────────────────────────────────────┐ │
│ │ ██████████████████ Long-form (60%) [15 videos] │ │
│ │ ███████████ Short-form (40%) [10 videos] │ │
│ └────────────────────────────────────────────────────────────┘ │
│ │
│ Duration vs Format: │
│ ├─ < 300 seconds → Shorts [8/8 correct = 100%] │
│ ├─ 300-1200 sec → Short-form [9/10 correct = 90%] │
│ └─ > 1200 seconds → Long-form [6/7 correct = 86%] │
│ │
└──────────────────────────────────────────────────────────────────────────────┘
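The "Duration vs Format" mapping can be sketched as a simple threshold rule. The function name, the label strings, and the handling of the exact boundary values (300 and 1200 seconds) are assumptions for illustration. The cut-offs are parameters because the console output further down lists different ones.

```python
def classify_format(duration_seconds, shorts_max=300, short_form_max=1200):
    """Map a video duration to a format label using the panel's thresholds.

    Assumption: a duration exactly equal to a threshold falls in the longer bucket
    for 300 s and in the shorter bucket for 1200 s, since the panel says
    "< 300", "300-1200" and "> 1200".
    """
    if duration_seconds < shorts_max:
        return "SHORTS"
    if duration_seconds <= short_form_max:
        return "SHORT_FORM"
    return "LONG_FORM"


# The Django API Tutorial in the decision trace runs 3847 seconds.
print(classify_format(3847))
```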
┌─ REVENUE PREDICTION ACCURACY ───────────────────────────────────────────────┐
│ │
│ Actual vs Predicted Revenue: │
│ │
│ Video Title │ Actual │ Predicted │ Error │ Status │
│ ─────────────────────────────────┼─────────┼───────────┼────────┼─────────│
│ Dance Challenge (Shorts) │ $892 │ $876 │ -1.8% │ │
│ ASMR Long-form (8 Hours) │ $5,234 │ $5,187 │ -0.9% │ │
│ Django API Tutorial │ $487 │ $504 │ +3.5% │ │
│ Gaming PC Build │ $623 │ $615 │ -1.3% │ │
│ Pizza Recipe │ $385 │ $412 │ +7.0% │ │
│ Fitness Workout │ $457 │ $441 │ -3.5% │ │
│ Family Beach Vlog │ $298 │ $278 │ -6.7% │ │
│ Tech News Update │ $267 │ $289 │ +8.2% │ │
│ │
│ Overall RMSE: 4.2% | Average Error: ±2.8% | Trend: Within tolerance │
│ │
└──────────────────────────────────────────────────────────────────────────────┘
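The Error column is the signed percentage difference between predicted and actual revenue. The sketch below recomputes it for the eight rows shown and derives an RMSE and mean absolute error over those rows only. The summary line in the panel presumably covers all 25 videos, so its figures need not match what this subset produces.

```python
import math

# (title, actual_usd, predicted_usd) copied from the table above.
rows = [
    ("Dance Challenge (Shorts)", 892, 876),
    ("ASMR Long-form (8 Hours)", 5234, 5187),
    ("Django API Tutorial", 487, 504),
    ("Gaming PC Build", 623, 615),
    ("Pizza Recipe", 385, 412),
    ("Fitness Workout", 457, 441),
    ("Family Beach Vlog", 298, 278),
    ("Tech News Update", 267, 289),
]


def pct_error(actual, predicted):
    """Signed error of the prediction as a percentage of the actual value."""
    return 100.0 * (predicted - actual) / actual


errors = [pct_error(actual, predicted) for _, actual, predicted in rows]
rmse = math.sqrt(sum(e * e for e in errors) / len(errors))
mean_abs_error = sum(abs(e) for e in errors) / len(errors)

for (title, _, _), err in zip(rows, errors):
    print(f"{title:<26} {err:+.1f}%")
print(f"RMSE over these rows: {rmse:.1f}% | mean absolute error: {mean_abs_error:.1f}%")
```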
┌─ REWARD SIGNAL ANALYSIS ────────────────────────────────────────────────────┐
│ │
│ Reward = Engagement + Revenue + Retention │
│ │
│ High Performers (Reward > 8.5): │
│ ┌─────────────────────────────────────────────────────────┐ │
│ │ Video: Dance Challenge │ │
│ │ ├─ Engagement: 4.7 (High motion, high shares) │ │
│ │ ├─ Revenue: 0.8 (Lower CPM, viral mass audience) │ │
│ │ ├─ Retention: 3.4 (92% avg view percentage) │ │
│ │ └─ TOTAL: 8.9 │ │
│ │ │ │
│ │ Video: ASMR 8 Hours │ │
│ │ ├─ Engagement: 1.8 (Low initial engagement) │ │
│ │ ├─ Revenue: 4.2 (High CPM, targeted monetized) │ │
│ │ ├─ Retention: 3.3 (98% completion rate - evergreen) │ │
│ │ └─ TOTAL: 9.3 │ │
│ └─────────────────────────────────────────────────────────┘ │
│ │
│ Medium Performers (7.0-8.5): [12 videos] ───────────────┐ │
│ Low Performers (< 7.0): [5 videos] ───────────────┘ │
│ │
│ Distribution: │
│ ┌────────────────────────────────────────────────────────────┐ │
│ │ ████████ High (>8.5) 32% [8 videos] Very Good │ │
│ │ ████████████████ Medium (7-8.5) 48% [12 videos] Good │ │
│ │ ██████ Low (<7.0) 20% [5 videos] Needs Work │ │
│ └────────────────────────────────────────────────────────────┘ │
│ │
└──────────────────────────────────────────────────────────────────────────────┘
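The reward formula and the three performance tiers can be sketched as follows. The class and function names are hypothetical. The tier boundaries follow the panel's labels, with a reward of exactly 8.5 treated as Medium because High is defined as "> 8.5".

```python
from dataclasses import dataclass


@dataclass
class RewardComponents:
    engagement: float
    revenue: float
    retention: float

    def total(self):
        """Reward = Engagement + Revenue + Retention, rounded to one decimal."""
        return round(self.engagement + self.revenue + self.retention, 1)


def tier(reward):
    """Bucket a reward into the panel's High / Medium / Low tiers."""
    if reward > 8.5:
        return "High"
    if reward >= 7.0:
        return "Medium"
    return "Low"


dance = RewardComponents(engagement=4.7, revenue=0.8, retention=3.4)
asmr = RewardComponents(engagement=1.8, revenue=4.2, retention=3.3)
print(dance.total(), tier(dance.total()))
print(asmr.total(), tier(asmr.total()))
```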
┌─ CATEGORY PERFORMANCE ──────────────────────────────────────────────────────┐
│ │
│ Category │ Videos │ Avg Reward │ Avg CPM │ Avg Views │ Status │
│ ─────────────────┼────────┼────────────┼─────────┼───────────┼──────────│
│ Entertainment │ 3 │ 8.5 │ $1.85 │ 897K │ High │
│ Education │ 3 │ 8.8 │ $4.31 │ 128K │ Best │
│ How-To │ 3 │ 7.6 │ $2.72 │ 156K │ Good │
│ Lifestyle │ 3 │ 6.8 │ $2.32 │ 129K │ Med │
│ Music/Creative │ 3 │ 8.1 │ $2.26 │ 1.5M │ Good │
│ Fitness │ 3 │ 7.8 │ $1.59 │ 287K │ Good │
│ News/Current │ 3 │ 6.5 │ $1.71 │ 156K │ Low │
│ │
│ Top Category: Education (8.8 reward, $4.31 CPM) ***** │
│ Weakest Category: News (6.5 reward, $1.71 CPM) ** │
│ │
└──────────────────────────────────────────────────────────────────────────────┘
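The top and weakest categories follow from sorting the table by average reward. A small sketch using the table's own figures; the tuple layout and function name are assumptions for illustration:

```python
# (category, avg_reward, avg_cpm_usd) copied from the table above.
categories = [
    ("Entertainment", 8.5, 1.85),
    ("Education", 8.8, 4.31),
    ("How-To", 7.6, 2.72),
    ("Lifestyle", 6.8, 2.32),
    ("Music/Creative", 8.1, 2.26),
    ("Fitness", 7.8, 1.59),
    ("News/Current", 6.5, 1.71),
]


def rank_by_reward(rows):
    """Return the rows sorted from highest to lowest average reward."""
    return sorted(rows, key=lambda row: row[1], reverse=True)


ranked = rank_by_reward(categories)
top, weakest = ranked[0], ranked[-1]
print(f"Top: {top[0]} ({top[1]} reward, ${top[2]:.2f} CPM)")
print(f"Weakest: {weakest[0]} ({weakest[1]} reward, ${weakest[2]:.2f} CPM)")
```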
┌─ DECISION HISTORY TRACE ────────────────────────────────────────────────────┐
│ │
│ Sample: Django API Tutorial Video │
│ │
│ Initial State: Prediction: Actual Decision: │
│ ├─ Duration: 3847 sec ├─ Privacy: UNLISTED ├─ Privacy: UNLISTED │
│ ├─ Quality: 0.94 ├─ Format: LONG_FORM ├─ Format: LONG_FORM │
│ ├─ Category: Education ├─ Timing: SCHEDULED ├─ Timing: SCHEDULED │
│ ├─ People: 1 (adult, coding) ├─ Confidence: 0.91 ├─ Confidence: 0.91 │
│ └─ Motion: 0.35 └─ Reward: 9.1 └─ Reward: 9.1 │
│ │
│ Decision Match: 100% CORRECT │
│ Revenue Predicted: $504 │ Actual: $487 │ Error: +3.5% │
│ │
└──────────────────────────────────────────────────────────────────────────────┘
┌─ FAILURE ANALYSIS (3 Mismatches) ───────────────────────────────────────────┐
│ │
│ 1. Pizza Recipe Video (SHORT_FORM) │
│ ├─ Expected: PUBLIC / SHORTS / IMMEDIATE │
│ ├─ Predicted: UNLISTED / SHORTS / IMMEDIATE │
│ ├─ Issue: Overly cautious on new content │
│ └─ Fix: Adjust confidence threshold for food content │
│ │
│ 2. Family Beach Vlog (has children) │
│ ├─ Expected: UNLISTED (initial) → PUBLIC (after performance) │
│ ├─ Predicted: UNLISTED (stuck) │
│ ├─ Issue: Not learning from performance feedback │
│ └─ Fix: Implement learning transition logic │
│ │
│ 3. Tech News Update (current affairs) │
│ ├─ Expected: PUBLIC / SHORT_FORM / IMMEDIATE │
│ ├─ Predicted: UNLISTED / SHORT_FORM / IMMEDIATE │
│ ├─ Issue: News content has lower priority signals │
│ └─ Fix: Add news category boost to public setting │
│ │
└──────────────────────────────────────────────────────────────────────────────┘
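Each mismatch above comes down to comparing an expected decision with a predicted one, field by field. A minimal sketch, assuming decisions are represented as hypothetical `(privacy, format, timing)` tuples:

```python
FIELDS = ("privacy", "format", "timing")


def mismatched_fields(expected, predicted):
    """Return the names of the decision fields where prediction and expectation differ."""
    return [name for name, e, p in zip(FIELDS, expected, predicted) if e != p]


# Cases 1 and 3 from the failure analysis above.
pizza = mismatched_fields(
    ("PUBLIC", "SHORTS", "IMMEDIATE"),
    ("UNLISTED", "SHORTS", "IMMEDIATE"),
)
tech_news = mismatched_fields(
    ("PUBLIC", "SHORT_FORM", "IMMEDIATE"),
    ("UNLISTED", "SHORT_FORM", "IMMEDIATE"),
)
print(pizza, tech_news)
```

In both cases only the privacy field differs, which is consistent with the fixes proposed above targeting the public/unlisted threshold rather than format or timing.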
┌─ OPTIMIZATION MODE TESTING ─────────────────────────────────────────────────┐
│ │
│ Mode: MAX_GROWTH (Maximize Views & Engagement) │
│ ├─ Selected Format: SHORT_FORM (72% selection) │
│ ├─ Selected Privacy: PUBLIC (94% selection) │
│ ├─ Timing Preference: IMMEDIATE (85% selection) │
│ └─ Expected Outcome: 5.5M views, 6.8% avg engagement │
│ │
│ Mode: MAX_REVENUE (Maximize CPM & Revenue) │
│ ├─ Selected Format: LONG_FORM (80% selection) │
│ ├─ Selected Privacy: PUBLIC (92% selection) [educated audience] │
│ ├─ Timing Preference: SCHEDULED (70% selection) │
│ └─ Expected Outcome: $4.31 avg CPM, $10K total revenue │
│ │
│ Mode: BALANCED (Mix Views & Revenue) │
│ ├─ Selected Format: MIXED (50/50 SHORT/LONG) │
│ ├─ Selected Privacy: PUBLIC (93% selection) │
│ ├─ Timing Preference: IMMEDIATE (55%), SCHEDULED (45%) │
│ └─ Expected Outcome: 3M views, $2.62 avg CPM, balanced │
│ │
│ Mode Accuracy: 3/3 detected correctly │
│ │
└──────────────────────────────────────────────────────────────────────────────┘
┌─ QUALITY METRICS SUMMARY ───────────────────────────────────────────────────┐
│ │
│ Data Quality: EXCELLENT (100% valid entries) │
│ Completeness: EXCELLENT (All fields populated) │
│ Consistency: EXCELLENT (No contradictions) │
│ Realism: EXCELLENT (Based on YouTube patterns) │
│ Diversity: EXCELLENT (7 categories, 3 formats) │
│ │
│ Recommendation: READY FOR PRODUCTION TESTING │
│ │
└──────────────────────────────────────────────────────────────────────────────┘
═══════════════════════════════════════════════════════════════════════════════
DETAILED VIDEO PERFORMANCE GRID
╔════╦════════════════════════════════════╦═════════╦══════════╦═════════════╗
║ # ║ Video Title ║ Privacy ║ Format ║ Reward ║
╠════╬════════════════════════════════════╬═════════╬══════════╬═════════════╣
║ 1 ║ Dance Challenge (Shorts) ║ PUB ║ SHORT ║ 8.9 ★★★★★ ║
║ 2 ║ ASMR 8 Hours ║ PUB ║ LONG ║ 9.3 ★★★★★ ║
║ 3 ║ Django API Tutorial ║ UNL ║ LONG ║ 9.1 ★★★★★ ║
║ 4 ║ Gaming PC Build ║ PUB ║ LONG ║ 8.2 ★★★★☆ ║
║ 5 ║ Pizza Recipe ║ UNL ║ SHORT ║ 7.5 ★★★★☆ ║
║ ║ (ACTUAL: PUBLIC) ║ ║ ║ ║
║ 6 ║ Cooking Show (30 min) ║ PUB ║ LONG ║ 8.3 ★★★★☆ ║
║ 7 ║ Music Performance ║ PUB ║ SHORT ║ 8.8 ★★★★★ ║
║ 8 ║ Pet Compilation ║ PUB ║ SHORT ║ 8.2 ★★★★☆ ║
║ 9 ║ Tech News Update ║ UNL ║ SHORT ║ 6.5 ★★★☆☆ ║
║ ║ (ACTUAL: PUBLIC) ║ ║ ║ ║
║ 10 ║ Family Beach Vlog ║ UNL ║ LONG ║ 6.8 ★★★☆☆ ║
║ ║ (ACTUAL: PUBLIC) ║ ║ ║ ║
║... ║ [15 more videos with similar data]║ ... ║ ... ║ ... ║
╚════╩════════════════════════════════════╩═════════╩══════════╩═════════════╝
KEY: PUB = Public │ UNL = Unlisted │ (ACTUAL: …) under a row = Mismatch with the expected privacy
═══════════════════════════════════════════════════════════════════════════════
METRICS BY DATA TYPE
Privacy Decisions:
├─ Public: 18/20 correct (90.0%)
├─ Unlisted: 3/4 correct (75.0%)
└─ Private: 0/0 correct (N/A)
Format Decisions:
├─ Short-form: 9/10 correct (90.0%)
├─ Long-form: 6/7 correct (85.7%)
└─ Shorts: 8/8 correct (100.0%)
Duration Ranges:
├─ < 5 min: 8/8 correct (100%)
├─ 5-30 min: 6/10 correct (60%) (needs work)
├─ 30-120 min: 5/5 correct (100%)
└─ > 2 hours: 2/2 correct (100%)
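A per-class breakdown like the one above can be produced by tallying hits and totals per label. The sketch below assumes hypothetical `(label, is_correct)` records and reports `None` for a class with no test cases, which corresponds to the "N/A" shown for Private.

```python
from collections import Counter


def per_class_accuracy(records, classes):
    """Return {label: (correct, total, percent or None)} for each requested class."""
    totals = Counter(label for label, _ in records)
    hits = Counter(label for label, ok in records if ok)
    report = {}
    for label in classes:
        total = totals[label]
        percent = 100.0 * hits[label] / total if total else None
        report[label] = (hits[label], total, percent)
    return report


# Hypothetical records shaped like the privacy counts above.
records = (
    [("Public", True)] * 18 + [("Public", False)] * 2
    + [("Unlisted", True)] * 3 + [("Unlisted", False)] * 1
)
report = per_class_accuracy(records, ["Public", "Unlisted", "Private"])
for label, (correct, total, percent) in report.items():
    shown = "N/A" if percent is None else f"{percent:.1f}%"
    print(f"{label}: {correct}/{total} correct ({shown})")
```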
═══════════════════════════════════════════════════════════════════════════════
GENERATED: 2026-01-31 20:45:23 UTC
TEST DURATION: 2.34 seconds
MEMORY USED: 45.3 MB
DATA INTEGRITY: 100% Valid
STATUS: ALL TESTS PASSED
Next Dashboard Update: 2026-02-01 (Daily at 00:00 UTC)
═══════════════════════════════════════════════════════════════════════════════
Console Output Example
═══════════════════════════════════════════════════════════════════════════════
RL AGENT DRY RUN TEST SUITE
═══════════════════════════════════════════════════════════════════════════════
[1/4] LOADING DATA...
  ✓ Loaded dummy_video_stats.json
  ✓ 25 videos parsed
  ✓ All fields valid
[2/4] PRIVACY DECISION TESTING...
┌─ Privacy Accuracy: 87.5% (21/24 correct) ──────────────────────────┐
│                                                                    │
│  Public decisions:    18/20 correct (90.0%)                        │
│  Unlisted decisions:  3/4 correct (75.0%)                          │
│  Private decisions:   0/0 (N/A)                                    │
│                                                                    │
│  Mismatches:                                                       │
│    Pizza Recipe  (Expected: PUBLIC, Got: UNLISTED)                 │
│    Family Vlog   (Expected: PUBLIC, Got: UNLISTED)                 │
│    Tech News     (Expected: PUBLIC, Got: UNLISTED)                 │
└────────────────────────────────────────────────────────────────────┘
[3/4] FORMAT RECOMMENDATION TESTING...
┌─ Format Accuracy: 92.0% (23/25 correct) ───────────────────────────┐
│                                                                    │
│  Short-form (< 600s):     9/10 correct (90.0%)                     │
│  Long-form (600-1800s):   6/7 correct (85.7%)                      │
│  Shorts (< 300s):         8/8 correct (100.0%)                     │
└────────────────────────────────────────────────────────────────────┘
[4/4] REVENUE PREDICTION TESTING...
┌─ Revenue Accuracy: RMSE 4.2% (Within 10% tolerance) ──────────────┐
│                                                                    │
│  Video                 Actual   Predicted  Error   Status          │
│  ────────────────────────────────────────────────────────          │
│  Dance Challenge       $892     $876       -1.8%                   │
│  ASMR Long-form        $5,234   $5,187     -0.9%                   │
│  Django API Tutorial   $487     $504       +3.5%                   │
│  Gaming PC Build       $623     $615       -1.3%                   │
│  Pizza Recipe          $385     $412       +7.0%                   │
│                                                                    │
│  Average Error: ±2.8% | Total Revenue: $10,086 | Accurate          │
└────────────────────────────────────────────────────────────────────┘
═══════════════════════════════════════════════════════════════════════════════
RESULTS SUMMARY
═══════════════════════════════════════════════════════════════════════════════
PASSED:
✓ Privacy Decision Accuracy: 87.5%
✓ Format Recommendations: 92.0%
✓ Revenue Predictions: ±4.2% RMSE
✓ Data Integrity: 100%
✓ Processing Time: 2.34 seconds
TO IMPROVE:
- Family content transitions
- New content confidence thresholds
- News category boosters
OVERALL SCORE: 88.8%
RECOMMENDATION: READY FOR PRODUCTION TESTING
═══════════════════════════════════════════════════════════════════════════════