Reinforcement Learning System

Reinforcement learns from user feedback and outcome data:

core/reinforcement.py - Core RL System
- ReinforcementScorer: Calculates reward from YouTube metrics
- DecisionRewardTracker: Tracks decisions + outcomes
- UserFeedbackLearner: Learns from user corrections
- AdaptiveDecisionMaker: Makes decisions based on learned patterns
- LearningAgent: Orchestrates the learning loop
core/integrated_decision_engine.py - Updated Decision Engine
- Integrates Gemini reasoning with learned patterns
- Records decisions for learning
- Improves decisions based on historical performance
- Respects user overrides first
scripts/update_learning.py - Feedback Collection
- Fetches YouTube performance data
- Records user feedback & corrections
- Analyzes learning progress
- Usage: python scripts/update_learning.py --fetch-youtube
scripts/learning_dashboard.py - Visualization
- Interactive dashboard showing:
  - Decision quality trends
  - User feedback patterns
  - Learned preferences
  - Performance improvements
- Multiple views: main, history, feedback, trends

Result: Agent gets smarter with every video, learning your actual preferences without retraining models!

Log in or sign up for Devpost to join the conversation.