Inspiration
The inspiration came from a harsh reality: pharmaceutical signals are detected too late. Safety issues, regulatory changes, and competitive moves appear gradually across fragmented sources: news, journals, regulatory filings. By the time human teams notice the pattern, patients are already affected, recalls are issued, or opportunities are lost. The problem isn't lack of information: it's information overload. Pharmacovigilance teams review thousands of documents weekly but miss early-stage patterns. We asked: What if AI could detect when patterns change, not just what the data says? That question became Wardian.
What It Does
Wardian is an always-on intelligence agent that monitors pharmaceutical data and detects abnormal changes before they escalate. How it works:
- Auto-classifies 1,000+ documents daily into 15+ pharma event types (adverse events, regulatory changes, clinical trials, competitor moves)
- Detects anomalies using statistical models: flags spikes 72 hours before traditional methods
- Generates alerts with 90% noise reduction: only surfaces top 5% critical insights
- Explains context using AI: tells you why it matters and what to do
Real scenario: 15 adverse events spike in 48 hours -> Wardian alerts team -> Investigation starts -> Regulatory filing submitted 5 days before media coverage -> Crisis prevented. It answers: "What's changing right now, and should we act?"
How We Built It
Architecture: Six-component AI agent system
- Data Ingestion: Automated scrapers pull from FDA, news feeds, clinical trial registries, medical journals
- NLP Classification: Fine-tuned BERT model categorizes text into pharma events (85%+ accuracy, <500 training samples)
- Event Storage: MongoDB with time-stamped events for temporal analysis
- Anomaly Detection: Statistical engine (z-scores, moving averages) flags 2.5x+ baseline spikes
- LLM Insights: GPT-4/Claude generates natural language explanations and action items
- MERN Dashboard: React frontend with real-time alerts, Node.js API, multi-channel notifications
Tech Stack: Python, MongoDB, React, Node.js, AWS, HuggingFace Transformers, FastAPI Key innovation: Combining NLP classification with time-series anomaly detection: most tools do one or the other, not both.
Challenges We Ran Into
- False Positive Overload Early versions flagged everything. Solution: Multi-source validation (alerts trigger only when 3+ sources show same signal) + adaptive thresholds = 90% noise reduction.
- Limited Training Data Pharma data is proprietary. Solution: Active learning with <500 high-quality samples + data augmentation achieved 85% accuracy.
- Explainability Gap Users asked "Why did this trigger?" Solution: Integrated LLMs to generate contextual explanations: "15 events, 2.5x baseline, correlated with trial results": built immediate trust.
- Scale & Speed Processing 1,000+ docs daily in real-time was tough. Solution: Queue-based async processing with Redis, horizontally scalable architecture.
- Defining "Abnormal" What's a meaningful spike? Solution: Configurable thresholds per customer/event type, refined through user feedback loops.
Accomplishments That We're Proud Of
- Validated real demand through 15+ pharma industry interviews
- Built a true AI agent: autonomous monitoring, not manual queries
- Achieved 72-hour early detection in pilot tests vs. 7-day industry standard
- 90% noise reduction while capturing 100% of critical signals
- Proved business viability: clear B2B SaaS model with $500K Year 1 ARR target Most proud of: Showing how it detected a real adverse event spike days before regulatory action. That's when we knew this could actually save lives.
What We Learned
Technical:
- Statistical models (moving averages, z-scores) often outperform complex ML for anomaly detection
- Explainability builds trust faster than raw accuracy in high-stakes domains
- LLMs are perfect for contextual intelligence layers, not just chat
Product:
- Start with workflow, not models: users care about reducing time to action, not ML metrics
- One false positive erodes more trust than ten good alerts build: be conservative
- 3-month pilots convert better than presentations: show, don't tell
Business:
- Pharma moves slowly but retention is sticky once integrated (compliance creates moats)
- Same detection engine solves multiple use cases: safety, trials, regulatory, competitive intelligence
- Market is bigger than expected: $8B pharma intelligence sector growing 15% annually
What's Next for Wardian
Near-Term (3-6 months):
- Launch with 5 paying pilots ($5K/pilot -> $50K annual)
- Multi-channel alerts (email, Slack, Teams, SMS)
- Scale to 500+ data sources with global regulatory coverage
Medium-Term (6-12 months):
- Multi-language support for EU/APAC markets
- API integrations with pharmacovigilance systems (Oracle Argus, ArisGlobal)
- Predictive analytics: "DrugX shows patterns similar to past recalls"
Long-Term Vision:
- Expand to medical devices, healthcare policy, ESG compliance
- Self-learning system with reinforcement learning from analyst feedback
- Become the industry standard for pharmaceutical early-warning intelligence
The Mission:
Pharmaceutical intelligence is reactive today: companies wait for crises, then respond. Wardian makes it proactive. The next safety crisis is preventable. The next regulatory change is predictable. The next competitive threat is detectable. We exist to make sure no signal goes unnoticed.
Built With
- express.js
- flask-fast-api
- machine-learning
- mongodb
- node.js
- python
- react
- tailwind
Log in or sign up for Devpost to join the conversation.