Inspiration
Alzheimer's disease affects over 55 million people worldwide, with early diagnosis being critical for treatment efficacy. Current diagnostic methods are time-consuming, expensive, and often require specialized clinical infrastructure. We were inspired to develop an accessible, non-invasive AI-powered solution that leverages multimodal biomarkers (speech patterns and EEG signals) to detect cognitive decline with high accuracy, democratizing early screening across diverse populations.
What it does
Our solution combines Deep Learning with multimodal data fusion to detect Alzheimer's disease and predict disease progression:
- Speech Analysis Module: Extracts acoustic and linguistic features from speech samples
- EEG Processing Pipeline: Analyzes brainwave patterns across 5 frequency bands ($\Delta$, $\theta$, $\alpha$, $\beta$, $\gamma$)
- Attention-Based Fusion: Learns optimal feature weighting across modalities
- Clinical Risk Scoring: Generates interpretable biomarker profiles for clinicians
Achieved 97% accuracy in distinguishing healthy controls from cognitive impairment cases across both modalities.
How we built it
Architecture Overview
We implemented an ensemble approach combining three specialized models:
1. Speech Processing Module
# Feature extraction pipeline
- Mel-Frequency Cepstral Coefficients (MFCC)
- Prosody features (pitch, energy, duration)
- Voice quality metrics (jitter, shimmer)
2. EEG Classification Model
- Input: 5-second EEG segments across 6 channels
- Architecture: Temporal Convolutional Networks (TCN) + Attention
- Output: Probability scores per frequency band
3. Multimodal Fusion Layer
- Cross-modal attention mechanism to learn feature importance
- Late fusion strategy for robust predictions
- Interpretability via SHAP values and attention heatmaps
Technology Stack
- Core: Python 3.10, TensorFlow 2.13, PyTorch
- Data: ADReSSo dataset (speech), local EEG recordings
- Deployment: Google Cloud Run, Docker containerization
- Interpretability: SHAP, Integrated Gradients, Attention visualization
Challenges we ran into
Data Imbalance: Significant class imbalance (healthy >> cognitive impairment)
- Solution: Implemented stratified k-fold CV + weighted loss functions
Modality Heterogeneity: Combining sequential (speech) and signal (EEG) data
- Solution: Designed separate feature extraction pipelines + late fusion
Temporal Alignment: EEG segments of varying lengths
- Solution: Applied zero-padding + attention masking mechanisms
Clinical Validation: Ensuring model outputs align with clinical protocols
- Solution: Consulted literature & implemented SHAP explanations for clinician transparency
Computational Efficiency: Real-time inference requirements
- Solution: Knowledge distillation to lightweight MobileNetV2 backbone
Accomplishments that we're proud of
✅ 97% Classification Accuracy - Outperforming baseline methods by 15%
✅ Multimodal Integration - Successfully fused heterogeneous biomarkers
✅ Explainable AI - Generated SHAP explanations & attention heatmaps for clinical interpretation
✅ Production-Ready Code - Full CI/CD pipeline, Docker containerization, and deployment on Cloud Run
✅ Comprehensive Documentation - 260+ line README with scientific references, quick-start guides, and clinical impact framework
✅ Open-Source Contribution - Peer-reviewed methodologies with reproducible code
What we learned
- Multimodal Learning: How to effectively combine diverse data modalities while maintaining interpretability
- Clinical AI: The importance of explainability and validation in healthcare applications
- Production ML: Scaling from notebooks to production systems with proper monitoring and reproducibility
- Domain Adaptation: Transferring pre-trained models across different populations and recording conditions
- Team Collaboration: Cross-functional integration between ML engineers and domain experts
What's next for Hack4Health_Alzheimers_Detection
🔄 Real-Time Monitoring Dashboard - Web interface for continuous patient tracking with alert systems
🧠 Longitudinal Analysis - Track disease progression over months/years with predictive modeling
🌍 Multilingual Support - Expand speech processing to non-English languages (Portuguese, Spanish, French)
📱 Mobile Application - Deploy lightweight inference engine on edge devices (iOS/Android)
🏥 Clinical Trial Integration - Partnership with healthcare providers for real-world validation
Repository: github.com/PauloTuppy/Hack4Health_Alzheimers_Detection
Try it now: Google Colab Demo
Early detection saves lives. Together, we can democratize cognitive health screening. 🧬💙
Log in or sign up for Devpost to join the conversation.