Inspiration

Alzheimer's disease affects over 55 million people worldwide, with early diagnosis being critical for treatment efficacy. Current diagnostic methods are time-consuming, expensive, and often require specialized clinical infrastructure. We were inspired to develop an accessible, non-invasive AI-powered solution that leverages multimodal biomarkers (speech patterns and EEG signals) to detect cognitive decline with high accuracy, democratizing early screening across diverse populations.

What it does

Our solution combines Deep Learning with multimodal data fusion to detect Alzheimer's disease and predict disease progression:

  • Speech Analysis Module: Extracts acoustic and linguistic features from speech samples
  • EEG Processing Pipeline: Analyzes brainwave patterns across 5 frequency bands ($\Delta$, $\theta$, $\alpha$, $\beta$, $\gamma$)
  • Attention-Based Fusion: Learns optimal feature weighting across modalities
  • Clinical Risk Scoring: Generates interpretable biomarker profiles for clinicians

Achieved 97% accuracy in distinguishing healthy controls from cognitive impairment cases across both modalities.

How we built it

Architecture Overview

We implemented an ensemble approach combining three specialized models:

1. Speech Processing Module

# Feature extraction pipeline
- Mel-Frequency Cepstral Coefficients (MFCC)
- Prosody features (pitch, energy, duration)
- Voice quality metrics (jitter, shimmer)

2. EEG Classification Model

  • Input: 5-second EEG segments across 6 channels
  • Architecture: Temporal Convolutional Networks (TCN) + Attention
  • Output: Probability scores per frequency band

3. Multimodal Fusion Layer

  • Cross-modal attention mechanism to learn feature importance
  • Late fusion strategy for robust predictions
  • Interpretability via SHAP values and attention heatmaps

Technology Stack

  • Core: Python 3.10, TensorFlow 2.13, PyTorch
  • Data: ADReSSo dataset (speech), local EEG recordings
  • Deployment: Google Cloud Run, Docker containerization
  • Interpretability: SHAP, Integrated Gradients, Attention visualization

Challenges we ran into

  1. Data Imbalance: Significant class imbalance (healthy >> cognitive impairment)

    • Solution: Implemented stratified k-fold CV + weighted loss functions
  2. Modality Heterogeneity: Combining sequential (speech) and signal (EEG) data

    • Solution: Designed separate feature extraction pipelines + late fusion
  3. Temporal Alignment: EEG segments of varying lengths

    • Solution: Applied zero-padding + attention masking mechanisms
  4. Clinical Validation: Ensuring model outputs align with clinical protocols

    • Solution: Consulted literature & implemented SHAP explanations for clinician transparency
  5. Computational Efficiency: Real-time inference requirements

    • Solution: Knowledge distillation to lightweight MobileNetV2 backbone

Accomplishments that we're proud of

97% Classification Accuracy - Outperforming baseline methods by 15%

Multimodal Integration - Successfully fused heterogeneous biomarkers

Explainable AI - Generated SHAP explanations & attention heatmaps for clinical interpretation

Production-Ready Code - Full CI/CD pipeline, Docker containerization, and deployment on Cloud Run

Comprehensive Documentation - 260+ line README with scientific references, quick-start guides, and clinical impact framework

Open-Source Contribution - Peer-reviewed methodologies with reproducible code

What we learned

  • Multimodal Learning: How to effectively combine diverse data modalities while maintaining interpretability
  • Clinical AI: The importance of explainability and validation in healthcare applications
  • Production ML: Scaling from notebooks to production systems with proper monitoring and reproducibility
  • Domain Adaptation: Transferring pre-trained models across different populations and recording conditions
  • Team Collaboration: Cross-functional integration between ML engineers and domain experts

What's next for Hack4Health_Alzheimers_Detection

🔄 Real-Time Monitoring Dashboard - Web interface for continuous patient tracking with alert systems

🧠 Longitudinal Analysis - Track disease progression over months/years with predictive modeling

🌍 Multilingual Support - Expand speech processing to non-English languages (Portuguese, Spanish, French)

📱 Mobile Application - Deploy lightweight inference engine on edge devices (iOS/Android)

🏥 Clinical Trial Integration - Partnership with healthcare providers for real-world validation


Repository: github.com/PauloTuppy/Hack4Health_Alzheimers_Detection

Try it now: Google Colab Demo

Early detection saves lives. Together, we can democratize cognitive health screening. 🧬💙

Built With

Share this project:

Updates