CareLens AI: From Messy Healthcare Data to Trusted Decisions
Team: Lakehouse Lifelines
Inspiration
Healthcare planning is only as good as the data behind it. Yet, in many regions, healthcare facility information is fragmented, inconsistent, and difficult to trust.
A facility may claim to offer ICU services, emergency care, or specialized procedures, but the supporting evidence is often buried in unstructured descriptions, incomplete records, or contradictory data fields.
This creates a critical challenge for healthcare planners, NGOs, and coordinators:
How do you distinguish between a true gap in care and a gap in data?
We built CareLens AI to answer that question.
Our goal was not to create another dashboard. We wanted to build a trust layer that helps decision-makers confidently identify healthcare access gaps while understanding the limitations of the underlying data.
What It Does
CareLens AI transforms 10,000 messy healthcare facility records into evidence-backed, uncertainty-aware insights.
The application enables users to:
- Evaluate whether facility claims are supported by evidence
- Identify high-risk medical deserts across regions
- Distinguish between underserved areas and data-poor areas
- Explore facility capabilities with source citations
- Review confidence scores and uncertainty explanations
- Save notes, overrides, and planning scenarios
Every recommendation includes:
- Supporting evidence snippets
- Source references
- Confidence scores
- Human review workflows
We believe users should never have to trust AI blindly.
How We Built It
CareLens AI was built entirely on the Databricks platform.
Data Engineering
- Ingested and profiled the healthcare facility dataset
- Standardized structured fields including location, specialties, and facility metadata
- Cleaned noisy and repetitive text fields
- Created a unified evidence model across descriptions, procedures, equipment, and source URLs
AI Pipeline
We treated all extracted fields as claims rather than ground truth.
Using LLM-powered extraction and classification workflows, we:
- Extracted facility capabilities from unstructured text
- Mapped evidence to standardized healthcare concepts
- Generated trust signals for each capability
- Calculated confidence scores based on evidence quality, consistency, and completeness
- Identified contradictory or sparse records requiring human review
User Experience
The Databricks App provides a simple workflow for non-technical users:
- Explore healthcare coverage on an interactive map
- Filter by geography and care need
- Review trust-weighted facility recommendations
- Inspect supporting evidence and uncertainty indicators
- Save notes and planning decisions
Databricks Technologies Used
- Databricks Apps
- Delta Lake
- Unity Catalog
- Mosaic AI
- Vector Search
- Databricks SQL
- MLflow
- Lakehouse Monitoring
Challenges We Faced
Messy Data Is Harder Than Missing Data
The dataset was not simply incomplete—it was inconsistent.
The same capability could appear across multiple fields with varying terminology, conflicting descriptions, and uneven evidence quality.
Trust Is More Important Than Accuracy
Early prototypes produced confident answers that lacked sufficient evidence.
We realized that planners need transparency more than precision.
Instead of optimizing only for prediction quality, we prioritized:
- Explainability
- Evidence traceability
- Honest uncertainty
Designing for Non-Technical Users
Healthcare planners do not want to inspect model outputs or prompts.
We focused on creating a workflow that answers practical questions:
- Where are the care gaps?
- How certain are we?
- What evidence supports this conclusion?
What We Learned
Building CareLens AI reinforced an important lesson:
AI should not replace human judgment—it should strengthen it.
The most valuable feature was not the model itself, but the ability to explain recommendations, quantify uncertainty, and keep humans in control.
We learned that trustworthy AI requires more than predictions.
It requires evidence.
What's Next
Future enhancements include:
- Population-adjusted healthcare access scoring
- Travel-time analysis and routing
- Continuous learning from user feedback
- Integration with public health datasets
- Multilingual support for regional users
Conclusion
CareLens AI helps healthcare planners make better decisions by transforming messy healthcare data into evidence-backed insights.
Because when resources are limited and lives are affected, decisions should be driven by trust—not assumptions.
Built With
- ai/bidashboard
- databricks
- genie
- lakebase
- lakehouse
- lakehouseapp
- mlflow
- pyspark
- unitycatalog
Log in or sign up for Devpost to join the conversation.