Hospital Vital Indicators Dashboard for NGOs
Team: Los Mesoneros
Track #2: NGOs
Background
Our objective was to produce a dashboard that enables NGOs to easily look up the status and needs of hospitals in Venezuela. This helps NGOs decide what hospitals to target, and what their greatest needs are. The dashboard gives a high-level overview of hospital status across a few key categories: medical supplies, operability, water, power, and nutritional availability.
To accomplish this, we created a Python data cleaning script, and a Tableau dashboard. The Python script cleans and prepares the DFH survey data, and outputs a CSV file for use in Tableau. Also, we created lookup table file with all the information about the hospitals, by code and state.
All code and data files can be found at this GitHub repository: https://github.com/yung-ethan/c4v
The Tableau dashboard file is Hospital Vital Indicators.twb.
Data Quality Issues
As the initial analysis on the dataset showed, C4V can improve the quality of its information, by doing some changes in the google form.
Question #4: "Ente Administrativo" There is only five options allowed multiple selections. The suggestion is to have all the possible regions as fields, as they already exists in Question 2.
Question #48: "Power Outage Number by Day" In the survey, manual input for number. The suggestion is to have a single selection field rather than a multiple selection field Allowed answers: Yes | No | No, Yes
Question #49: "Power Outage Number by Day" In the survey, manual input for number. The suggestion is to have a drop drown list rather than a typed field
Question #53: "Equipment Names" In the survey, manual input for equipments' names. The suggestion is to have a drop drown list rather than a typed field
Data cleaning script
Run data cleaning by executing the Python script:
python clean_survey_data.py
This outputs the file "cleaned_survey_dataset.csv".
Cleaning operations:
- Created an extra string column with the report week, formatted as "[Year]-[Week of Year]". This allows for sorting the data in report week order.
- De-duplicated the reports by only using the latest submitted report by hospital and week.
- Created a corresponding numeric (ordinal) column for each of the medicine supply fields. Also calculate an aggregated rating of medical supplies.
- Similarly, created a corresponding numeric column for each of the operability fields.
Log in or sign up for Devpost to join the conversation.