The Problem

Clinical trials

For a novel drug to gain approval and become available to patients, it has to undergo three phases of clinical trials. Each phase requires dozens to thousands of participants to determine the appropriate dosage, effectiveness, and safety of the drug. However, there is a significant need to improve communication between potential participants and researchers.

Participants' experiences

The protocols and study plans accessible to potential participants are often filled with technical jargon, making it difficult for patients to understand what to expect from the trial. This confusion can lead to participants unknowingly accepting risks or being discouraged from participating due to misunderstandings. A notable example is the BIA 10-2474 trial, which resulted in one participant being declared brain dead and four others suffering irreversible brain damage (Kaur, R., 2016). This tragic outcome highlights the gap between what patients understand and the risks they take.

Researchers' experiences

The current communication between potential participants and researchers is also not beneficial for researchers. This one-sided approach relies on patients to seek out information, often resulting in insufficient recruitment—approximately 80% of clinical trials close due to low enrollment (Clinical Trials Arena, 2022). Our team proposes ClinicalConsent as a solution to facilitate communication between potential participants and researchers, fostering a safer and more engaging interaction throughout the clinical trial process.

Our Solution

ClinicalConsent

By utilizing the GPT API, ClinicalConsent simplifies complex study plans and protocols into clear, easily understandable information for participants. This approach reduces barriers to engagement in clinical trials. Additionally, our platform includes a recommendation system that matches potential participants with suitable clinical trials based on their profiles. Ultimately, we aim to enhance the clinical trial process and improve safety outcomes.

How we built it

Our team retrieved and processed existing clinical trial datasets from the ClinicalTrials.gov APIgenerating clear and concise summaries for each trial using the OpenAI API to make protocol descriptions easily understandable for non-experts. We also developed a real-time chatbot with the OpenAI assistant API to support users with any questions or gaps in understanding. To streamline the application’s functionality, we utilized Firebase DB to store data on participants, researchers, and clinical trials, which we later incorporated into a filtering system to provide optimal matches between participants and trials. The entire application is hosted on Streamlit.

Challenges we ran into

Our main challenges came from our first-time use of Firebase DB and the ClinicalTrials.gov API. Setting up four separate databases in Firebase for participants, researchers, clinical trials, and studies was a big challenge we managed to get through. We also hit Firebase's rate limit when attempting to process 67,000 records, which led us to upgrade our account. Additionally, understanding the data structure of the ClinicalTrials.gov API was challenging and took extra time, which slowed down our data import process. The retrieved data had many missing values and was unclean, requiring us to clean it for the filter function to work properly in our matching recommendation system.

Accomplishments we are proud of

One of our key accomplishments is adding over 67,000 clinical studies to ClinicalConsent, with the capacity to include more. We also optimized and implemented an assistance API to generate clear, concise summaries for each clinical trial. This resulted in over 200,000 API requests processed throughout the project. To handle the high volume of requests, we utilized UNC’s Research Computing Cluster, LongLeaf, and implemented batch processing for efficiency. For a smoother user experience, we created a dummy page in Streamlit, allowing the app to transition seamlessly between pages.

What's Next for ClinicalConsent

Currently, the data we retrieve from ClinicalTrials.gov does not categorize participant eligibility into high, medium, and low; instead, it either matches participants to a study or excludes it from the participant view if they do not meet eligibility criteria. In the future, if we convert this data into a categorized format or if researchers upload trials through our platform, we can utilize them to provide more optimal matches due to the more granular options we offer. Furthermore, our system can be expanded to enable researchers to automate outreach to potential participants who are optimal matches taking the burden off of researchers recruiting human subjects for their work and allowing more time to be spent on research.

Built With

api
clinicaltrials.gov
css3
firebase
geopy
html5
json
jupyterlab
openai
pandas
python
streamlit

Submitted to

Duke AI Hackathon 2024

Created by

I'm Yeonseo Kim (or Eunice), a Master's student in Biomedical Engineering at Duke University. My primary interest is in drug research, and I am expanding my scope into computational drug development. This was my first experience working on an AI-related project, and I enjoyed every aspect of it as much as I learned!

In this project, I was responsible for conceptualizing the idea, prompt engineering, and data extraction. I designed the core features, including the trial summaries and a matching recommendation system. For prompt engineering, I extracted the dataset from ClinicalTrials.gov API and based on that, crafted specific prompts for the OpenAI API to generate the most optimal summaries of the protocols.

Yeonseo-Eunice Kim
My name is Claire Kim, and I’m a first-year Master’s student at Duke University, pursuing a degree in Biomedical Engineering with a focus on biomedical data science. I am passionate about enhancing patient outcomes through precision treatments and advancing healthcare from reactive to preventive care.

For this project, I was responsible for fetching clinical trial data using the ClinicalTrials.gov API, retrieving a total of 67,327 records, and storing them in the Firestore database. I leveraged this data to generate user-friendly keywords and summaries with the OpenAI API, ensuring they were accessible to a general audience and updating them in our existing database. Additionally, I developed a real-time chatbot that responds to user questions based on the clinical trial information displayed above the chatbox. I also refined the markdown settings for Streamlit to enhance user experience and presentation.

HyoJoo Kim
My name is Riley Harper, and I’m a senior at the University of North Carolina at Chapel Hill majoring in Statistics and Analytics and Applied Mathematics. In my free time, I enjoy reading, conducting research, and participating in AI@UNC. I lead a research team as Technical Chair for an AI-integrated awake craniotomy project in partnership with the UNC School of Medicine. My coursework includes Deep Learning, Machine Learning, Data Science, Data Structures and Algorithms, Linear Algebra, and Models of Language and Computation.

In this project, I developed the backend of our Streamlit application, enabling seamless page navigation and integrating user authentication with Firebase. This system manages a database of over 67,000 active clinical trials and AI-generated summaries. I implemented user profile-based filtering to connect individuals with relevant trials and designed a custom researcher interface for detailed participant criteria. Additionally, I utilized the LongLeaf UNC Research Computing Cluster to handle over 200,000 API requests, facilitating the retrieval of clinical studies and communication with the OpenAI Assistants API.

Riley Harper
My name is Michael Puglise, I’m a junior at Duke University pursuing a BS in Computer Science with a concentration in Artificial Intelligence. I love programming in Java, Python, and React and am proficient in topics like Data Structures and Algorithms, Computer Systems, Data Science, and Machine Learning.

For this project I was responsible for integrating Firebase’s Authentication and FireStore Database system. This helped new users create a profile in our landing page and store crucial profile and demographic information. Additionally, I implemented the POST operations for the participant/researcher profile creation forms and led the CSS styling for the web-app. I hope you all enjoy exploring our site!

Michael Puglise

Updates

Yeonseo-Eunice Kim started this project — Nov 03, 2024 01:08 AM EST

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.