Our training and deployment pipeline
Our training and deployment pipeline
Finding similar questions in dimensionally-reduced latent space
Question answering with GPT-2
Logo2
Retrieval Model
Retrieval Part 1
Retrieval Part 2
Retrieval Part 3
Retrieval Part 4
GIF
t-SNE gif

Quality medical information is valuable to everyone, but it's not always readily available. Doc Product aims to fix that.

Whether you've hit your head and are unsure if you need to see a doctor, caught a bad bug halfway up the Himalayas with no idea how to treat it, or made a pact with the ancient spaghetti gods to never accept healthcare from human doctors, Doc Product has you covered with up-to-date information and unique AI-generated advice to address your medical concerns.

We wanted to use TensorFlow 2.0 to explore how well state-of-the-art natural language processing models like BERT and GPT-2 could respond to medical questions by retrieving and conditioning on relevant medical data, and this is the result.

How we built Doc Product

As a group of friends with diverse backgrounds ranging from broke undergrads to data scientists to top-tier NLP researchers, we drew inspiration for our design from various different areas of machine learning. By combining the power of transformer architectures, latent vector search, negative sampling, and generative pre-training within TensorFlow 2.0's flexible deep learning framework, we were able to come up with a novel solution to a difficult problem that at first seemed like a herculean task.

700,000 medical questions and answers scraped from Reddit, HealthTap, WebMD, and several other sites
Fine-tuned TF 2.0 BERT with pre-trained BioBERT weights for extracting representations from text
Fine-tuned TF 2.0 GPT-2 with OpenAI's GPT-2-117M parameters for generating answers to new questions
Network heads for mapping question and answer embeddings to metric space, made with a Keras.Model feedforward network
Over a terabyte of TFRECORDS, CSV, and CKPT data

If you're interested in the whole story of how we built Doc Product and the details of our architecture, take a look at our GitHub README!

Challenges

Our project was wrought with too many challenges to count, from compressing astronomically large datasets, to re-implementing the entirety of BERT in TensorFlow 2.0, to running GPT-2 with 117 million parameters in Colaboratory, to rushing to get the last parts of our project ready with a few hours left until the submission deadline. Oddly enough, the biggest challenges were often when we had disagreements about the direction that the project should be headed. However, although we'd disagree about what the best course of action was, in the end we all had the same end goal of building something meaningful and potentially valuable for a lot of people. That being said, we would always eventually be able to sit down and come to an agreement and, with each other's support and late-night pep talks over Google Hangouts, rise to the challenges and overcome them together.

What's next?

Although Doc Product isn't ready for widespread commercial use, its surprisingly good performance shows that advancements in general language models like BERT and GPT-2 have made previously intractable problems like medical information processing accessible to deep NLP-based approaches. Thus, we hope that our work serves to inspire others to tackle these problems and explore the newly-open NLP frontier themselves.

Nevertheless, we still plan to continue work on Doc Product, specifically expanding it to take advantage of the 345M, 762M, and 1.5B parameter versions of GPT-2 as OpenAI releases them as part of their staged release program. We also intend to continue training the model, since we still have quite a bit more data to go through.

Colaboratory demos

Take a look at our Colab demos! We plan on adding more demos as we go, allowing users to explore more of the functionalities of Doc Product. All new demos will be added to the same Google Drive folder.

DISCLAIMER

The recommendations of an open-source AI application is not a substitute for professional medical care. If your condition is worsening, please go to your primary care provider. If you are having an emergency, please go to the nearest hospital or call your country's emergency number.

The purpose of this project is to explore the capabilities of deep learning language models. Although the application will help find publicly available medical advice, you are following it AT YOUR OWN RISK.

DoctorBert

Thanks!

We give our thanks to the TensorFlow team for providing the #PoweredByTF2.0 Challenge as a platform through which we could share our work with others, and a special thanks to Dr. Llion Jones, whose insights and guidance had an important impact on the direction of our project.

Built With

colaboratory
keras
python
tensorflow-2.0

Submitted to

⚡#PoweredByTF 2.0 Challenge!
- Winner Present your project to the TensorFlow engineering team!
- Winner Share your project with the TensorFlow Community!

Created by

Ideated and developed the crazy retrieval algorithm using vector search to augment language models with external knowledge. I focused on generative NLP and knowledge retrieval, including fine-tuning the GPT-2 model as well as designing/developing the retrieval pipeline from generating embeddings for similarity search to conditioning on retrieved sequences for generation. I was also in charge of the project presentation, including the DevPost page and GitHub README.

Alex Sheng
Background research/development/design of architecture; capabilities and limitations. Recruited members. Coordinated tasks. Proofread code and flow of tensors among different code contributions. Hyperparameter optimization of the FFNN section of the architecture. Train FFNN version (frozen BERT weights) of architecture. Data curation/cleaning/wrangling. Code packaging.

Santosh Gupta
I love Machine Learning / NLP , especially for analyzing scientific texts
Build data input pipeline from CSV to tf records to tf tensor. Build FFNN embedding model and BertFFNN embedding model. Integrate FAISS similarity searching code into pipeline. Build inference pipeline. Optimize code performance. Train BertFFNN and GPT2.

Jay Yip
WebMD and Healthtap data gathering. TSNE dimensionality reduction, Plotly interactive visualization, FAISS similarity search, Demo construction

Ryan Chesler
Iordanis Fostiropoulos
Sanny Lu