Inspiration
The inspiration for DeepStudy came from the challenges researchers and students face when going through vast amounts of literature. Sifting through research papers to find relevant information can be time-consuming, and sometimes the information is buried deep within the text. I wanted to create an AI-driven tool that simplifies this process by enabling users to upload research papers, ask specific questions, and receive insightful, contextually relevant answers—mimicking the experience of having a knowledgeable mentor at your side.
What it does
DeepStudy allows users to upload PDF documents, particularly research papers, extract the text, and query the content in a conversational manner. The tool utilizes state-of-the-art language models from Hugging Face, integrated with FAISS for efficient vector-based search, enabling users to receive accurate, context-aware answers to their queries. It’s a step toward making research more interactive and accessible, enabling users to explore literature at a deeper level by asking precise questions.
How we built it
We built DeepStudy using a combination of modern technologies:
I used Streamlit as the front-end framework to enable easy PDF uploads and interaction, while FAISS handles efficient vector storage and similarity search for quick retrieval of relevant text from large documents. The application leverages Hugging Face Transformers to implement advanced NLP models that power query answering, with pre-trained models ensuring accurate understanding of research language and structure. OpenAI Embeddings are used to convert text into embeddings, facilitating semantic search and contextual comprehension of user queries. LangChain orchestrates the interaction between language models and the query system, ensuring smooth conversational exchanges, while PyMuPDF extracts text from uploaded PDFs for further analysis.
Challenges we ran into
Text extraction quality: PDF documents often have complex formatting, and ensuring clean and reliable text extraction from different formats was a major hurdle. Embedding large documents: Handling large research papers required efficient embedding and storage mechanisms, which we addressed using FAISS, but balancing accuracy with performance was a challenge.
Accomplishments that we're proud of
Successfully integrated a system that allows users to explore research papers interactively, bridging the gap between static documents and dynamic information retrieval.
What we learned
I gained a deep understanding of the challenges involved in extracting clean and usable text from complex PDF documents. Enhanced my skills in vector storage and similarity search, particularly in applying FAISS to large datasets. Learned how to fine-tune pre-trained models from Hugging Face for more specialized tasks, improving the overall performance of the system. This project furthered my knowledge of integrating different components (NLP, vector search, and text extraction) into a seamless user experience.
What's next for DeepStudy
Mobile support: We are planning to expand DeepStudy to support mobile devices, making it more accessible to users on the go.
Built With
- langchain
- llms
- python
Log in or sign up for Devpost to join the conversation.