Inspiration
The inspiration for our project stemmed from the growing interest and utilization of large language models (LLMs) like GPT-3. We noticed that while these models are incredibly powerful, they sometimes struggle with coherence and staying on topic, especially when interacting with user-generated content. This led us to brainstorm a solution that could harness the knowledge and material provided by users to improve the LLM's abilities in a controlled manner.
What it does
Our project is a web platform designed to enable users, particularly in educational settings such as schools and universities, to contribute their own material and knowledge to enhance the capabilities of large language models. With a safe mode enabled, the chatbot utilizes the information it receives as context, preventing it from generating hallucinated or off-topic responses. Additionally, we developed a method to augment the LLM's knowledge on specific topics, even if it hasn't encountered that data during its training. The core logic of our system revolves around two components: the llama-indexer, which transforms each sentence of the context into a vector, and a vectorial database that efficiently stores and retrieves this data based on correlations in the vector space.
How we built it
We built our platform using a combination of programming languages and frameworks. The web interface was constructed using TypeScript with the React framework, while the backend is built in Python.
Challenges we ran into
One of the primary challenges we encountered was navigating a completely new set of technologies. As a team, we had to familiarize ourselves with various programming languages, frameworks, and libraries to effectively build our platform. Learning the intricacies of this stack while simultaneously orchestrating the components to achieve our goal posed a significant hurdle.
Accomplishments that we're proud of
We're proud to have developed a functional platform that not only enhances the capabilities of large language models but also empowers users to contribute to their improvement. The integration of the llama-indexer and vectorial database provides a robust foundation for efficiently handling and utilizing user-generated content, paving the way for more effective interactions between humans and AI.
What we learned
Through this project, we gained valuable insights into the challenges and opportunities associated with leveraging user-generated content to enhance AI capabilities. We learned the importance of balancing safety and creativity in AI interactions, as well as the significance of efficient data indexing and retrieval in large-scale applications. Additionally, we deepened our understanding of natural language processing techniques and vector space modeling, refining our skills in these areas through practical implementation.
What's next for !AGW
As students enrolled in the Master of Computer Science and Engineering program with a specialization in Artificial Intelligence, our journey doesn't end here; in fact, it's just beginning. This event has ignited our enthusiasm for AI and its potential to transform various domains, from education to healthcare and beyond. We are more enthusiastic than ever to continue learning and exploring new advancements in AI technology. Our next steps involve diving deeper into the intricacies of natural language processing, machine learning, and other AI-related disciplines. We plan to leverage our experiences from this project to further our understanding and contribute meaningfully to the field.
Built With
- chatgpt
- flask
- iris
- python
- react
- typescript
- vector
Log in or sign up for Devpost to join the conversation.