Inspiration
We were inspired by the Talking Tom Cat game and wanted to add a human touch to a LLM by adding audio and visual elements into the interactions with users.
What it does
Connec.GPT receives a prompt via voice or text and will provide a reply both in text and audio.
How we built it
We used React.js as the frontend and babylon.js to render the humanoid. We made API calls to the OpenAI API to process the prompts and the GCP Voice API for the audio response.
Challenges we ran into
We faced difficulties in figuring out how to make valid OpenAI API calls. We tried to use the Assistant's API but could not figure out the documentation. We were also limited by the amount of time we had, so we switched to directly prompting the LLM.
We tried to fine-tune an open source model like Mistral using QLORA but were limited by time and VRAM.
We could not deploy the model onto vercel and we plan that with additional time, we will deploy this as a service.
Accomplishments that we're proud of
We managed to build something cool and fun.
What we learned
Working with new people from various disciplines with a different tech stack.
What's next for Connec.GPT
We were thinking of connecting people with similar interests or personalities together via the chatbot. This could be done through a profile matching algorithm to match people's personality summary generated backend,
Built With
- babylon.js
- gcp
- openai
- react.js
Log in or sign up for Devpost to join the conversation.