Tutor AI is a multimodal LAM that can be used for academic work, helping students when needed.

We built it on Android Studio, using Java. It is multimodal, so you can use text and images as inputs for your prompt. One thing we are proud of with our project is the fact that it is multimodal, and that is because we have put a lot of work into perfecting the system. Some challenges we faced were involved in the input. For our multimodal system, we needed a way to open a menu to select an image.

Something that we have learned while making this is that we cannot expect our project to be 100% flawless. This is our first time in Java, so it took us a long time to figure out the syntax.

Here are a couple of ideas we had to improve on in the future:

  • Port Tutor AI to iOS
  • Give a UI refresh
  • Allow voice recordings to be used as input
Share this project:

Updates