Online Class? No Confusion!

Inspiration

Facing with hundreds of online attendees on a single computer screen, can professors find an elegant way to test students' understanding of the course materials? Some students are just too shy to ask questions when they are confused. Or sometimes the professor is going too fast that students, though confused, don't have time to ask questions. So we are inspired to make a software to help professors quickly find out how many students are confused so that they can slow down or explain more at some points.

What it does

Our program is a python program running on the backend. When the professor is doing zoom lecture with the students, the program takes a screenshot of the zoom's video gallery. Then the program will split the whole picture into individual pictures and send them to the Azure server. The model we have trained will classify the individual pictures as "confused" or "unconfused". Then the confused/all ratio is calculated. If the ratio is higher than 50%, an alert is sent to the professor indicating that the students may no understand the materials very well.

How I built it

We created our own dataset of confused / not-confused faces to train the model on Custom Vision. The program runs on python and takes screenshot of the meeting on regular intervals, process it to retrieve the individual faces and then determines whether or not each student is confused or not.

Challenges I ran into

At first, we wanted to use the pre-trained Face api under Azure Cognitive Services to detect students' facial expressions. However, we found that this model only includes emotions like "happiness" and "sadness", but without "confusion". So, we decided to train our own model with our own pictures using the Azure Custom Vision. And it turned out to make a pretty good prediction.

Accomplishments that I'm proud of / What I learned

We have learnt the powerful Azure Cognitive Services and how to integrate them with our own python program.

What's next for Online Class? No Confusion!

After the hackathon, we want to train a model with a larger database that contains more pictures labeled as confused. In this way, we can predict the "confusion" expression with higher precision. We also want to try to integrate APIs from Zoom and other online chatting softwares so that we can directly get the video stream instead of taking screenshot. We may even collaborate with Zoom, lnc. to integrate the confusion detection feature into Zoom for educational purpose.

Built With

Submitted to

MAIS Hacks 2020

Created by

I worked on training the Model on Azure. Tried making the appropriate API call to get the information.

Tom SARRY
Computer Science student at McGill University
Kevin Xu
Yu-Cheng (Kevin) Cho
Currently, a Ph.D. student at McGill. Demonstrated skills in Python programming language and the development of machine learning models.

Updates

Kevin Xu started this project — Oct 03, 2020 10:03 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.