Aper.io

Logo
Rapid Drop, then Slow Improvement
Landing Page
GIF
Drag and Drop

Inspiration

Quality enhancement is a field that seems to have stagnated. Static mathematical methods have been assumed to be the best when, in this age of computing, static formulas may no longer be king.

What it does

Aper.io is a Machine Learning Web App that can takes in low quality web apps and makes the comprehensible. The application of deep learning within the field of quality enhancement has been largely unexplored, especially in the realm of video.

Given any low quality video, Aper.io utilizes a series of Deep Convolution Neural Network Style Transfer Models which, given a series of frames, is able to predict smooth interpolation images. Due to the fact that this application is a largely unexplored space, we trained our predictive style transfer models from scratch. We did our prototyping on our own local machines, and once we had a solid foundation, we moved all our training onto Google Collaborate, which was key in meeting the deadline.

See Our Model In Action

How we built it

We were a little lost as to where to start on this process, so we just decided to see if there were any research papers on the topic. While there were no directly applicable papers, we did find a two particular papers whose ideas we found interesting and used as a starting point for our models and data processing.

From there we split up duties: some of us focused more on updating and debugging the models, and figuring the right architecture ( which involved a great deal of CNN tracing and implementing custom modules in Keras). Others focused on the backend of correctly combining the model output and the original video into a new higher resolution video and then smoothing out the abrupt insertions. While at the same time, others of us focused on developing a UI/UX that would get people actually use this technology.

Challenges we ran into

There were a lot so I'll bullet point

Crazy Merge Conflicts
Dependency Clashing
The Tensorflow Graph is Broken ... Maybe it's actually me :/
Slow Wifi

Accomplishments that we're proud of

We were able to bring it all together. There were a ton of moving parts in this model, from the preproccessing, to training deep predictive models, to sharpening video quality, to developing a slick UI and flask backend, we were just in a grind to get everything done and working up to standards.

What we learned

A lot!

What's next for Aper.io

We are planning on running this on a much larger dataset that we can scrape from the web, try to get the algorithm to generalize much better, and publish a paper potentially detailing a rigorous evaluation of our various methods and outcomes so that other DL practitioners can build off our work, just as we built of others.

Built With

flask
opencv
python
tensorflow
veu.js

Submitted to

LA Hacks 2018
- Winner Make Something People Want Grant: 1517
- Winner LA Hacks Second Place

Created by

I worked on the backend using Flask, pre-processed data, created all the graphics, and worked on the model. I also created the front end, which consisted of a landing page, a visual explanation of our algorithm, and a video upload interface.

Markie Wagner
Developed Deep CNN frame-prediction model, and a precise data pre-processing method, and worked on integrating the model outputs to fit smoothly with the original video

Kian Ghodoussi
Worked on breaking videos down into frames, running them through our deep CNN model, inserting new frames, and putting them back together in an efficient way. Also worked on our Flask backend.

Nihar Sheth
Worked on and tested frame prediction Deep CNN model and frame resolution algorithm

Zane Durante
Undergraduate Machine Learning Researcher at USC