ByteSize

Website Landing Page
GIF
The original video
GIF
The compressed, asciified video
GIF
The uncompressed grayscale video
GIF
The uncompressed video with color inference

Inspiration

We understand that storage space is a huge concern in the world. For the approximately four billion people with low-bandwidth connections, it is difficult to send large video files in a small amount of time. We wanted to provide a novel, yet efficient solution for compressing videos.

What it does

Submit a video file
Converts it to ASCII text using Computer Vision with OpenCV
You can view the compressed ASCII file as a video
Submit a ByteSize’d text file
Converts it back to mp4 video using Machine Learning with PyTorch
Can re-add color to images (from absolutely zero color data in the text)

In the end, our compression algorithm halves the size of an equivalent .MP4 file.

How we built it

Front-end
- React
- Next.js
- TailwindCSS
Back-end
- Python
- Flask for API
- AWS S3
Machine Learning
- Python
- Torch
- OpenCV
- NumPy

Challenges we ran into

Each of us have fairly distinct Computer Science skill sets which made collaboration fun but challenging at times. Installing python libraries and figuring out how to get Torch and CUDA working was a pain. Connecting the front-end, back-end, and machine learning models together was especially challenging.

ML in our model

For our ML application, we used a wide variety of sources to achieve the many features in our project. FIrst, we experimented with bicubic interpolation, after generating a pixel file based on ascii encoded brightness in black and white. This allowed a slightly blurry, but less pixelated file to be created. From there we utilized a pretrained SRGAN model in order to create a super resolution version of our model, which is the basis of our upscaling and interpolation that allows us to create quality images from mere ascii text. The SRGAN model utilizes a CNN neural network with a custom fit 8 conv2d layers we modified for the task. Moreover, we utilized a color interpolation software that allows this super resolution image to become colored for minimal time. At 1 frame per second, it has room for improvement, but for a 24 hackathon we say a job well done! Moreover, we also have a functionality that can double the framerate of the video for a mere 25% increase in space.

Accomplishments that we're proud of

Very proud of everyone for stepping outside their comfort zones. The collaboration went extremely well and we were honestly surprised we were able to get as complete a final product as we did. The machine learning model we trained ended up being way better than we could have imagined (especially the color inferring WOW!) and we are very proud to have developed this novel yet useful compression format.

Also, our logo is AMAZING and was designed by Li. There are so many levels to it! It's an angler fish ready to take BITE of something. But from the other side, it's also a CAMERA with a BITE taken out of it!

What we learned

We all learned a lot about front-end design and machine learning. Making every component of the project work together was also a challenging but rewarding task.

What's next for ByteSize

We would definitely like to continue improving our Machine Learning model to decrease compressed file sizes and increase uncompressed resolution and accuracy. Currently, the model was trained for less than 24 hours. If we trained it for weeks or even months, we would approach a near lossless file compression algorithm.

Built With

Submitted to

HackTX 2022

Created by

I created the Flask backend and was responsible for tying it to the machine learning model. I also helped with various machine learning tasks pertaining to the input and output data formats.

Naveen Iyer
I am a human and I live on Earth
I used React, Next.jk, and TailwindCSS to work on the Front-end, as well as design and visual components.

Li Shen
I worked on the ml model which took in an ascii text file, edited it with bicubic interpolation, enhanced it with a state-of-the-art SRGAN model, and used color interpolation to fully decompress the text into a full mp4 with minimal loss.

Anish Karthik
CS Junior interested in AI Disc: anishfish#5103 Old Devpost Acc: anishfish
Abhishek More
Retired Hackathon Enjoyer

Updates

Naveen Iyer started this project — Oct 16, 2022 11:23 AM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.