BookLens

Choose your native language
Take a picture of the page
Click on the word and learn
Get the translation of a selected sentence

Inspiration

Nowadays, kids spend most of their time on a tablet or computer instead of reading books. This interactive application will make the whole reading experience more fun.

What it does

BookLens captures any textual document and provides the ability to read aloud texts of the whole page, a sentence, or just a word in any language desired on click. It is also able to display the picture on word-click and translation.

How we built it

Back-end and front-end are written in Node.js and React respectively, using multiple apis from Google Cloud (Vision, Text-to-Speech, Translation) to manipulate both images and texts, as well as the Shutterstock API to retrieve images.

Challenges I ran into

Canvas and structuring data. The way that Google Vision API returns back OCR data, it is unsuitable to our needs and the data and to be parsed and restructured to follow the models we require. The algorithm was redone three times at 5 am but it finally worked!

Accomplishments that I'm proud of

Most base features are completed and work as expected.

What I learned

How to integrate via Google Cloud whether it is with their REST API, gRPC, or their client libraries. Also canvas and how it will never be used again :').

What's next for BookLens

For features, it would be translation of whole pages, read translated texts aloud, word visualization. and the ability to go back to previous page, among many other possible ideas!

Built With

cloud
google
google-cloud
node.js
ocr
react
shutterstock
text-to-speech
translation
vision

Submitted to

ConUHacks V
- Winner 1st Place - Nintendo Switch + Jetbrains licences + 1password 3 year licenses + digital ocean $2000 credit + 4x crystal cubes with a laser etching of a RADARSAT Constellation Mission satellite (from MDA)

Created by

I created the front-end app which communicated with the backed, scanned in text, and allowed users to tap on words to learn.

Caleb Mech
I worked on the back-end, mainly on providing APIs that restructures the data the Google Cloud Vision API returns. I have also built a custom in-memory database for persisting past pages. I haven't had much experience with Node and Javascript, but I learned that I definitely prefer Typescript.

Andy Ta
I worked on the back-end, mainly providing APIs for features such as text translation and word to images using Google's translation and shutterstock's images search services. I also spent a lot of time debugging and refactoring at 3AM...

Yufeng Ding
Rosa Phung