Inspiration

Most times, I prefer consuming books as if I'm being told a story by someone. Hence the idea pop up with using python for it

What it does

It extract texts from pdf file and produce audio output of it using .mp3 format.

How we built it

I made use of the pypdf library for reading contents off pdf file. After that, the text content it's sent to the function for handling the text to mp3 which is made possible by pyttsx package

Challenges we ran into

I ran into dependencies issues that pyttsx depends on and also haven't figured out how to make the text to audio conversion works for all pages in a pdf

Accomplishments that we're proud of

I was able to attempt the python script I've wanted to do

What we learned

They're different kinds of pdf files when working with text extraction in pdf, some are images scanned, some generated by pdf softwares and the encrypted ones

What's next for PDF-AudioBook

Lots of features are still missing from the script. So i plan on adding them gradually. The script hasn't been able to convert full a single page of pdf.

Built With

Share this project:

Updates