Inspiration
Most of the time, I prefer consuming books as if someone were telling me a story. That is where the idea of using Python for this came from.
What it does
It extracts text from a PDF file and produces audio output of it in .mp3 format.
How we built it
I used the pypdf library to read the contents of the PDF file. The extracted text is then sent to a function that handles the text-to-mp3 conversion, which is made possible by the pyttsx package.
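The two steps above can be sketched roughly as follows. This is a minimal sketch, not the project's actual script: it assumes the `pyttsx3` package (the maintained successor to pyttsx) and its `save_to_file` method, and the function names `join_pages`, `extract_text`, and `text_to_audio` are made up for illustration. The audio encoding that actually gets written depends on the speech driver installed on the machine, so a file named `.mp3` is not guaranteed to contain MP3 data.

```python
def join_pages(pages):
    """Join per-page text into one string, skipping pages with no text."""
    return "\n\n".join(p.strip() for p in pages if p and p.strip())


def extract_text(pdf_path):
    """Read every page of the PDF and return the combined text."""
    from pypdf import PdfReader  # third-party: pip install pypdf

    reader = PdfReader(pdf_path)
    # extract_text() may return an empty string for image-only pages
    pages = [page.extract_text() or "" for page in reader.pages]
    return join_pages(pages)


def text_to_audio(text, out_path):
    """Queue the text for speech synthesis and write it to out_path."""
    import pyttsx3  # assumption: pyttsx3 rather than the original pyttsx

    engine = pyttsx3.init()
    engine.save_to_file(text, out_path)
    engine.runAndWait()  # blocks until the queued file has been written


if __name__ == "__main__":
    # "book.pdf" and "book.mp3" are placeholder file names
    text_to_audio(extract_text("book.pdf"), "book.mp3")
```

Looping over `reader.pages` is one way to cover every page of the document instead of only the first one.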
Challenges we ran into
I ran into issues with the dependencies that pyttsx relies on, and I haven't yet figured out how to make the text-to-audio conversion work for all pages in a PDF.
Accomplishments that we're proud of
I was able to make a first attempt at a Python script I have wanted to write for a while.
What we learned
There are different kinds of PDF files when it comes to text extraction: some are scanned images, some are generated by PDF software, and some are encrypted.
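A rough way to tell these three kinds apart before attempting conversion is sketched below. It is only a heuristic under stated assumptions: a PDF whose pages yield no extractable text is treated as probably scanned, which can misclassify unusual files. The function names `classify_pdf` and `inspect_pdf` are hypothetical; `PdfReader.is_encrypted` and `page.extract_text()` come from pypdf.

```python
def classify_pdf(page_texts, is_encrypted=False):
    """Return 'encrypted', 'scanned', or 'text' for a PDF.

    page_texts is a list with the extracted text of each page.
    """
    if is_encrypted:
        return "encrypted"
    # No page produced any text: likely scanned images (heuristic)
    if not any(text.strip() for text in page_texts):
        return "scanned"
    return "text"


def inspect_pdf(pdf_path):
    """Open a PDF with pypdf and classify it."""
    from pypdf import PdfReader  # third-party: pip install pypdf

    reader = PdfReader(pdf_path)
    if reader.is_encrypted:
        # Pages cannot be read until the file is decrypted
        return classify_pdf([], is_encrypted=True)
    texts = [page.extract_text() or "" for page in reader.pages]
    return classify_pdf(texts)
```

A scanned PDF would need an OCR step before any text could be read aloud, and an encrypted one would need its password first.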
What's next for PDF-AudioBook
Lots of features are still missing from the script, so I plan on adding them gradually. The script cannot yet convert even a single full page of a PDF.