Audio Video Transcript AI

Inspiration

Audio and Video Files transcriptions plays a very vital roles in our everyday activities.

1.) Various Organizations, Companies, Agencies etc. rely on AssemblyAI for transcription of their Phone calls, Voice Notes, Video Files to better serve their Clients and Customers.

2.)Lecturers and Students rely on AssemblyAI for transcription Video Lecture Notes to better get insight into Topics of Discussion, Lecture Summary etc.

3.) Governmental Agencies,Politicians, Presenters etc. rely on AssemblyAI for transcription Video and Audio files to better get insights into Speaking Points, Topics of Discussions, Name Entities, Places and People involves, Video Content Summary, Sentiments etc.

4.) Farmers, Agriculturists etc. rely on AssemblyAI for transcription Video and Audio files to better get insights into reading and analyzing of Farmers Audio Legal Contracts Agreements to break the content into easy and digestible form and to prevent Cheating and Fraudulent activities by their Contracting Companies/Parties.

5.) Doctors and Patients: Patients go to Medical Appointment with Doctors but most often they forget most of what Doctors told them they should do especially deaf and hard of hearing People thus, developing an apps that transcribe a person’s speech to text in real time are on the rise in the deaf and Hard of Hearing community.

I also want to break down each of the Phone Calls, Audio Contents, Voices Notes, Video Contents etc. into more simpler, easier and digestible form for Hard of hearing People.

The question is? How do we get every bit of message or main details of the message in the Phone Calls, audio calls/voice notes, Video Contents etc, the sentiments, entities involved etc. in order to facilitate a smooth conversations and decisions.

I realized that If We can convert Phone Calls, audio Calls/Voice notes, Video Contents etc. to Text Messages and then run Sentimental, Topics Detections, Keywords & Keyphrases and Name Entities Analysis to detect Callers Moods, Sentiments and main People, Organizations, Entities, involved in the Audio Calls, it will have a long way in helping various businesses alot at long run.

I also realized that we can save a lots of time and energy if we can run Text Summary of the Phone calls, Voice Notes/audio call, Video Files by pointing the User to the actual content of the Video, audio/voice note messages.

What it does

Audio-Video Transcription AI is an interactive app that keep all your Audio and Video files in one secured place. It is an interactive application that provides Text Messages equivalents of your Phone Calls, Audio Calls/Voice Notes, Audio Files, Video contents etc. leveraging AssemblyAI

It analyze your Video/Audio files for

Speaking Points(Speakers Main Points)
Keywords and Keyphrases Detection
Topic Detection
Get Important Key Phrases and Keyword Words
Sentiment Analysis
Summarization
Names and Entity Detection

After Phone Calls, Voice Notes, Audio Files, Video Files etc. is being Converted to Text Messages. The Application Leverages

1.)AssemblyAI Sentiments API to detect and analyze Sentiments in the Audio Calls/Voice Notes, Video Contents for text message positivity(Happy Mood), negativity(Sad Mood) and Neutrality(Mild Mood) Statements.

2.) AssemblyAI Name Entity API to detect and analyze all the names, People, Locations entities etc. involved in the Audio/Voice Notes, Video Files Text Messages.

3.) AssemblyAI Summary API to summarize the main details of the Audio/Video Files Text messages. This help to save more time by providing the summary of Video, Audios, Voice-Notes Text version which points the user to the main details of the audio calls/Voice Notes, Video Contents etc..

4.) AssemblyAI Topic Detection API to detect Speakers Main Points, topics etc. that are spoken in audio/video files Text Documents Transcriptions

5.) Keywords and Keyphrases Detection Detection API to detect main keywords, keyphrases etc. that are spoken in audio/video files Text Documents Transcriptions

6.) The Applications leverages Google Statistical Graphs/Charts to display Graphical/Charts Visualization of each Video Contents, Phone Calls, Audio Calls or Voice Notes Sentiments and Name Entities Analytical data leveraging AssemblyAI Sentimental and Name Entities Analytical Data

Additional features that makes our app the best:is that Transcripts can be saved and edited in real-time so that you can share contents with others any time any day.

How to Test the Application Online

1.)Signup and Login

2.) Add your Video Files, audio/voice notes. You can either enter URL of the Video/Audio Files or by browsing Files locations and You are there....

How we built it

Built with AssemblyAI, php, Mysql Ajax/Jquery, Bootstraps etc.

. I used Google Charts to build Name Entities Vs Start Points, Sentiments Polarity vs Score Chart distribution Analysis.

How to run the application locally.

Please see the readme.txt file included in the application project codes

What's next for Audio-video-transcript AI

More Features coming soon

Built With

Updates

Fredrick C. Esedo started this project — Dec 11, 2022 01:39 PM EST

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.