Inspiration

The inspiration for SpeakPro AI stems from recognizing the critical importance of effective communication in today's competitive world. Several key insights drove the creation of this project:

1. Job Interview Struggles - Witnessing many talented peers struggle during interviews despite having strong technical skills, highlighting the gap between knowledge and communication ability
2. Limited Practice Opportunities - Recognizing that meaningful feedback on communication is typically only available in high-stakes situations like actual interviews or presentations
3. The "50% Rule" - Understanding that non-verbal cues account for approximately half of communication effectiveness, yet most people receive little formal training in this area
4. Technology Gap - Identifying that while AI has advanced rapidly, few applications focus on developing essential human skills like conversation and presentation abilities
5. Personal Growth Challenges - Experiencing firsthand the difficulty of improving communication without objective feedback and structured practice opportunities
6. Educational Inequity - Realizing that access to communication coaching is often limited to those with the financial resources for professional training
7. Virtual Communication Revolution - Acknowledging the increasing importance of effective communication in remote and hybrid work environments

What it does

SpeakPro AI is a comprehensive communication skills development platform that analyzes both verbal and non-verbal aspects of conversations. It provides personalized feedback on speaking patterns 🗣️, offers AI-powered interview simulations with customizable scenarios 🤖, analyzes body language through our Nonverbal AI Vibe feature 👁️, and enables practice in virtual 3D meeting environments 🌐. Users can record real conversations for analysis 🎙️ or engage with AI interviewers to prepare for specific professional scenarios, receiving detailed feedback on their performance 📊.

How we built it

We developed SpeakPro AI using a multi-technology stack. The frontend was built with React.js to create an intuitive user interface 💻. For the backend, we used Node.js to handle API integrations and data processing ⚙️. The Nonverbal AI Vibe component was implemented using Python and Streamlit 📈, leveraging MediaPipe for facial landmark detection, head pose estimation, and gesture recognition 👤. We integrated Google's Gemini API for conversation analysis and interview simulation 🧠, ElevenLabs for voice generation 🔊, and Groq LLM for fast response processing ⚡. The system uses machine learning models trained on custom datasets to classify facial expressions, hand gestures, and body postures 📱.
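The actual MediaPipe pipeline isn't shown in this write-up, but the kind of geometry behind the head-pose and eye-contact heuristics can be sketched from normalized face landmarks (the x-coordinates below would come from MediaPipe Face Mesh; the function names, thresholds, and the yaw heuristic itself are illustrative assumptions, not our production model):

```python
def estimate_head_yaw(nose_x: float, left_eye_x: float, right_eye_x: float) -> float:
    """Rough yaw proxy: where the nose tip sits between the outer eye corners.

    Inputs are normalized x-coordinates in [0, 1], e.g. from MediaPipe
    Face Mesh landmarks. Returns roughly [-1, 1]; 0 means the nose is
    centered between the eyes (head facing the camera).
    """
    eye_mid = (left_eye_x + right_eye_x) / 2.0
    eye_span = right_eye_x - left_eye_x
    if eye_span <= 0:
        raise ValueError("right eye corner must lie to the right of the left")
    return (nose_x - eye_mid) / (eye_span / 2.0)


def is_facing_camera(nose_x: float, left_eye_x: float,
                     right_eye_x: float, threshold: float = 0.3) -> bool:
    """Hypothetical 'eye contact' check: head roughly frontal."""
    return abs(estimate_head_yaw(nose_x, left_eye_x, right_eye_x)) < threshold
```

A full implementation would use cv2.solvePnP with 3D facial reference points for true pitch/yaw/roll, but this 2D ratio captures the basic idea of turning raw landmarks into a feedback signal.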

Challenges we ran into

As first-year B.Tech students at NIT Hamirpur, we faced significant challenges. Integrating multiple AI technologies proved complex 🔄, especially ensuring real-time processing for immediate feedback ⏱️. Achieving accurate non-verbal signal detection required extensive model calibration 🔍. Creating natural-sounding AI interviewers that adapt to user responses was particularly difficult 🗨️. We struggled with cross-platform compatibility between React, Node.js, and Streamlit components 🔗. Performance optimization was crucial as running multiple ML models simultaneously caused initial latency issues ⏳. Learning these advanced technologies within the hackathon timeframe required intensive self-study and collaboration 📚.

Accomplishments that we're proud of

We're most proud of creating a functional full-stack application with AI capabilities despite being first-year students with limited prior experience 🌟. Successfully implementing MediaPipe for non-verbal communication analysis was a significant achievement 💪, especially the head pose estimation and eye contact detection features 👀. We're proud of our Streamlit dashboard that visualizes complex AI analysis in an intuitive format 📊. The adaptive nature of our AI interviewer, which adjusts questions based on previous responses, represents a technical milestone for our team 🎯. Most importantly, we built something with practical value that addresses a universal need for better communication skills 🌍.

What we learned

This project was an immense learning experience in both technical skills and teamwork. We gained practical experience with React, Node.js, Python, and Streamlit development 💻. We learned how to work with AI APIs and implement machine learning for computer vision tasks 🤖. Understanding API integration, environment configuration, and deployment processes provided valuable real-world development experience 🔄. Beyond technical skills, we learned the importance of project planning, task distribution, and collaboration 👥. Perhaps most significantly, we discovered how to break down a complex problem into manageable components and integrate them into a cohesive solution—a fundamental skill for our future engineering careers 🚀.

What's next for SpeakPro AI

1. Enhanced Emotion Detection 😊 - We plan to integrate more sophisticated emotion recognition to provide deeper insights into both the speaker's and listener's emotional states during conversations.
2. Integration with AR/VR 🥽 - Developing immersive practice environments using augmented and virtual reality for even more realistic interview and presentation simulations.
3. Performance Analytics Dashboard 📊 - Building more comprehensive progress tracking with actionable insights and personalized improvement recommendations.
4. Accessibility Features ♿ - Ensuring the platform is fully accessible to users with different abilities, including adaptations for hearing or vision impairments.

Built With

React.js, Node.js, Python, Streamlit, MediaPipe, Google Gemini API, ElevenLabs, Groq
