πŸš€ AutoVoice AI – Project Story

πŸ“Œ About the Project

AutoVoice AI is an Agentforce-powered text-to-speech enhancement that converts AI-generated responses into natural-sounding speech using AWS Polly. This innovation allows users to receive audio responses instead of text, making AI interactions more engaging, accessible, and hands-free.

With Slack integration, users can hear responses directly in their conversations, making AI feel more conversational and lifelike. Whether for customer service, technical support, or enterprise AI assistants, AutoVoice AI bridges the gap between text-based AI and voice-driven experiences.

🎯 Inspiration

We live in a world dominated by AI chatbots, yet most AI responses remain purely text-based. We wanted to break that barrier and introduce a solution that makes AI interactions feel more human and accessible.

Inspired by advancements in AI voice synthesis and accessibility tools, we saw an opportunity to transform Agentforce AI responses into dynamic, spoken messages. Whether for professionals multitasking, visually impaired users, or people who simply prefer listening over reading, AutoVoice AI enhances engagement and user experience across industries.

πŸ—οΈ How We Built It

1️⃣ Integrating AWS Polly with Agentforce

  • We created a custom action in Agentforce that processes AI responses and forwards them to AWS Polly.
  • AWS Polly generates an MP3 audio file from the text using realistic, AI-driven voices.

2️⃣ Building the AWS Lambda Function

  • An AWS Lambda function receives text responses via API Gateway.
  • The function sends the text to AWS Polly, which converts it into speech.
  • The resulting audio file is uploaded to Amazon S3 or sent directly back to Salesforce.

3️⃣ Slack Integration for Audio Playback

  • We developed a Slack app that allows users to receive AI-generated speech responses directly in Slack messages.
  • The app posts a clickable MP3 link for users to listen instantly.

4️⃣ Enhancing Salesforce with Audio Responses

  • Users interacting with Agentforce AI inside Salesforce can listen to responses via an embedded audio player.
  • This allows support agents, sales reps, and enterprise users to consume AI-generated content in a more interactive way.

⚠️ Challenges Faced

  • Agentforce Custom Actions – Fine-tuning Agentforce to trigger external API calls seamlessly.
  • Audio File Storage & Delivery – Deciding whether to use temporary file storage in S3 or return audio directly.
  • Voice Optimization – Choosing the best AWS Polly voice settings for clarity, engagement, and realism.
  • Latency Management – Ensuring minimal delay between AI text generation and speech output.

πŸš€ What We Learned

βœ… Voice-enabling AI boosts engagement – Users interact more naturally when AI talks back.

βœ… Hands-free AI is powerful – AutoVoice AI opens up new accessibility and productivity use cases.

βœ… Slack & Salesforce make a great combo – Using Slack as a delivery channel for AI speech enhances team collaboration.

🎯 Future Enhancements

🌟 Real-time Streaming – Reduce latency by enabling live voice streaming instead of file playback.

🌟 Multilingual AI Voices – Expand support for multiple languages for a global audience.

🌟 Personalized Voice Settings – Let users choose from different AI voices (e.g., professional, casual, energetic).


πŸ”₯ Conclusion

AutoVoice AI redefines AI interactions by adding a voice to Agentforce responses. Whether in Slack, Salesforce, or customer applications, this innovation makes AI more engaging, natural, and accessible.

Text-based AI is great, but hearing is believing! πŸ”ŠπŸš€

Built With

+ 8 more
Share this project:

Updates