My Journey: Building AgriSathi

What Inspired Me

Living in a country where agriculture is the backbone of the economy, I noticed a painful irony: while the internet is overflowing with data, the people who need it most—our farmers—are often the most disconnected. I realized that for a farmer in rural India, the "Digital Divide" isn't just about a lack of internet; it’s a barrier of literacy, language, and hardware.

I asked myself: What if a farmer didn't need a smartphone or data pack? What if they could just pick up any basic phone, dial a number, and talk to the internet in their own mother tongue? That "what if" became AgriSathi. My mission was to turn a simple phone call into a gateway for expert agricultural wisdom.


How I Built It

I designed AgriSathi as a first-of-its-kind, end-to-end Voice-to-Voice AI platform. I focused on making the interaction as natural as talking to a neighbor.

The Technical Workflow

  1. Telephony & Orchestration: I used LiveKit to handle the SIP trunking, allowing farmers to connect via a standard cellular call and their AI agent SDK for entire Agentic Orchestration.
  2. The Speech Pipeline:
    • STT (Speech-to-Text): I integrated Deepgram Nova-3 for its exceptional multilingual support, converting local dialects into text.
    • The Brain (LLM): I utilized Gemini 3 for ultra-low latency processing, coupled with the excellent reasoning to appropriately call the tools and talk to the user in a respectful and easy manner, making sure the answers are properly formatted and are in a multilingual way for the TTS module to convert it into voice.
    • TTS (Text-to-Speech): To give our agent a human-like voice, I used Cartesia Sonic 3, which provides high-quality, multilingual vocal outputs.
  3. Information Retrieval: The system isn't just a chatbot; it's a researcher. It uses Google Web Search APIs to fetch real-time Mandi prices, weather forecasts, and the latest policy schemes.

Cost Efficiency

I optimized the architecture to keep the total estimated cost at approximately $0.0698/min, ensuring that the solution remains scalable and economically viable for public impact.


Challenges I Faced

The road wasn't easy. Our biggest hurdle was Latency. In a voice call, a delay of even two seconds feels like an eternity. I had to move toward an asynchronous programming model using Python and specialized LLMs like Gemini to ensure the "Voice-to-Voice" loop felt instantaneous.

Another challenge was Context Retention. Farmers often call back with follow-up questions. I solved this by implementing a Database system that generates a conversation summary after every call, allowing the AI to "remember" the farmer’s previous issues (like a specific pest infestation) when they call again.


What I Learned

This project taught me that accessibility is the highest form of innovation. Building a complex LLM is one thing, but making it accessible to someone with a ₹1,200 feature phone is where the real impact lies. I learned how to orchestrate multiple cutting-edge AI tools (LiveKit, Gemini) into a single, seamless pipeline that bridges the gap between high-tech AI and the grassroots level of Bharat.


The Impact

  • Zero Literacy Barrier: No English or tech-literacy required.
  • Infrastructure Independent: Works without smartphones or 4G/5G.
  • Localized: Real-time Mandi prices and weather advice in local languages.

AgriSathi is more than just a tool; it’s a digital companion for every Indian farmer.

Built With

Share this project:

Updates