Inspiration

Government services and benefits can be overwhelming to navigate, with complex eligibility criteria and application processes. Many citizens struggle to understand what benefits they qualify for and how to access them. CivicSense was inspired by the need to make government assistance programs more accessible and understandable to the general public, starting with Illinois Department of Human Services (IDHS) programs.

What it does

CivicSense is an intelligent chatbot that helps citizens navigate government policies and benefits, specifically focused on IDHS services. Key features include:

  • Provides easy-to-understand information about benefits like SNAP (Supplemental Nutrition Assistance Program)
  • Helps users understand eligibility criteria and application processes
  • Maintains contextual memory for natural follow-up conversations
  • Includes direct references to official sources and documentation
  • Offers a comprehensive analytics dashboard for monitoring system performance
  • Provides real-time groundedness and relevance metrics using TruLens evaluation

How we built it

Technical Architecture CivicSense is engineered with a sophisticated architecture that combines modern AI technologies with robust evaluation systems:

  1. Frontend (StreamlitChatBot):

    • Implements a responsive chat interface
    • Features a comprehensive dashboard for performance analytics and monitoring
  2. RAG System Architecture:

    • Snowflake Cortex Search for efficient document retrieval
    • Mistral Large LLM for natural language understanding and generation
    • Custom context processing for accurate responses
  3. Evaluation System (TruLens Integration):

    • CortexEvaluator implements sophisticated feedback mechanisms:
      • Groundedness measurement with chain-of-thought reasoning
      • Context relevance scoring
      • Answer relevance metrics
      • Cost estimation feedback
  4. Performance Analytics:

    • Comprehensive metrics tracking through custom SQL queries
    • Real-time monitoring of:
      • Quality Metrics
      • Cost analysis
      • Performance
      • Model Evaluation
    • Version comparison capabilities for continuous improvement

Challenges we ran into

Resource Management: Credits management became a significant challenge as we scaled our testing and development. We had to strategically use multiple accounts and optimize our usage patterns to maintain development momentum while staying within resource limits.

Accomplishments that we're proud of

  1. Comprehensive Evaluation System: Successfully implemented TruLens for sophisticated evaluation of response quality, including:

    • Groundedness measurement
    • Context relevance scoring
    • Answer relevance metrics
    • Cost tracking
  2. Advanced Analytics Dashboard: Created a detailed monitoring system that tracks:

    • Real-time performance metrics
    • Cost analysis
    • Quality metrics
    • Version comparisons
  3. Natural Interaction: Achieved a conversational interface that maintains context while providing accurate, sourced information.

What we learned

Building CivicSense has been an intensive learning experience in modern RAG system development. The project taught us the critical importance of evaluation metrics in AI systems - it's not just about getting answers, but ensuring they're accurate, relevant, and grounded in source documentation. Through TruLens integration, we learned to quantify and monitor these aspects systematically, leading to continuous improvement in response quality. This experience has given us deep insights into building production-grade AI systems that can handle sensitive information while maintaining high standards of accuracy and reliability.

What's next for CivicSense

  1. Expansion of Services:

    • Include additional government programs and services
    • Expand to other states beyond Illinois
  2. Enhanced Features:

    • Multi-language support
    • Personalized eligibility checking
    • Interactive application assistance
  3. Technical Improvements:

    • Advanced document preprocessing
    • Enhanced context understanding
    • More sophisticated performance optimization

Built With

  • mistral-ai
  • python
  • snowflake
  • streamlit
  • trulens
Share this project:

Updates