About ChatLore

Inspiration

ChatLore was born from a simple observation: our digital conversations contain a wealth of sensitive information, yet we have limited tools to understand and protect this data. As messaging apps become central to our lives, we share phone numbers, addresses, passwords, and personal details without realizing the security implications. We wanted to create a solution that empowers users to take control of their digital conversations while maintaining privacy.

What We Learned

Building ChatLore taught us valuable lessons about:

  • Privacy-Preserving Analysis: Developing techniques to analyze sensitive data without compromising security
  • Context-Aware NLP: Creating systems that understand the nuanced context of conversations
  • Real-Time Security Analysis: Identifying potential risks in conversational data
  • User-Centric Design: Presenting complex security concepts in an accessible way

How We Built It

ChatLore is built on a privacy-first architecture with five key layers:

  1. Data Processing Layer: Our custom WhatsApp parser extracts and normalizes messages
  2. Analysis Layer: Identifies sensitive information and security risks using pattern recognition and ML
  3. Context Engine: Builds semantic understanding using Google's Gemini model and vector embeddings
  4. Query Layer: Enables context-aware search and question answering with RAG (Retrieval-Augmented Generation)
  5. Presentation Layer: Provides an intuitive interface built with React, TypeScript, and Tailwind CSS

For the backend, we used FastAPI to create efficient endpoints, while the frontend leverages React with TypeScript for type safety and Shadcn UI for a polished user experience.

Challenges We Faced

Privacy vs. Functionality

Our biggest challenge was balancing powerful analysis capabilities with strict privacy requirements. We solved this by:

  • Implementing local-first processing for sensitive operations
  • Using secure hashing for pattern matching without exposing data
  • Designing APIs that minimize data transfer

Context-Aware Search

Traditional keyword search fails to capture conversation context. We developed a system using:

  • Semantic embeddings to understand message meaning
  • Conversation threading based on temporal and semantic similarity
  • Custom context window algorithms to provide complete information

Efficient Processing

Chat histories can contain thousands of messages, making real-time analysis challenging. We implemented:

  • Incremental processing in manageable chunks
  • Parallel execution of independent analysis tasks
  • Strategic caching to avoid redundant processing

Handling Informal Text

Chat messages are often informal, with abbreviations, emojis, and incomplete sentences. We created specialized text processing techniques, including:

  • Chat-specific preprocessing pipelines
  • Emoji understanding as part of message meaning
  • Context-based interpretation of ambiguous messages

What's Next for ChatLore

We're excited to expand ChatLore with:

  • Support for more messaging platforms (Telegram, Discord, Slack)
  • Advanced threat detection for potential phishing or scam messages
  • Personalized security recommendations based on user behavior
  • End-to-end encrypted cloud backup options
  • Integration with privacy-focused identity management systems

ChatLore represents a new approach to chat data management that puts privacy and security first while still providing powerful analysis capabilities. We believe everyone deserves to understand and protect their digital conversations.

Team LazyDevs: Dhruv Khara: dk67537n@pace.edu, dhruvkhara167@gmail.com Meet Bhanushali: meetdinesh.bhanushali@pace.edu, meetbhanushali2001@gmail.com

Built With

Share this project:

Updates