About ChatLore
Inspiration
ChatLore was born from a simple observation: our digital conversations contain a wealth of sensitive information, yet we have limited tools to understand and protect this data. As messaging apps become central to our lives, we share phone numbers, addresses, passwords, and personal details without realizing the security implications. We wanted to create a solution that empowers users to take control of their digital conversations while maintaining privacy.
What We Learned
Building ChatLore taught us valuable lessons about:
- Privacy-Preserving Analysis: Developing techniques to analyze sensitive data without compromising security
- Context-Aware NLP: Creating systems that understand the nuanced context of conversations
- Real-Time Security Analysis: Identifying potential risks in conversational data
- User-Centric Design: Presenting complex security concepts in an accessible way
How We Built It
ChatLore is built on a privacy-first architecture with five key layers:
- Data Processing Layer: Our custom WhatsApp parser extracts and normalizes messages
- Analysis Layer: Identifies sensitive information and security risks using pattern recognition and ML
- Context Engine: Builds semantic understanding using Google's Gemini model and vector embeddings
- Query Layer: Enables context-aware search and question answering with RAG (Retrieval-Augmented Generation)
- Presentation Layer: Provides an intuitive interface built with React, TypeScript, and Tailwind CSS
For the backend, we used FastAPI to create efficient endpoints, while the frontend leverages React with TypeScript for type safety and Shadcn UI for a polished user experience.
Challenges We Faced
Privacy vs. Functionality
Our biggest challenge was balancing powerful analysis capabilities with strict privacy requirements. We solved this by:
- Implementing local-first processing for sensitive operations
- Using secure hashing for pattern matching without exposing data
- Designing APIs that minimize data transfer
Context-Aware Search
Traditional keyword search fails to capture conversation context. We developed a system using:
- Semantic embeddings to understand message meaning
- Conversation threading based on temporal and semantic similarity
- Custom context window algorithms to provide complete information
Efficient Processing
Chat histories can contain thousands of messages, making real-time analysis challenging. We implemented:
- Incremental processing in manageable chunks
- Parallel execution of independent analysis tasks
- Strategic caching to avoid redundant processing
Handling Informal Text
Chat messages are often informal, with abbreviations, emojis, and incomplete sentences. We created specialized text processing techniques, including:
- Chat-specific preprocessing pipelines
- Emoji understanding as part of message meaning
- Context-based interpretation of ambiguous messages
What's Next for ChatLore
We're excited to expand ChatLore with:
- Support for more messaging platforms (Telegram, Discord, Slack)
- Advanced threat detection for potential phishing or scam messages
- Personalized security recommendations based on user behavior
- End-to-end encrypted cloud backup options
- Integration with privacy-focused identity management systems
ChatLore represents a new approach to chat data management that puts privacy and security first while still providing powerful analysis capabilities. We believe everyone deserves to understand and protect their digital conversations.
Team LazyDevs: Dhruv Khara: dk67537n@pace.edu, dhruvkhara167@gmail.com Meet Bhanushali: meetdinesh.bhanushali@pace.edu, meetbhanushali2001@gmail.com


Log in or sign up for Devpost to join the conversation.