The toshokAI Journey
Inspiration
The inspiration for toshokAI came from a common challenge in academic research - the difficulty in efficiently exploring and understanding research papers. As a researcher, I identified three key pain points:
- arXiv's minimalistic interface limits advanced exploration
- Reading and comprehending papers is time-intensive
- Lack of interactive ways to engage with research content
I saw an opportunity to leverage AI Large Language Models to create a more intuitive and intelligent way to interact with academic papers.
What it does
toshokAI transforms research paper interaction through:
Smart Library Management:
- Metadata and Semantic search
- Intelligent paper recommendations based on library analysis
Interactive Paper Chat:
- Natural witty easy-to-understand conversations with paper content
- Contextual Q&A
- AI-generated insights and deep-dive questions
How I built it
My modern web application consists of:
- Frontend: Streamlit for clean, responsive UI
- Backend:
- Snowflake for as backend databse
- Multi-step RAG implementation with hybrid search
- Smart recommendation system using LLM
- Document Processing:
- Automated paper chunking
- Context-aware chat system
- Conversation memory management
Challenges and Solutions
- Document Processing: Optimized speed by using html version of the paper whenever possible
- RAG Quality: Improved through:
- Refined chunking approaches
- Enhanced prompt engineering
- Optimized context retrieval
- User Experience: Balanced functionality with simplicity through iterative design
- Performance: Optimized LLM context, search pagination, and response generation
Key Achievements
- Intelligent Search: Semantic search with AI-generated relevance explanations
- Smart Discovery: LLM-powered recommendation system analyzing library patterns
- Advanced RAG: Multi-step system for comprehensive paper insights
- Smooth UX: Clean interface with clear operation feedback
- Robust Architecture: Efficient paper processing with maintained chat context
Future Development
Enhanced Discovery:
- Citation network analysis
- Cross-paper relationship mapping
Advanced Features:
- Multi-paper synthesis
- Automated literature reviews
- Figure and equation understanding
toshokAI demonstrates the potential of combining modern AI with academic research tools, making research more efficient and insightful while maintaining an engaging user experience.
Built With
- docling
- langchain
- python
- snowflake
- streamlit
Log in or sign up for Devpost to join the conversation.