Inspiration
Our project was inspired by the growing need for individuals to navigate the complex world of financial content on the internet. Misleading claims, financial jargon, and sensationalism can make it challenging to separate valuable information from deceptive practices. We aimed to create a solution that empowers users to make informed decisions by identifying potentially misleading content and enhancing their financial knowledge
What it does
It's a one-stop solution for jargon and sensationalism detection in the financial domain. Authentix empowers users to distinguish between authentic and potentially misleading claims, helping them make informed decisions in the world of finance. By analyzing transcripts, comments, and posts, our platform provides real-time insights into the presence of intricate jargon and sensational language.
How we built it
We started by collecting a diverse dataset of financial transcripts and comments, meticulously annotated to serve as the training data for our models. Leveraging powerful NLP models like BERT, we fine-tuned them to detect financial jargon and sensationalism accurately.
We also implemented a Bag-of-Words (BoW) mechanism to identify specific keywords associated with misleading claims. This two-pronged approach ensures that Authentix can effectively analyze content, no matter how intricate or sensational.
Challenges we ran into
One of the primary hurdles was acquiring a comprehensive and annotated dataset for training our models. We had to invest significant time and effort in data collection and annotation. Additionally, fine-tuning and optimizing NLP models demanded computational resources and expertise
What we learned
Data Annotation: Annotating financial data for machine learning models is a meticulous process that demands attention to detail.
Model Fine-Tuning: Fine-tuning large-scale NLP models like BERT is a powerful technique for domain-specific applications.
Balancing Complexity: Striking a balance between complex NLP models and real-time processing capabilities was a challenge. We learned how to optimize both for efficiency.
Built With
- bert
- django
- googleapiclient
- javascript
- openai-api
- python
- scikit-learn
- spacy
- sqlite
- tensorflow
Log in or sign up for Devpost to join the conversation.