Inspiration

As a 4th-year B.Tech CSE student specialising in AI and Data Science, I've always found data preprocessing to be a time-consuming and tedious part of creating AI or ML models. This frustration inspired me to develop DataLens, a tool designed to simplify and streamline the data preparation process, allowing data scientists to focus more on analysis and insights rather than the mundane tasks of cleaning and formatting data.

What it does

DataLens is an online tool that automates the preprocessing of datasets and provides detailed visualisations and comprehensive reports with minimal effort. It has two main features:

  1. AutoPreprocessor: Simplifies data processing by transforming CSV files with a single click, making your data ready for analysis in no time.
  2. BirdEye: Uncovers insights instantly by generating detailed visualisations and reports from your data, providing actionable insights quickly and efficiently.

How we built it

We built DataLens using a combination of Python for backend processing and Next.js for the frontend. The AutoPreprocessor feature was designed to handle a variety of data types and structures, automating the most time-consuming aspects of data preparation. BirdEye was developed to provide instant visualisations and comprehensive reports, using data visualisation libraries to present the data in an accessible and insightful way.

Challenges we ran into

One of the main challenges we faced was ensuring that the AutoPreprocessor could handle diverse datasets, each with its own unique structure and quirks. It required extensive testing and refinement to make sure the tool was robust and versatile enough to manage different data scenarios. Integrating the visualisation and reporting features into a seamless user experience was another significant challenge, as it was important to balance power with ease of use.

Accomplishments that we're proud of

We are particularly proud of how DataLens has successfully automated the preprocessing phase, reducing the time and effort required to prepare datasets for analysis. The BirdEye feature, with its ability to generate detailed visualisations and reports instantly, is another accomplishment that we’re proud of. These features make DataLens a valuable tool for data scientists and analysts.

What we learned

Through this project, we learned the importance of automation in data science workflows and the need for intuitive tools that can simplify complex processes. We also gained a deeper understanding of the challenges involved in data preprocessing and visualisation, and how to overcome them to create a user-friendly product.

What's next for DataLens

Looking ahead, we plan to expand DataLens by adding more advanced data preprocessing options, support for additional data formats, and enhanced visualization capabilities. We also aim to integrate machine learning model training directly into the platform, making DataLens a comprehensive tool for data scientists from preprocessing to model deployment.

Built With

Share this project:

Updates