D-Bit HackUTD2022 EOG challenge
Inspiration Given a wide variety of data, with an open ended prompt, we were excited to use classic unsupervised and supervised learning techniques to classify techniques.
What it does D-Day is a website that takes in data about asteroid drills sent in by the user and uses ML techniques to present accurate and useful analytics. The code promotes and very modular design for reproducibility and easy access. Our website design illustrates flawless user interface for the most efficient, streamlined process possible.
How we built it We utilized primarily Python, Numpy, Pandas, and Scikit-learn for the true processes behind our project. We used a variety of mathematical and statistical techniques in order to normalize and transform the given data in order to achieve the desired results.
Challenges we ran into One problem we faced was the numerous occurrences of bad data which we needed to clean. Most of our time went into creating features and doing routine data processing in order to create a strong foundation for our model. In addition, we faced the challenge of selecting a model in order to best fit the problem. While we could have used some sort of CNN and deep learning for the base of the project, we decided it would be much more efficient and cost-smart to utilize a simpler model such as a K-nearest neighbors algorithm.
Accomplishments that we're proud of We were able to answer all of the given questions given as part of the project prompt. In addition, we were able to successfully devise a way to monitor and check live user data through the functionality of data submission on the website. We were able to normalize and process the results in order to make an readable output for any user. We were able to determine that the best drilled Asteroid was Asteroid #3 and that the most efficient drill-bit per depth/time was the "Apollo Drill-bit."
What we learned We learned even further about the importance of data preprocessing and cleaning. Furthermore, we learned more about the variety of solutions that occur per any problem. I believe we advanced our ML and AI skills by a significant degree while also maintaining respect for UI/UX and frontend.
What's next for D-Day The next step for our project in fully fleshing out and connecting our website. Moreover, we thought that an efficient way to generate the closest permutation of specific labels would be through reinforcement learning, a system based on weights and rewards. While we were unable to fully realize this approach, we believe we have enough understanding of this technique in order to more accurately solve this problem.