Krampus is a distributed real time stream data processing and anomaly detection system.
Developing data science systems with modern technologies like Spark and Scala.
How I built it
I developed the system step by step from bottom up - starting with basic data representation, to training anomaly detection algorithm.
Challenges I ran into
It was the first time I developed an aplication with Apache Spark. Starting and running such distributed system on local machine is pretty difficult.