This project generates utilizes differential privacy methods on the NYC taxi dataset to mask columns with sensitive data and deploy a private dataset. We utilize 5 methodologies: Total Anonymity, Min/Max Uniform Distribution Sampling, Gaussian Sampling, Laplacian Sampling, and GUPT. Please see the Google Drive link below for a project description and the GitHub link for our implementation. The Google Docs link is for the slideshow (only accessible from a Columbia email).
Log in or sign up for Devpost to join the conversation.