Adding suite of custom expectations

We're looking to modernize data validation by contributing to the Great Expectations Library as part of their hackathon.

Inspiration

Data validation is a key principle of data engineering, and tools that do this well have the potential to shape the future of the industry and how we derive confidence from data.

What it does

We're contributing a number of custom expectations to the Great Expectations Hackathon:

expectations that valid zip codes are in place
expectations that shape files should either overlap or not
expectations that lines should fall either within or outside of boundaries
expectations that us state and territory codes are valid
Geospatial expectations to see if geometries are valid, overlap, have elevation
expectation to see if model results are fair from a binary model

How we built it

We followed their template and added code that would validate the fields we wanted to by using a combination of existing python libraries and data.

Challenges we ran into

No one on the team has contributed to open source projects before, and so a large challenge was not only coding custom expectations, but ensuring that we were following style guides, contribution guides, and standard procedures.

Accomplishments that we're proud of

_ number of custom expectations added!

What we learned

Open source projects are beautiful and messy.

What's next for Adding suite of custom expectations

Implementing the expectations we've developed into our own ecosystem

Built With

geopandas
great-expectations
python
us
zipcodes

Updates

Luis Diaz posted an update — Apr 06, 2022 11:56 PM EDT

Some of the pull requests that are also ours but weren't linked in time: https://github.com/great-expectations/great_expectations/pull/4745 https://github.com/great-expectations/great_expectations/pull/4749 https://github.com/great-expectations/great_expectations/pull/4755 https://github.com/great-expectations/great_expectations/pull/4757 https://github.com/great-expectations/great_expectations/pull/4761 https://github.com/great-expectations/great_expectations/pull/4764 https://github.com/great-expectations/great_expectations/pull/4765 https://github.com/great-expectations/great_expectations/pull/4772 https://github.com/great-expectations/great_expectations/pull/4775 https://github.com/great-expectations/great_expectations/pull/4779 https://github.com/great-expectations/great_expectations/pull/4780 https://github.com/great-expectations/great_expectations/pull/4786 https://github.com/great-expectations/great_expectations/pull/4788 https://github.com/great-expectations/great_expectations/pull/4789 https://github.com/great-expectations/great_expectations/pull/4790 https://github.com/great-expectations/great_expectations/pull/4791 https://github.com/great-expectations/great_expectations/pull/4792 https://github.com/great-expectations/great_expectations/pull/4793 https://github.com/great-expectations/great_expectations/pull/4794 https://github.com/great-expectations/great_expectations/pull/4795 https://github.com/great-expectations/great_expectations/pull/4796 https://github.com/great-expectations/great_expectations/pull/4797 https://github.com/great-expectations/great_expectations/pull/4798 https://github.com/great-expectations/great_expectations/pull/4799 https://github.com/great-expectations/great_expectations/pull/4800 https://github.com/great-expectations/great_expectations/pull/4801 https://github.com/great-expectations/great_expectations/pull/4802 https://github.com/great-expectations/great_expectations/pull/4803 https://github.com/great-expectations/great_expectations/pull/4804

Log in or sign up for Devpost to join the conversation.

Luis Diaz started this project — Apr 05, 2022 04:56 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.