Welcome! Now that you have initialized your project, the best way to work with Great Expectations is in this iterative dev loop:
great_expectations suite new
great_expectations suite edit
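In practice the loop looks something like this (a minimal sketch; the suite name my_suite is a placeholder, and suite edit opens a Jupyter notebook where you refine the suite and re-validate):

# scaffold a new Expectation Suite from a batch of your data
great_expectations suite new
# reopen an existing suite for editing; my_suite is a placeholder name
great_expectations suite edit my_suite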
An expectation is a falsifiable, verifiable statement about data.
Expectations provide a shared language for talking about data characteristics and data quality: humans to humans, humans to machines, and machines to machines.
Expectations are both data tests and docs! For example, here is an Expectation in JSON format:
{
  "expectation_type": "expect_column_values_to_not_be_null",
  "kwargs": {
    "column": "user_id"
  }
}
A machine can test whether a dataset conforms to the expectation and record the outcome as a Validation Result:
{
  "success": false,
  "result": {
    "element_count": 253405,
    "unexpected_count": 7602,
    "unexpected_percent": 2.999
  },
  "expectation_config": {
    "expectation_type": "expect_column_values_to_not_be_null",
    "kwargs": {
      "column": "user_id"
    }
  }
}
The example Validation Result above (not from your data) carries rich context about the test failure: how many elements were checked, how many were unexpected, and which expectation was violated.
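As a minimal sketch of how such a result can be produced interactively, assuming the classic pandas-flavored API and a placeholder file named events.csv:

import great_expectations as ge

# load a CSV into a Great Expectations dataset ("events.csv" is a placeholder)
df = ge.read_csv("events.csv")

# run a single expectation; the return value mirrors the JSON above
result = df.expect_column_values_to_not_be_null("user_id")
print(result.success)

With the default result format, the returned object carries the same fields shown above: a success flag, a result block with element and unexpected counts, and the originating expectation_config.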
This is an example of what a single failed Expectation looks like in Data Docs. Note that the failure report includes sampled unexpected values from your data, which helps you debug pipelines faster.
Nearly 50 built-in expectations let you express how you understand your data, and you can add custom expectations if you need a new one.
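One classic pattern for defining a custom expectation, sketched here under the assumption of the pandas-flavored dataset API (the expectation name and the even-number rule are invented for illustration):

from great_expectations.dataset import MetaPandasDataset, PandasDataset

class MyCustomDataset(PandasDataset):
    _data_asset_type = "MyCustomDataset"

    # the decorator turns a row-wise boolean check into a full expectation,
    # handling result formatting, the "mostly" threshold, and unexpected-value samples
    @MetaPandasDataset.column_map_expectation
    def expect_column_values_to_be_even(self, column):
        return column % 2 == 0

Datasets loaded with dataset_class=MyCustomDataset would then expose expect_column_values_to_be_even alongside the built-in expectations.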
This sample suite shows you a few examples of expectations.
Note that this is not a production suite; it was generated using only a small sample of your data.
When you are ready, press the "How to Edit" button to kick off the iterative dev loop.
Data Docs autogenerated using Great Expectations.