Glossary of Expectations

Dataset

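Dataset objects model tabular data and provide expectations with row and column semantics. Many Dataset expectations are implemented with one of two decorators: column_map_expectation, which evaluates each value in a column individually, and column_aggregate_expectation, which evaluates a single statistic computed over the whole column.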
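
The difference between the two decorator styles can be illustrated with a small pure-Python sketch. Everything below is hypothetical and written only for this illustration: toy_column_map, toy_column_aggregate, the two example expectations, and the shape of the result dictionaries are assumptions, not the library's actual implementation.

```python
from functools import wraps


def toy_column_map(func):
    """Hypothetical stand-in: lift a per-value predicate to a whole column."""
    @wraps(func)
    def wrapper(column, *args, **kwargs):
        # Collect every value for which the predicate fails.
        unexpected = [v for v in column if not func(v, *args, **kwargs)]
        return {"success": not unexpected, "unexpected_list": unexpected}
    return wrapper


def toy_column_aggregate(func):
    """Hypothetical stand-in: judge one statistic computed over the column."""
    @wraps(func)
    def wrapper(column, *args, **kwargs):
        # The wrapped function returns (observed statistic, pass/fail).
        observed, success = func(list(column), *args, **kwargs)
        return {"success": success, "observed_value": observed}
    return wrapper


@toy_column_map
def toy_expect_values_in_set(value, value_set):
    # Row semantics: called once per value.
    return value in value_set


@toy_column_aggregate
def toy_expect_mean_between(values, min_value, max_value):
    # Column semantics: called once with all values.
    mean = sum(values) / len(values)
    return mean, min_value <= mean <= max_value
```

In this sketch a map-style expectation reports the offending values, while an aggregate-style expectation reports the single observed statistic.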

Table shape

  • expect_column_to_exist
  • expect_table_columns_to_match_ordered_list
  • expect_table_row_count_to_be_between
  • expect_table_row_count_to_equal

Missing values, unique values, and types

  • expect_column_values_to_be_unique
  • expect_column_values_to_not_be_null
  • expect_column_values_to_be_null
  • expect_column_values_to_be_of_type
  • expect_column_values_to_be_in_type_list

Sets and ranges

  • expect_column_values_to_be_in_set
  • expect_column_values_to_not_be_in_set
  • expect_column_values_to_be_between
  • expect_column_values_to_be_increasing
  • expect_column_values_to_be_decreasing

String matching

  • expect_column_value_lengths_to_be_between
  • expect_column_value_lengths_to_equal
  • expect_column_values_to_match_regex
  • expect_column_values_to_not_match_regex
  • expect_column_values_to_match_regex_list
  • expect_column_values_to_not_match_regex_list
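
As a rough illustration of what the string-matching expectations check, the sketch below uses only the standard library. The helper names are hypothetical, and the use of re.search (a match anywhere in the string) is an assumption made for this example rather than a statement about the library's behavior.

```python
import re


def values_not_matching_regex(values, regex):
    """Return the values that do not match regex anywhere (assumed semantics)."""
    pattern = re.compile(regex)
    return [v for v in values if pattern.search(v) is None]


def values_with_length_outside(values, min_length, max_length):
    """Return the values whose length is outside [min_length, max_length]."""
    return [v for v in values if not (min_length <= len(v) <= max_length)]
```

An empty result would correspond to a passing check; a non-empty result lists the offending values.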

Datetime and JSON parsing

  • expect_column_values_to_match_strftime_format
  • expect_column_values_to_be_dateutil_parseable
  • expect_column_values_to_be_json_parseable
  • expect_column_values_to_match_json_schema

Aggregate functions

  • expect_column_mean_to_be_between
  • expect_column_median_to_be_between
  • expect_column_stdev_to_be_between
  • expect_column_unique_value_count_to_be_between
  • expect_column_proportion_of_unique_values_to_be_between
  • expect_column_most_common_value_to_be_in_set
  • expect_column_max_to_be_between
  • expect_column_min_to_be_between
  • expect_column_sum_to_be_between

Column pairs

  • expect_column_pair_values_A_to_be_greater_than_B
  • expect_column_pair_values_to_be_equal
  • expect_column_pair_values_to_be_in_set
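
Column-pair expectations compare two columns row by row. The sketch below shows the idea in plain Python; the helper names and the or_equal flag are hypothetical names chosen for this illustration.

```python
def pairs_where_a_not_greater_than_b(col_a, col_b, or_equal=False):
    """Return the (a, b) pairs that violate a > b (or a >= b if or_equal)."""
    failures = []
    for a, b in zip(col_a, col_b):
        ok = a >= b if or_equal else a > b
        if not ok:
            failures.append((a, b))
    return failures


def pairs_not_in_set(col_a, col_b, allowed_pairs):
    """Return the (a, b) pairs that are not among the allowed pairs."""
    allowed = set(allowed_pairs)
    return [(a, b) for a, b in zip(col_a, col_b) if (a, b) not in allowed]
```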

Distributional functions

  • expect_column_kl_divergence_to_be_less_than
  • expect_column_bootstrapped_ks_test_p_value_to_be_greater_than
  • expect_column_chisquare_test_p_value_to_be_greater_than
  • expect_column_parameterized_distribution_ks_test_p_value_to_be_greater_than
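
For orientation, the quantity behind a KL-divergence check can be computed for two discrete distributions as follows. This is a textbook formula written as a minimal sketch, not the library's implementation; the function name and the treatment of zero-probability bins are assumptions.

```python
import math


def kl_divergence(p, q):
    """Kullback-Leibler divergence D(p || q) for discrete distributions.

    p and q are equal-length sequences of probabilities. Terms with
    p_i == 0 contribute nothing; if q_i == 0 where p_i > 0, the
    divergence is infinite.
    """
    total = 0.0
    for p_i, q_i in zip(p, q):
        if p_i == 0:
            continue
        if q_i == 0:
            return math.inf
        total += p_i * math.log(p_i / q_i)
    return total
```

A threshold check in the spirit of the expectation name would then compare this value against a chosen upper bound.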