Profile and verify data

The profile commands let you count records, total numeric fields, and create a statistical profile of data.

The verify commands provide different ways to examine the integrity of a data set. For example, you can test for data validity, data sequence, gaps, and duplicates.

Command descriptions

Command Description

BENFORD

Counts the number of times each leading digit (1–9) or leading digit combination occurs in a field, and compares the actual count to the expected count. The expected count is calculated using the Benford formula.

COUNT

Counts the total number of records in the current view, or only those records that meet the specified condition.

DUPLICATES

Detects whether duplicate values or entire duplicate records exist in an Analytics table.

FUZZYDUP

Detects nearly identical values (fuzzy duplicates) in a character field.

GAPS

Detects whether a numeric or datetime field in an Analytics table contains one or more gaps in sequential data.

OUTLIERS

Identifies statistical outliers in a numeric field. Outliers can be identified for the field as a whole, or for separate groups based on identical values in one or more character, numeric, or datetime key fields.

PROFILE

Generates summary statistics for one or more numeric fields, or numeric expressions, in an Analytics table.

SEQUENCE

Determines if one or more fields in an Analytics table are in sequential order, and identifies out-of-sequence items.

STATISTICS

Calculates statistics for one or more numeric or datetime fields in an Analytics table.

TOTAL

Calculates the total value of one or more fields in an Analytics table.

VERIFY

Checks for data validity errors in one or more fields in an Analytics table by verifying that the data is consistent with the field definitions in the table layout.