HCL data analysis methods

Perform operations on an existing HCL dataframe.

HCL method Description
append() method

Combines rows from two or more dataframes into a single dataframe by appending one dataframe to the bottom of another dataframe.

benford() method

Counts the number of times each leading digit (1–9), or leading digit combination, occurs in a numeric column, and compares the actual count to the expected count. The expected count is calculated using the Benford formula.

count() method

Counts the number of rows in a dataframe.

duplicates() method

Detects whether duplicate values or entire duplicate rows exist in a dataframe.

extract() method

Extracts columns from a dataframe to a new dataframe.

head() method

Returns a specified number of rows from the top of a dataframe.

join() method

Based on a common key, combines columns from two dataframes into a single dataframe.

outliers() method

Identifies statistical outliers in a numeric column. Outliers can be identified for the column as a whole, or for separate groups based on identical values in one or more key columns.

rename() method

Renames one or more columns in a dataframe.

select() method

Selects the specified columns in a dataframe for inclusion in a new dataframe.

sort() method

Sorts rows in a dataframe in ascending or descending order, based on one or more specified key columns.

summarize() method

Groups rows in a dataframe based on identical values in one or more columns. Counts the number of rows in each group, and also subtotals specified numeric columns for each group.
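
The benford() entry above compares actual leading-digit counts to expected counts. As a rough illustration of that comparison, the plain-Python sketch below assumes that the "Benford formula" is the standard Benford's law probability, log10(1 + 1/d), for a leading digit or digit combination d. The function names leading_digits and benford_counts are hypothetical helpers written for this example. They are not part of HCL, and the actual benford() method may handle zeros, negative values, and output layout differently.

```python
import math
from collections import Counter


def leading_digits(value, n=1):
    """Return the first n significant digits of a nonzero number as an int."""
    # Scientific notation puts the first significant digit before the point,
    # e.g. 123.45 -> '1.234500000000000e+02'. The sign is dropped via abs().
    mantissa = f"{abs(value):.15e}".split("e")[0]
    return int(mantissa.replace(".", "")[:n])


def benford_counts(values, n=1):
    """Map each leading-digit combination to (actual count, expected count).

    Zeros are skipped because they have no leading digit (an assumption
    made for this sketch).
    """
    leads = [leading_digits(v, n) for v in values if v != 0]
    total = len(leads)
    actual = Counter(leads)
    # n = 1 covers digits 1..9, n = 2 covers combinations 10..99, and so on.
    first, last = 10 ** (n - 1), 10 ** n
    return {
        d: (actual.get(d, 0), total * math.log10(1 + 1 / d))
        for d in range(first, last)
    }


if __name__ == "__main__":
    amounts = [123, 187, 1.5, 24, 0.031, 950, -17, 0]
    for digit, (count, expected) in benford_counts(amounts).items():
        print(f"{digit}: actual={count} expected={expected:.2f}")
```

Because the Benford probabilities for all leading digits add up to 1, the expected counts always add up to the number of nonzero values tested, which is a quick sanity check on any such comparison.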