HCL data analysis methods
Perform operations on an existing HCL dataframe.
HCL method | Description |
---|---|
append() method |
Combines rows from two or more dateframes into a single dataframe by appending one dateframe to the bottom of another dataframe. |
benford() method |
Counts the number of times each leading digit (1–9), or leading digit combination, occurs in a numeric column, and compares the actual count to the expected count. The expected count is calculated using the Benford formula. |
count() method |
Counts the number of rows in a dataframe. |
duplicates() method |
Detects whether duplicate values or entire duplicate rows exist in a dataframe. |
extract() method |
Extracts columns from a dataframe to a new dataframe. |
head() method |
Returns a specified number of rows from the top of a dataframe. |
join() method |
Based on a common key, combines columns from two dataframes into a single dataframe. |
outliers() method |
Identifies statistical outliers in a numeric column. Outliers can be identified for the column as a whole, or for separate groups based on identical values in one or more key columns. |
rename() method |
Renames one or more columns in a dataframe. |
select() method |
Selects the specified columns in a dataframe for inclusion in a new dataframe. |
sort() method |
Sorts rows in a dataframe into an ascending or descending sequential order, based on a specified key column or columns. |
summarize() method |
Groups rows in a dataframe based on identical values in one or more columns. Counts the number of rows in each group, and also subtotals specified numeric columns for each group. |