Comparison of data combining methods
The advantages and disadvantages of the different data combining methods in Analytics are outlined below.
Note
Appending, extracting and appending, and merging are compared because these methods combine tables with identical or similar record structures.
Joining is compared with relating because these two methods combine tables with different record structures.
For more information, see Data structure and data format requirements.
Appending, extracting and appending, and merging
Requirement/Capability |
Appending |
Extracting and Appending |
Merging |
---|---|---|---|
Tables being combined must have an identical data structure | No |
Yes |
Yes |
Resulting combined table is sorted |
No Records extracted from source tables are appended as groups in the output table. |
No Records extracted from the source table are appended as a group to the end of the target table. |
Yes Records from both tables are inserted into a new, third table based on a sort order. |
Access and analyze data from two tables |
Yes |
Yes |
Yes |
Access and analyze data from more than two tables |
Yes |
Not supported by a single extract and append operation. Requires multiple operations. |
Not supported by a single merge operation. Requires multiple operations. |
Outputs results to a new, physically separate Analytics table |
Yes |
No |
Yes |
Key fields in both tables must be:
|
Not applicable Appending does not use key fields. |
Not applicable Extracting and appending does not use key fields. |
Yes |
Number of key fields |
Not applicable Appending does not use key fields. |
Not applicable Extracting and appending does not use key fields. |
One or more key fields can be selected from each table. |
Joining and relating
Note
If the tables that you want to combine have identical record structures, you should probably use appending, extracting and appending, or merging.
Comparison of capabilities
Capability |
Joining |
Relating |
---|---|---|
Use case | Good for as a preliminary step for investigative work because it outputs a permanently joined new third table. | Good for informational work because it creates a virtual table without any requirement that it be made permanent. |
Access and analyze data from two tables simultaneously |
Yes |
Yes |
Access and analyze data from more than two tables simultaneously |
No Not supported by a single join operation. Requires multiple join operations. |
Yes A single relation operation supports access/analysis of up to 18 tables simultaneously. |
Outputs results to a new, physically separate Analytics table |
Yes |
No If required, you can perform a separate operation and extract any combination of fields from the related tables to a new table. |
Number of key fields |
One or more key fields can be selected from each table. |
Limited to one key field per table pair If more than one key field is required to establish an accurate relation between a table pair, create a computed field in each table to concatenate the required key fields. |
Execution speed of operation |
Slower The duration of the join operation varies depending on the complexity of the join, and whether or not the primary table is sorted. |
Faster No actual record matching is done during the relation operation. For this reason, it takes considerably less time than joining. |
Subsequent processing of a file |
Faster The results of joining are stored in a flat file (.fil source data file). Flat files can be processed very quickly. |
Slower Record matching between related tables is done at the time of subsequent processing, which adds to processing time. |
Updateable from source data files |
No Join results are sent to a new, third table with a new source data file no longer associated with the source data files involved in the join. |
Yes Related tables remain associated with, and can be refreshed from, the source data files involved in the relation. |
Matched primary and secondary records (1st secondary match) |
Yes |
Not directly supported After relating tables, use filters to isolate primary records that have matching secondary records. |
Matched primary and secondary records (all secondary matches) Also called many-to-many matching |
Yes |
No |
Unmatched primary records |
Yes |
Not directly supported After relating tables, use filters to isolate primary records that do not have matching secondary records. |
All primary records and matched secondary records |
Yes |
Yes |
All secondary records and matched primary records |
Yes |
No |
All primary and secondary records, matched and unmatched |
Yes |
No |
Comparison of requirements
Requirement |
Joining |
Relating |
---|---|---|
Tables being combined must have an identical data structure | No | No |
Key field data types must be identical for each table pair |
Varies Not required for character-numeric or numeric-character joins. Automatically harmonized by Analytics. Required for all other possibilities. |
Yes |
Key field lengths must be identical for each table pair |
Recommended (not enforced) Lengths of two character key fields automatically harmonized by Analytics. |
Recommended (not enforced) |
Required disk space for processing |
More Joining creates a new, third table that can be larger than both original tables combined, depending on the nature of the join. |
Less Minimal disk space is required to create an index for the child table(s). |
Tables must be sorted or indexed |
Sort, Presort, or Index required for secondary table, optional for primary table. |
Index required for child tables (created automatically when relating tables), Sort or Index optional for primary table. |