Sample selection methods

Sample selection methods are the specific methods used to select the records contained in a sample.

For record sampling and monetary unit sampling, Analytics supports three sample selection methods:

  • fixed interval
  • cell
  • random

For classical variables sampling, the random selection method is the only possibility.

Sample selection method versus sampling type

It is important to understand the distinction between sample selection method and sampling type.

Sampling type refers to the overall statistical method used to arrive at an estimate about a population.

Sample selection method refers to the way in which records are drawn from a population for inclusion in a sample.

Sampling type Sample selection methods available Details
Record sampling
  • fixed interval
  • cell
  • random
The records contained in the sample are directly selected
Monetary unit sampling
  • fixed interval
  • cell
  • random
The records contained in the sample are those that correspond to the selected monetary units
Classical variables sampling
  • random
The records contained in the sample are directly selected

Fixed interval selection method

With the fixed interval selection method, an initial monetary unit or record is selected, and all subsequent selections are a fixed interval or distance apart – for example, every 5000th monetary unit, or every 20th record, after the initial selection.

Cell selection method

With the cell selection method, the data set is divided into multiple equal-sized cells or groups, and one monetary unit, or one record, is randomly selected from each cell.

Random selection method

With the random selection method, all monetary units or records are randomly selected from the entire data set, or from each stratum if you are using classical variables sampling.

Add a record number field

You may find it useful to add a record number field to the Analytics table from which you are drawing a sample. After you draw the sample, the specific record numbers that were selected from the source table are displayed in the output table containing the sample.

Note

A record number field is automatically included in the output table when you use classical variables sampling.