Sample from Worksheet dialog

Data Range

Type the address of the data range directly into this field, or use the reference button to select the range in the worksheet. If the active cell (cell pointer) is already inside the data range, Analytic Solver automatically picks up the contiguous data range surrounding it. After the data range is selected, Analytic Solver displays the number of records in the selected range.

First Row Contains Headers

When this box is checked, Analytic Solver takes the variable names from the first row of the selected data range. When the box is unchecked, Analytic Solver uses its default naming convention: the variable in the first column of the selected range is called "Var1", the variable in the second column "Var2", and so on.
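
The naming rule above can be illustrated with a short Python sketch. This is not Analytic Solver code; the helper names `default_variable_names` and `split_headers` are hypothetical and exist only for this illustration.

```python
def default_variable_names(n_columns):
    """Return placeholder names Var1, Var2, ... for a range without headers."""
    return [f"Var{i}" for i in range(1, n_columns + 1)]


def split_headers(rows, first_row_contains_headers):
    """Separate variable names from data records.

    rows is a list of rows, each row a list of cell values.
    """
    if first_row_contains_headers:
        # The first row supplies the names; the remaining rows are records.
        return [str(h) for h in rows[0]], rows[1:]
    # No header row: every row is a record and the names are generated.
    return default_variable_names(len(rows[0])), rows


rows = [["Age", "Income"], [34, 52000], [41, 61000]]
names, records = split_headers(rows, first_row_contains_headers=True)
```

With the box unchecked, the same range would be treated as three records with the generated names Var1 and Var2.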

Variables

This list box contains the names of the variables in the selected data range. If the first row of the range contains headers, those names appear here; otherwise, Analytic Solver lists the variables using its default naming convention (Var1 for the first column, Var2 for the second, and so on). To select a variable for sampling, select it in the list, then click the ">" button. Hold down the CTRL key to select multiple variables.

Sample with Replacement

If this option is checked, the data is sampled with replacement. By default, the data is sampled without replacement.

Set Seed

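Enter the seed for the random number generator used in sampling. Using the same seed on the same data and settings reproduces the same sample. The default seed is 12345.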

Desired sample size

Enter the desired sample size here. (Note that the actual sample size in the output may vary a little, depending on additional options selected.)

Simple random sampling

The data is sampled using the simple random sampling technique, taking into account the additional parameter settings.
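
As a rough illustration of how the seed, the sample size, and the replacement option interact, here is a minimal Python sketch using only the standard library. It is not Analytic Solver's implementation: the function name `simple_random_sample` is hypothetical, and Python's generator will not reproduce the samples that Analytic Solver draws for the same seed.

```python
import random


def simple_random_sample(records, sample_size, with_replacement=False, seed=12345):
    """Draw a simple random sample from a list of records.

    The default seed mirrors the dialog's default of 12345; the generator
    itself is Python's, which is an assumption of this sketch.
    """
    rng = random.Random(seed)  # same seed and inputs give the same sample
    if with_replacement:
        # A record may be drawn more than once.
        return rng.choices(records, k=sample_size)
    # Mirrors the dialog rule: without replacement, the sample size
    # must be less than the number of records.
    if sample_size >= len(records):
        raise ValueError("sample_size must be less than the number of records")
    return rng.sample(records, sample_size)


records = list(range(100))
sample = simple_random_sample(records, 10)
```

Calling the function twice with the same seed returns the same records, which is the purpose of setting a seed.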

Stratified random sampling

If selected, Analytic Solver Data Science enables the following additional options.

Stratum Variable

Select the variable to be used for stratified random sampling by clicking the down arrow and selecting the desired variable. (Note: Analytic Solver Comprehensive and Data Science support an unlimited number of variables, each having an unlimited number of distinct values. Analytic Solver Basic supports variables with 2 to 30 distinct values.) When a variable is selected, Analytic Solver displays the number of strata (#Strata) that the variable contains in a box to the left, and the smallest stratum size in a field beside the option Equal from each stratum, #records = smallest stratum size.

Proportionate to stratum size

Analytic Solver detects the proportion of each stratum in the dataset and preserves those proportions in the sample.
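
Proportionate allocation can be sketched in Python as follows. This is an illustration under stated assumptions, not Analytic Solver's algorithm: the function name `proportionate_stratified_sample`, the `stratum_of` callback, and the use of rounding to set each stratum's quota are all choices made for this example. Rounding is one reason an actual sample size can differ slightly from the desired one.

```python
import random
from collections import defaultdict


def proportionate_stratified_sample(records, stratum_of, sample_size, seed=12345):
    """Sample without replacement, keeping each stratum's share of the data.

    stratum_of is a function that returns the stratum value of a record.
    """
    rng = random.Random(seed)
    strata = defaultdict(list)
    for record in records:
        strata[stratum_of(record)].append(record)

    total = len(records)
    sample = []
    for members in strata.values():
        # Quota proportional to the stratum's share of the dataset (assumed rounding).
        quota = round(sample_size * len(members) / total)
        sample.extend(rng.sample(members, min(quota, len(members))))
    return sample


# 60% "A", 30% "B", 10% "C"
data = [("A", i) for i in range(60)] + [("B", i) for i in range(30)] + [("C", i) for i in range(10)]
strat_sample = proportionate_stratified_sample(data, lambda r: r[0], 20)
```

For this dataset and a desired sample size of 20, the quotas work out to 12, 6, and 2 records, matching the 60/30/10 split.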

Equal from each stratum, please specify # records

When the number of records is specified, Analytic Solver generates a sample containing that number of records from each stratum. Because this number determines the total sample size, the option to enter the desired sample size is disabled.

Equal from each stratum, #records = smallest stratum size

Analytic Solver detects the smallest stratum size and generates a sample in which every stratum is represented by that many records. If this option is selected, Sample with replacement and Desired sample size are both disabled.
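
The two "Equal from each stratum" options can be sketched together in Python. As with any illustration of this kind, it is not Analytic Solver's implementation: the function name `equal_stratified_sample` and its parameters are hypothetical, and treating the smallest-stratum option as sampling without replacement is an assumption based on the Sample with replacement option being disabled.

```python
import random
from collections import defaultdict


def equal_stratified_sample(records, stratum_of, per_stratum=None,
                            with_replacement=False, seed=12345):
    """Draw the same number of records from every stratum.

    per_stratum=None stands for the option
    "#records = smallest stratum size".
    """
    rng = random.Random(seed)
    strata = defaultdict(list)
    for record in records:
        strata[stratum_of(record)].append(record)

    if per_stratum is None:
        per_stratum = min(len(members) for members in strata.values())
        with_replacement = False  # assumption: this option samples without replacement

    sample = []
    for members in strata.values():
        if with_replacement:
            sample.extend(rng.choices(members, k=per_stratum))
        else:
            # Raises ValueError if a stratum has fewer than per_stratum records.
            sample.extend(rng.sample(members, per_stratum))
    return sample


data = [("A", i) for i in range(60)] + [("B", i) for i in range(30)] + [("C", i) for i in range(10)]
smallest = equal_stratified_sample(data, lambda r: r[0])
fixed = equal_stratified_sample(data, lambda r: r[0], per_stratum=5)
```

In both cases the total sample size is the per-stratum count multiplied by the number of strata, which is why a separate desired sample size is not needed.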

Analytic Solver can perform stratified random sampling with or without replacement. If Sample with replacement is not selected, the desired sample size must be less than the number of records in the dataset.

If Sample with Replacement is selected, Analytic Solver is limited to 1,000,000 records in the sample output.